Loading…

AI & Machine Learning News Hub

Research, releases, and applied work in AI & ML

What's New

Top 5 Across All Sources
  1. Lobachevsky’s integral formula

    John D. Cook · 14h ago
  2. Encoding Categorical Data for Outlier Detection

    Towards Data Science · 17h ago
  3. How to Use Claude Code in Your Browser

    Towards Data Science · 19h ago
  4. GLM-5.2 is the step change for open agents

    Interconnects AI · 19h ago
Latest
Hugging Face - BlogShipping huggingface_hub every week with AI, open tools, and a human in the loopJohn D. CookLobachevsky’s integral formulaTowards Data ScienceEncoding Categorical Data for Outlier DetectionTowards Data ScienceHow to Use Claude Code in Your BrowserInterconnects AIGLM-5.2 is the step change for open agentsTowards Data ScienceWhen RAG Users Ask Vague Questions: Clarify Once, Learn the DefaultHugging Face - BlogPP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M ParametersTowards Data ScienceNeural Networks, Explained for Beginners: Start Here If They’ve Confused YouJohn D. CookQueens on a prime order boardHugging Face - BlogWe got local models to triage the OpenClaw repo for FREE!*Towards Data ScienceTool Calling, Explained: How AI Agents Decide What to Do NextTowards Data ScienceReconstructing the Table of Contents a PDF Forgot to Ship, So RAG Can Scope by SectionTowards Data ScienceWhat Are the Possibilities to Build Date Tables in Self-Service Environments?Eugene YanPatterns for Building Cybersecurity EvalsJohn D. CookAll pieces on a 6 by 5 boardTowards Data Science7 Crucial Barriers Between Data Teams and Self-Healing Data ArchitectureTowards Data ScienceMaking a PDF’s Images Searchable for RAG, Without Paying to Read Them AllTowards Data ScienceMaterialized Lake Views in Microsoft Fabric: When Your Medallion Fits in a SELECT StatementTowards Data SciencePython 3.14 and its New JIT CompilerTowards Data ScienceBuilding a Custom GStreamer Plugin for NVIDIA DeepStreamHugging Face - BlogShipping huggingface_hub every week with AI, open tools, and a human in the loopJohn D. CookLobachevsky’s integral formulaTowards Data ScienceEncoding Categorical Data for Outlier DetectionTowards Data ScienceHow to Use Claude Code in Your BrowserInterconnects AIGLM-5.2 is the step change for open agentsTowards Data ScienceWhen RAG Users Ask Vague Questions: Clarify Once, Learn the DefaultHugging Face - BlogPP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M ParametersTowards Data ScienceNeural Networks, Explained for Beginners: Start Here If They’ve Confused YouJohn D. CookQueens on a prime order boardHugging Face - BlogWe got local models to triage the OpenClaw repo for FREE!*Towards Data ScienceTool Calling, Explained: How AI Agents Decide What to Do NextTowards Data ScienceReconstructing the Table of Contents a PDF Forgot to Ship, So RAG Can Scope by SectionTowards Data ScienceWhat Are the Possibilities to Build Date Tables in Self-Service Environments?Eugene YanPatterns for Building Cybersecurity EvalsJohn D. CookAll pieces on a 6 by 5 boardTowards Data Science7 Crucial Barriers Between Data Teams and Self-Healing Data ArchitectureTowards Data ScienceMaking a PDF’s Images Searchable for RAG, Without Paying to Read Them AllTowards Data ScienceMaterialized Lake Views in Microsoft Fabric: When Your Medallion Fits in a SELECT StatementTowards Data SciencePython 3.14 and its New JIT CompilerTowards Data ScienceBuilding a Custom GStreamer Plugin for NVIDIA DeepStream

By Source

Feeds organized so you can skim by site.

Density Sort
HF
Hugging Face - Blog
10h ago · 20 items
Shipping huggingface_hub every week with AI, open tools, and a human in the loop 10h ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters 20h ago A Blog post by PaddlePaddle on Hugging Face We got local models to triage the OpenClaw repo for FREE!* 1d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. MosaicLeaks: Can your research agent keep a secret? 4d ago A Blog post by ServiceNow on Hugging Face Beyond LoRA: Can you beat the most popular fine-tuning technique? 5d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. Is it agentic enough? Benchmarking open models on your own tooling 5d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. MolmoMotion: Language-guided 3D motion forecasting 5d ago A Blog post by Ai2 on Hugging Face From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot 5d ago A Blog post by Amazon on Hugging Face GLM-5.2: Built for Long-Horizon Tasks 6d ago A Blog post by Z.ai on Hugging Face Agentic Resource Discovery: Let agents search 6d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science.
20 loaded
JD
John D. Cook
14h ago · 20 items
20 loaded
TD
Towards Data Science
17h ago · 20 items
Encoding Categorical Data for Outlier Detection 17h ago How to Use Claude Code in Your Browser 19h ago When RAG Users Ask Vague Questions: Clarify Once, Learn the Default 20h ago Neural Networks, Explained for Beginners: Start Here If They’ve Confused You 22h ago Tool Calling, Explained: How AI Agents Decide What to Do Next 1d ago Reconstructing the Table of Contents a PDF Forgot to Ship, So RAG Can Scope by Section 1d ago What Are the Possibilities to Build Date Tables in Self-Service Environments? 1d ago 7 Crucial Barriers Between Data Teams and Self-Healing Data Architecture 2d ago Making a PDF’s Images Searchable for RAG, Without Paying to Read Them All 2d ago Materialized Lake Views in Microsoft Fabric: When Your Medallion Fits in a SELECT Statement 2d ago
20 loaded
IA
Interconnects AI
19h ago · 20 items
20 loaded
EY
Eugene Yan
2d ago · 20 items
20 loaded
DS
David Stutz
8d ago · 10 items
Domain-Specific AI Should Focus on Workflows Rather Than Modeling 8d ago I spent the past few years working on AI for health. Starting with custom multimodal encoders, post-training, and sophisticated multi-agent architecturs, I now see modeling work becoming less and less important for domain-specific applicati... AI Evaluation is Becoming an Exciting Standalone Discipline 37d ago Having worked on robustness problems during my PhD, I see many of the characteristics appearing in the evaluation of LLMs and AI systems. Adversarial attacks such as jailbreaks are becoming more relevant, edge cases finally become relevant,... RAISE 2025 Panel Statement on Aligning AI to Clinical Values 261d ago Recently, I attended the Responsible AI for Social and Ethical Healthcare 2025 “2.0” Symposium organized by, among others, Harvard Medical School. The symposium featured various panels on topics surrounding generative AI, in particular mult... Some Lessons on Reviews and Rebuttals 505d ago Writing and responding to reviews is the bread and butter of any academic and especially in AI research, PhD students are confronted with both rather early compared to other displicines. Unfortunately, I found that drafting reviews and rebu... Thoughts on Watermarking AI-Generated Content 523d ago Watermarking AI-generated content has the potential to address various problems that generative AI threatens to aggravate — misinformation, impersonation, copyright infringement, web pollution, etc. However, it is also controversial with ma... Thoughts and Lessons for Planning Rater Studies in AI 532d ago With the goal of deploying generative AI systems, rater studies are becoming increasingly common and important. This means more and more researchers and engineers face the challenge of actually planning and conducting rater studies for AI s... Open-Sourcing Relabeled MedQA and Dermatology DDx Datasets 587d ago Dealing with rater disagreement is becoming more important in AI, especially for LLMs and in specialized domains such as health. In the past year, I helped open source two datasets allowing to study rater disagreement in the health domain: ... Thinking About Research Ideas vs. Technology 589d ago In this article, I want to share some thoughts on the difference between research ideas and technology, particularly in machine learning. This distinction is have been contemplating since starting my PhD. After joining Google DeepMind and b... The Importance of Effectively Experimenting in an AI PhD 714d ago Engineering and running experiments are a key component of most PhDs in AI. While there are plenty of more theoretical topics that are often limited to smaller scale experimentation, the trend has definitely been to scale up models, dataset... FAQ for our Monte Carlo Conformal Prediction 771d ago Over the past months, I have given several talks about Monte Carlo conformal prediction and the problem of calibrating with uncertain ground truth, for example, stemming from annotator disagreement. Each time, the audience had great questio...
OU
One Useful Thing
13d ago · 20 items
20 loaded
AO
Ahead of AI
16d ago · 20 items
20 loaded
SAP Sapphire 2026: The Complete Breakdown 32d ago All 25 announcements from Orlando, what's actually shipping vs. what's marketing — and what it means for the document and data foundation of the autonomous enterprise. SAP's Sapphire 2026 in Orlando was the most AI-dense keynote in the comp... Claude for Legal Teams: Contract Review, Compliance and Due Diligence 62d ago See how the Claude legal plugin helps in-house legal teams with contract review, compliance scanning, due diligence, obligations tracking, and drafting. Vibe Coding Best Practices: 5 Claude Code Habits for Better Agentic Coding 67d ago Learn 5 practical vibe coding best practices for Claude Code and coding agents: CLAUDE.md, planning, review agents, safer prompts, and diff review. AI Benchmarks Explained: GPQA, SWE-bench, Chatbot Arena and What They Actually Measure 73d ago Learn what MMLU, GPQA Diamond, SWE-bench, HealthBench, and Chatbot Arena actually measure, and how labs game benchmark scores. Why AI-Native IDP Platforms Outperform ABBYY and Kofax in Modern Document Workflows 74d ago Evaluating IDP vendors? Compare Nanonets vs ABBYY and Kofax across architecture, operating model, and TCO to see why AI-native wins for IDP. Why You Hit Claude Limits So Fast: AI Token Limits Explained 76d ago Learn what AI tokens are, why Claude hits limits fast, and how to cut waste from context windows, history, files, tools, and reasoning. Did Google's TurboQuant Actually Solve AI Memory Crunch? 81d ago Google’s TurboQuant promises 6x KV-cache compression. Here’s what it means for AI memory, HBM demand, and the broader memory crunch. Claude for Finance Teams: Investment Banking, DCF Models, Reconciliation & Variance Analysis 91d ago See how finance teams use Claude for one-pagers, CIMs, comps, DCF models, reconciliations, and variance commentary, plus key human checks. AI Agent Hacks McKinsey: 5 Situations When You Should Not Deploy Agents 100d ago McKinsey hacked in 2 hours. 5 situations where AI agents will fail. Production permissions, regulated data, legacy systems—check before deploy. Are OpenAI and Google intentionally downgrading their models? 103d ago Yes, OpenAI and Google degrade their models. OpenAI admitted silent updates after denying it. Gemini redirects models. With full evidence.
15 loaded
AW
AI Weirdness
82d ago · 15 items
Get working on your April Fools Eiffel Tower 82d ago Elevator Surprise: Place a tiny camera in the elevator, and when someone gets in, snap a photo saying, "Welcome to Space Station!" Or build a miniature model of the Eiffel Tower next to it for a dramatic effect. Tower of Pancakes: Create a ... Bonus: More April Fools pranks from Eiffel Tower Llama 82d ago AI Weirdness: the strange side of machine learning When a chatbot runs your store 185d ago You may have heard of people hooking up chatbots to controls that do real things. The controls might run internet searches, run commands to open and read documents and spreadsheets, or even edit or delete entire databases. Whether this soun... Bonus: Incorrect Christmas Carols 186d ago AI Weirdness: the strange side of machine learning Tiny neural net Halloween costumes are the best 237d ago I've been experimenting with getting a tiny circa-2015 recurrent neural network to generate Halloween costumes. Running on a single cat hair-covered laptop, char-rnn has no internet training, but learns from scratch to imitate the data I gi... More tiny neural net costumes 237d ago AI Weirdness: the strange side of machine learning Halloween costumes by tiny neural net 248d ago I've recently been experimenting with one of my favorite old-school neural networks, a tiny program that runs on my laptop and knows only about the data I give it. Without internet training, char-rnn doesn't have outside references to draw ... Bonus: more halloween costumes from tiny neural net 248d ago AI Weirdness: the strange side of machine learning Botober 2025: Terrible recipes from a tiny neural net 265d ago After seeing generated text evolve from the days of tiny neural networks to today's ChatGPT-style large language models, I have to conclude: there's something special about the tiny guys. Maybe it's the way the tiny neural networks string t... Bonus: Char-rnn's jello creations 265d ago AI Weirdness: the strange side of machine learning
15 loaded
DB
Damian Bogunowicz - dtransposed
123d ago · 10 items
AK
Andrej Karpathy blog
131d ago · 10 items
SR
Salmon Run
141d ago · 20 items
Book Review: Software Engineering for Data Scientists 141d ago As a Software Engineer (backend Web Development then Search) turned Data Scientist, I was particularly interested in what the book Software ... Book Review: Transformers In Action 163d ago The Attention Is All You Need paper proposed the Transformer Architecrture as an improvement to the dominant encoder-decoder models of the ... Trip Report: PyData Global 2025 178d ago I attended PyData Global 2025 earlier this month. I had hoped to write this up earlier, but I've been busy, so only now getting the time Ch... Book Review: Time Series Forecasting using Foundation Models 253d ago As someone who primarily works in NLP and Search in the Health Domain, I don't have much use for Time Series. However, while exploring the F... Book Review: Statistics every Programmer Needs 275d ago I recently read Statistics every Programmer Needs by Gary Sutton. I am probably a good target audience for the book since I used to be a so... Book Review: Hands-On Artificial Intelligence for IoT 359d ago For those in similar professional circles as I am in, i.e. looking forward into the Generative AI space, yet with one foot pragmatically and... Book Review: Essential Graph RAG 372d ago Coming from a background of Knowledge Graph (KG) backed Medical Search, I don't need to be convinced about the importance of manually curate... Packaging ML Pipelines from Experiment to Deployment 538d ago As an ML Engineer, we are generally tasked with solving some business problem with technology. Typically it involves leveraging data assets ... Trip Report - PyData Global 2024 561d ago I attended PyData Global 2024 last week. Its a virtual conference, so I was able to attend it from the comfort of my home, although presenta... Using Knowledge Graphs to enhance Retrieval Augmented Generation 625d ago Retrieval Augmented Generation (RAG) has become a popular approach to harness LLMs for question answering using your own corpus of data. Typ...
20 loaded
LL
Lil'Log
418d ago · 20 items
Why We Think 418d ago Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute (Graves et al. 2016, Ling, et al. 2017, Cobbe et al. 2021) and Chain-of-thought (CoT) (Wei et al. 2022, Nye et al. 2021), ... Reward Hacking in Reinforcement Learning 572d ago Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing the intended task. Reward hacking exists because RL enviro... Extrinsic Hallucinations in LLMs 716d ago Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucination has been somewhat generalized to cases when the model makes mistakes. Here,... Diffusion Models for Video Generation 802d ago Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video generation. The task itself is a superset of the image case, since an ima... Thinking about High-Quality Human Data 869d ago [Special thank you to Ian Kivlichan for many useful pointers (E.g. the 100+ year old Nature paper “Vox populi”) and nice feedback. 🙏 ] High-quality data is the fuel for modern data deep learning model training. Most of the task-specific lab... Adversarial Attacks on LLMs 972d ago The use of large language models in the real world has strongly accelerated by the launch of ChatGPT. We (including my team at OpenAI, shoutout to them) have invested a lot of effort to build default safe behavior into the model during the ... LLM Powered Autonomous Agents 1096d ago Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The potentiality of LLM extends beyond genera... Prompt Engineering 1196d ago Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empirical science and the effect of prompt eng... The Transformer Family Version 2.0 1243d ago Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a big refactoring and enrichment of that 2020 post — restructure the hierarchy of sections an... Large Transformer Model Inference Optimization 1259d ago [Updated on 2023-01-24: add a small section on Distillation.] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. They are powerful but very expensive to train and use. The extremely high inferenc...
20 loaded
DA
Datumbox
422d ago · 20 items
20 loaded
JA
Jay Alammar
454d ago · 10 items
Moving To Substack 454d ago I’m freezing this blog and starting to post on my Substack instead. The authoring experience is much more convenient for me there. Please follow me there, and check out The Illustrated DeepSeek R-1 if you haven’t yet. And check out our How ... Generative AI and AI Product Moats 1141d ago Here are eight observations I’ve shared recently on the Cohere blog and videos that go over them.: Article: What’s the big deal with Generative AI? Is it the future or the present? Article: AI is Eating The World Remaking Old Computer Graphics With AI Image Generation 1269d ago Can AI Image generation tools make re-imagined, higher-resolution versions of old video game graphics? Over the last few days, I used AI image generation to reproduce one of my childhood nightmares. I wrestled with Stable Diffusion, Dall-E ... The Illustrated Stable Diffusion 1358d ago Translations: Chinese, Vietnamese. (V2 Nov 2022: Updated images for more precise description of forward diffusion. A few more images in this version) AI image generation is the most recent AI capability blowing people’s minds (mine included... Applying massive language models in the real world with Cohere 1569d ago A little less than a year ago, I joined the awesome Cohere team. The company trains massive language models (both GPT-like and BERT-like) and offers them as an API (which also supports finetuning). Its founders include Google Brain alums in... The Illustrated Retrieval Transformer 1632d ago Discussion: Discussion Thread for comments, corrections, or any feedback. Translations: Korean, Russian Summary: The latest batch of language models can be much smaller yet achieve GPT-3 like performance by being able to query a database or... Explainable AI Cheat Sheet 1876d ago Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their predictions. I introduce the cheat sheet in this brief video: Finding the Words to Say: Hidden State Visualizations for Language Models 1981d ago By visualizing the hidden state between a model's layers, we can get some clues as to the model's Interfaces for Explaining Transformer Language Models 2014d ago Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries generated by a language model Tap or hover over the output tokens: Explorable #2: ... How GPT3 Works - Visualizations and Animations 2157d ago Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russian, Turkish The tech world is abuzz with GPT3 hype. Massive language models (lik...
CH
Chip Huyen
523d ago · 10 items
SS
Seita's Place
1103d ago · 10 items
My Faculty Application Experience 1103d ago I spent roughly a year preparing, and then interviewing, for tenure-trackfaculty positions. My job search is finally done, and I am joining theUniversity of ... Books Read in 2022 1269d ago At the end of every year I have a tradition where I write summaries of thebooks that I read throughout the year. Unfortunately this year wasexceptionally bus... Conference on Robot Learning 2022 1273d ago The airplanes on display at the CoRL 2022 banquet. The 2022 Robotics: Science and Systems Conference 1289d ago A photo I took while at RSS 2022 in New York City, on the dinner cruise arranged by the conference. The (In-Person) ICRA 2022 Conference in Philadelphia 1419d ago A photo I took while at ICRA 2022 in Philadelphia. This is the Two New Papers: Learning to Fling and Singulate Fabrics 1425d ago The system for our IROS 2022 paper on singulating layers of cloth with tactile sensing. A Plea to End Harassment 1437d ago Scott Aaronson is a professor of computer science at UT Austin, where hisresearch area is in theoretical computer science. However, he may be more wellknown ... My Paper Reviewing Load 1521d ago This is a regularly updated post, last updated June 13, 2026. I Stand with Ukraine 1578d ago I stand with Ukraine and firmly oppose Vladimir Putin’s invasion. Books Read in 2021 1634d ago At the end of every year I have a tradition where I write summaries of thebooks that I read throughout the year. Here’s the following post with the roughset ...
MI
ML in Production
1506d ago · 10 items
Driving Experimentation Forward through a Working Group (Experimentation Program Series: Guide 03) 1506d ago We describe how diverse stakeholders can drive experimentation forward through the formation of a working group and what role data science plays. What is an Experimentation program and Who is Involved? (Experimentation Program Series: Guide 02) 1542d ago We define what an experimentation program is and discuss which stakeholder groups should participate in order to drive experimentation forward. Building An Effective Experimentation Program (Experimentation Program Series: Guide 01) 1555d ago An introduction to building an effective experimentation program at your company. Lessons Learned from Writing Online 1598d ago Where I share my story writing MLinProduction and key metrics from building my audience and monetization. Newsletter #087 2029d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #086 2037d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #085 2043d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #084 2050d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #083 2057d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #082 2064d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems.
SB
Software 2.0 3145d ago Software 2.0 I sometimes see people refer to neural networks as just “another tool in your machine learning toolbox”. They have some pros and cons, they work here or there, and sometimes you can … AlphaGo, in context 3309d ago AlphaGo, in context Update Oct 18, 2017: AlphaGo Zero was announced. This post refers to the previous version. 95% of it still applies. I had a chance to talk to several people about the recent … ICML accepted papers institution stats 3316d ago ICML accepted papers institution stats The accepted papers at ICML have been published. ICML is a top Machine Learning conference, and one of the most relevant to Deep Learning, although NIPS has a … A Peek at Trends in Machine Learning 3363d ago A Peek at Trends in Machine Learning Have you looked at Google Trends? It’s pretty cool — you enter some keywords and see how Google Searches of that term vary through time. I thought — hey, I … ICLR 2017 vs arxiv-sanity 3388d ago ICLR 2017 vs arxiv-sanity I thought it would be fun to cross-reference the ICLR 2017 (a popular Deep Learning conference) decisions (which fall into 4 categories: oral, poster, workshop, reject) with … Virtual Reality: still not quite there, again. 3443d ago Virtual Reality: still not quite there, again. The first time I tried out Virtual Reality was a while ago — somewhere in the late 1990's. I was quite young so my memory is a bit hazy, but I … Yes you should understand backprop 3472d ago Yes you should understand backprop When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to include explicit calculations involved in … CS183c Assignment #3 3873d ago CS183c Assignment #3 The last few weeks we heard from several excellent guests, including Selina Tobaccowala from Survey Monkey, Patrick Collison from Stripe, Nirav Tolia from Nextdoor, Shishir …

No matching sources found.