AI & Machine Learning Blogs ✍️ - Machine Learning News Hub

Shipping huggingface_hub every week with AI, open tools, and a human in the loop Hugging Face - Blog · 10h ago

Lobachevsky’s integral formula John D. Cook · 14h ago

Encoding Categorical Data for Outlier Detection Towards Data Science · 17h ago

How to Use Claude Code in Your Browser Towards Data Science · 19h ago

GLM-5.2 is the step change for open agents Interconnects AI · 19h ago

When RAG Users Ask Vague Questions: Clarify Once, Learn the Default Towards Data Science · 20h ago

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters Hugging Face - Blog · 20h ago

Neural Networks, Explained for Beginners: Start Here If They’ve Confused You Towards Data Science · 22h ago

Queens on a prime order board John D. Cook · 1d ago

We got local models to triage the OpenClaw repo for FREE!* Hugging Face - Blog · 1d ago

Tool Calling, Explained: How AI Agents Decide What to Do Next Towards Data Science · 1d ago

Reconstructing the Table of Contents a PDF Forgot to Ship, So RAG Can Scope by Section Towards Data Science · 1d ago

What Are the Possibilities to Build Date Tables in Self-Service Environments? Towards Data Science · 1d ago

Patterns for Building Cybersecurity Evals Eugene Yan · 2d ago

All pieces on a 6 by 5 board John D. Cook · 2d ago

7 Crucial Barriers Between Data Teams and Self-Healing Data Architecture Towards Data Science · 2d ago

Making a PDF’s Images Searchable for RAG, Without Paying to Read Them All Towards Data Science · 2d ago

Materialized Lake Views in Microsoft Fabric: When Your Medallion Fits in a SELECT Statement Towards Data Science · 2d ago

Python 3.14 and its New JIT Compiler Towards Data Science · 3d ago

Building a Custom GStreamer Plugin for NVIDIA DeepStream Towards Data Science · 3d ago

I Tried to Schedule My ETL Pipeline. Here’s What I Didn’t Expect. Towards Data Science · 3d ago

Parse Scanned PDFs for RAG with EasyOCR: Free OCR Gives You Words, Not a Document Towards Data Science · 3d ago

Banning Open Source AI Would Be A Mistake Interconnects AI · 3d ago

GPU-Resident Top-K for Agentic RAG: I Built a CUDA Kernel So My Retrieval Step Would Stop Bouncing Off the GPU Towards Data Science · 3d ago

MosaicLeaks: Can your research agent keep a secret? Hugging Face - Blog · 4d ago

Structured Outputs with LLMs: JSON Mode, Function Calling, and When to Use Each Towards Data Science · 4d ago

How Powerful is Claude Fable (Mythos) 5 for Coding? Towards Data Science · 4d ago

Proteins: A Mosaic Pattern to Rule Them All? Towards Data Science · 4d ago

Dispatching the Parsed RAG Question: Chunk Strategy, Model Tier, Activations, Audit Towards Data Science · 4d ago

The Power and Pitfalls of Vector-Based Image Search Towards Data Science · 4d ago

Beyond LoRA: Can you beat the most popular fine-tuning technique? Hugging Face - Blog · 5d ago

Is it agentic enough? Benchmarking open models on your own tooling Hugging Face - Blog · 5d ago

MolmoMotion: Language-guided 3D motion forecasting Hugging Face - Blog · 5d ago

State of the blog, mid-2026 Interconnects AI · 5d ago

Formalizing a ring theorem with Lean 4 and Claude John D. Cook · 5d ago

From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot Hugging Face - Blog · 5d ago

GLM-5.2: Built for Long-Horizon Tasks Hugging Face - Blog · 6d ago

Agentic Resource Discovery: Let agents search Hugging Face - Blog · 6d ago

Partial fraction decomposition John D. Cook · 6d ago

Frontier post-training recipe review with Finbarr Timbers Interconnects AI · 6d ago

Three examples suffice John D. Cook · 6d ago

Testing pentagonal numbers John D. Cook · 7d ago

Quaternion Rotations, Claude, and Lean John D. Cook · 7d ago

Writing Prolog with ChatGPT John D. Cook · 7d ago

Welcome to the AGI era of AI governance Interconnects AI · 8d ago

Domain-Specific AI Should Focus on Workflows Rather Than Modeling David Stutz · 8d ago

RSA munitions T-shirt John D. Cook · 9d ago

Solving a chess puzzle with Claude and Prolog John D. Cook · 11d ago

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP Hugging Face - Blog · 12d ago

Formally proving a calculation with Claude and Lean John D. Cook · 12d ago

Pulling on a thread John D. Cook · 12d ago

Claude Fable 5 and new AI safety fables Interconnects AI · 13d ago

What it feels like to work with Mythos One Useful Thing · 13d ago

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces Hugging Face - Blog · 13d ago

Migrating Your GitHub CI to Hugging Face Jobs Hugging Face - Blog · 14d ago

The Open Source Community is backing OpenEnv for Agentic RL Hugging Face - Blog · 15d ago

Aitken acceleration before Aitken John D. Cook · 15d ago

The Laplace limit John D. Cook · 15d ago

A crank formula for π John D. Cook · 15d ago

From Kepler to Bessel John D. Cook · 16d ago

LLM Research Papers: The 2026 List (January to May) Ahead of AI · 16d ago

Mr. Bessel’s eponymous functions John D. Cook · 17d ago

Co-Existence and the End of Co-Intelligence One Useful Thing · 18d ago

The Latin of Linux John D. Cook · 18d ago

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI Hugging Face - Blog · 18d ago

Integrating smooth periodic functions John D. Cook · 18d ago

Designing the hf CLI as an agent-optimized way to work with the Hub Hugging Face - Blog · 19d ago

Direct Preference Optimization Beyond Chatbots Hugging Face - Blog · 19d ago

Adding MCP Tools to Reachy Mini Hugging Face - Blog · 20d ago

Farewell Ai2 Interconnects AI · 20d ago

Holo3.1: Fast & Local Computer Use Agents Hugging Face - Blog · 20d ago

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains Hugging Face - Blog · 21d ago

Open and closed models are on different exponentials Interconnects AI · 21d ago

Using LLMs to Secure Source Code Eugene Yan · 27d ago

Choosing to Stay Human One Useful Thing · 27d ago

Some ideas for what comes next, May 2026 Interconnects AI · 27d ago

SAP Sapphire 2026: The Complete Breakdown Nanonets Blog | AI Document Processing & Workflow Automation · 32d ago

AI Evaluation is Becoming an Exciting Standalone Discipline David Stutz · 37d ago

Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment. Interconnects AI · 37d ago

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention Ahead of AI · 37d ago

How open model ecosystems compound Interconnects AI · 41d ago

Notes from inside China's AI labs Interconnects AI · 46d ago

The distillation panic Interconnects AI · 49d ago

How to Work and Compound with AI Eugene Yan · 51d ago

Sign of the future: GPT-5.5 One Useful Thing · 60d ago

Claude for Legal Teams: Contract Review, Compliance and Due Diligence Nanonets Blog | AI Document Processing & Workflow Automation · 62d ago

Reading today's open-closed performance gap Interconnects AI · 63d ago

My Workflow for Understanding LLM Architectures Ahead of AI · 65d ago

Vibe Coding Best Practices: 5 Claude Code Habits for Better Agentic Coding Nanonets Blog | AI Document Processing & Workflow Automation · 67d ago

My bets on open models, mid-2026 Interconnects AI · 68d ago

What I’ve been building: ATOM Report, post-training course, finishing my book, and ongoing research Interconnects AI · 69d ago

The inevitable need for an open model consortium Interconnects AI · 72d ago

AI Benchmarks Explained: GPQA, SWE-bench, Chatbot Arena and What They Actually Measure Nanonets Blog | AI Document Processing & Workflow Automation · 73d ago

Why AI-Native IDP Platforms Outperform ABBYY and Kofax in Modern Document Workflows Nanonets Blog | AI Document Processing & Workflow Automation · 74d ago

Claude Mythos and misguided open-weight fearmongering Interconnects AI · 74d ago

Why You Hit Claude Limits So Fast: AI Token Limits Explained Nanonets Blog | AI Document Processing & Workflow Automation · 76d ago

Components of A Coding Agent Ahead of AI · 79d ago

Gemma 4 and what makes an open model succeed Interconnects AI · 80d ago

Did Google's TurboQuant Actually Solve AI Memory Crunch? Nanonets Blog | AI Document Processing & Workflow Automation · 81d ago

Get working on your April Fools Eiffel Tower AI Weirdness · 82d ago

Bonus: More April Fools pranks from Eiffel Tower Llama AI Weirdness · 82d ago

Claude Dispatch and the Power of Interfaces One Useful Thing · 83d ago

Latest open artifacts (#20): New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others Interconnects AI · 84d ago

Claude for Finance Teams: Investment Banking, DCF Models, Reconciliation & Variance Analysis Nanonets Blog | AI Document Processing & Workflow Automation · 91d ago

A Visual Guide to Attention Variants in Modern LLMs Ahead of AI · 92d ago

AI Agent Hacks McKinsey: 5 Situations When You Should Not Deploy Agents Nanonets Blog | AI Document Processing & Workflow Automation · 100d ago

The Shape of the Thing One Useful Thing · 102d ago

Are OpenAI and Google intentionally downgrading their models? Nanonets Blog | AI Document Processing & Workflow Automation · 103d ago

We ran 16 AI Models on 9,000+ Real Documents. Here's What We Found. Nanonets Blog | AI Document Processing & Workflow Automation · 103d ago

AI Arms Race Has Real Numbers: Pentagon vs China 2026 Nanonets Blog | AI Document Processing & Workflow Automation · 109d ago

Stop Paying for AI You Don't Use: The Case for Fine-Tuned Models Nanonets Blog | AI Document Processing & Workflow Automation · 112d ago

A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026 Ahead of AI · 117d ago

Running Log: Trail du Mont Agel Damian Bogunowicz - dtransposed · 123d ago

Claude's Role in Capturing Nicolás Maduro Nanonets Blog | AI Document Processing & Workflow Automation · 124d ago

A Guide to Which AI to Use in the Agentic Era One Useful Thing · 125d ago

microgpt Andrej Karpathy blog · 131d ago

Claude vs Open AI: Real Fight Is Business Model Nanonets Blog | AI Document Processing & Workflow Automation · 131d ago

Book Review: Software Engineering for Data Scientists Salmon Run · 141d ago

Management as AI superpower One Useful Thing · 146d ago

Categories of Inference-Time Scaling for Improved LLM Reasoning Ahead of AI · 149d ago

Book Review: Transformers In Action Salmon Run · 163d ago

Claude Code and What Comes Next One Useful Thing · 166d ago

The State Of LLMs 2025: Progress, Problems, and Predictions Ahead of AI · 174d ago

LLM Research Papers: The 2025 List (July to December) Ahead of AI · 174d ago

Trip Report: PyData Global 2025 Salmon Run · 178d ago

The Shape of AI: Jaggedness, Bottlenecks and Salients One Useful Thing · 184d ago

When a chatbot runs your store AI Weirdness · 185d ago

Bonus: Incorrect Christmas Carols AI Weirdness · 186d ago

2025 Year in Review Eugene Yan · 191d ago

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates Ahead of AI · 201d ago

Product Evals in Three Simple Steps Eugene Yan · 212d ago

Three Years from GPT-3 to Gemini 3 One Useful Thing · 216d ago

Giving your AI a Job Interview One Useful Thing · 223d ago

Beyond Standard LLMs Ahead of AI · 230d ago

Tiny neural net Halloween costumes are the best AI Weirdness · 237d ago

More tiny neural net costumes AI Weirdness · 237d ago

An Opinionated Guide to Using AI Right Now One Useful Thing · 246d ago

Advice for New Principal Tech ICs (i.e., Notes to Myself) Eugene Yan · 247d ago

Halloween costumes by tiny neural net AI Weirdness · 248d ago

Bonus: more halloween costumes from tiny neural net AI Weirdness · 248d ago

Book Review: Time Series Forecasting using Foundation Models Salmon Run · 253d ago

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch) Ahead of AI · 260d ago

RAISE 2025 Panel Statement on Aligning AI to Clinical Values David Stutz · 261d ago

Botober 2025: Terrible recipes from a tiny neural net AI Weirdness · 265d ago

Bonus: Char-rnn's jello creations AI Weirdness · 265d ago

Real AI Agents and Real Work One Useful Thing · 266d ago

Book Review: Statistics every Programmer Needs Salmon Run · 275d ago

Training an LLM-RecSys Hybrid for Steerable Recs with Semantic IDs Eugene Yan · 282d ago

On Working with Wizards One Useful Thing · 284d ago

Understanding and Implementing Qwen3 From Scratch Ahead of AI · 289d ago

Mass Intelligence One Useful Thing · 298d ago

From GPT-2 to gpt-oss: Analyzing the Architectural Advances Ahead of AI · 317d ago

ChatGPT will apologize for anything AI Weirdness · 318d ago

Revised instructions for troubleshooting the warp confabulator AI Weirdness · 318d ago

GPT-5: It Just Does Stuff One Useful Thing · 319d ago

The Bitter Lesson versus The Garbage Can One Useful Thing · 329d ago

The Big LLM Architecture Comparison Ahead of AI · 338d ago

Against "Brain Damage" One Useful Thing · 350d ago

LLM Research Papers: The 2025 List (January to June) Ahead of AI · 356d ago

Book Review: Hands-On Artificial Intelligence for IoT Salmon Run · 359d ago

Using AI Right Now: A Quick Guide One Useful Thing · 364d ago

Evaluating Long-Context Question & Answer Systems Eugene Yan · 366d ago

Understanding and Coding the KV Cache in LLMs from Scratch Ahead of AI · 370d ago

Book Review: Essential Graph RAG Salmon Run · 372d ago

AI Engineer 2025 - Improving RecSys & Search with LLM techniques Eugene Yan · 384d ago

Exceptional Leadership: Some Qualities, Behaviors, and Styles Eugene Yan · 401d ago

Coding LLMs from the Ground Up: A Complete Course Ahead of AI · 408d ago

Building News Agents for Daily News Recaps with MCP, Q, and tmux Eugene Yan · 415d ago

Why We Think Lil'Log · 418d ago

VernamVeil: A Fresh Take on Function-Based Encryption Datumbox · 422d ago

An LLM-as-Judge Won't Save The Product—Fixing Your Process Will Eugene Yan · 429d ago

The State of Reinforcement Learning for LLM Reasoning Ahead of AI · 429d ago

Frequently Asked Questions about My Writing Process Eugene Yan · 450d ago

First Look at Reasoning From Scratch: Chapter 1 Ahead of AI · 450d ago

Moving To Substack Jay Alammar · 454d ago

NVIDIA GTC 2025 - Building LLM-Powered Applications Eugene Yan · 462d ago

Improving Recommendation Systems & Search in the Age of LLMs Eugene Yan · 464d ago

Some Lessons on Reviews and Rebuttals David Stutz · 505d ago

Minecraft with object impermanence AI Weirdness · 519d ago

Bonus: In Which The Adventurer Attempts to Build a Website AI Weirdness · 519d ago

Common pitfalls when building generative AI applications Chip Huyen · 523d ago

Thoughts on Watermarking AI-Generated Content David Stutz · 523d ago

Building AI Reading Club: Features & Behind the Scenes Eugene Yan · 527d ago

Agents Chip Huyen · 532d ago

Thoughts and Lessons for Planning Rater Studies in AI David Stutz · 532d ago

Packaging ML Pipelines from Experiment to Deployment Salmon Run · 538d ago

2024 Year in Review Eugene Yan · 548d ago

Trip Report - PyData Global 2024 Salmon Run · 561d ago

Seemingly Paradoxical Rules of Writing Eugene Yan · 569d ago

Reward Hacking in Reinforcement Learning Lil'Log · 572d ago

How to Run a Weekly Paper Club (and Build a Learning Community) Eugene Yan · 576d ago

My Minimal MacBook Pro Setup Guide Eugene Yan · 583d ago

Open-Sourcing Relabeled MedQA and Dermatology DDx Datasets David Stutz · 587d ago

Thinking About Research Ideas vs. Technology David Stutz · 589d ago

Using Knowledge Graphs to enhance Retrieval Augmented Generation Salmon Run · 625d ago

Botober 2024 AI Weirdness · 630d ago

Experiments with Prompt Compression Salmon Run · 693d ago

Building A Generative AI Platform Chip Huyen · 698d ago

The Importance of Effectively Experimenting in an AI PhD David Stutz · 714d ago

Extrinsic Hallucinations in LLMs Lil'Log · 716d ago

Table Extraction from PDFs using Multimodal (Vision) LLMs Salmon Run · 722d ago

Book Report: Pandas Workout Salmon Run · 729d ago

Finetuning RAGAS Metrics using DSPy Salmon Run · 765d ago

Performance Analysis of Float vs Byte vs Binary Vectors on OpenSearch Salmon Run · 769d ago

FAQ for our Monte Carlo Conformal Prediction David Stutz · 771d ago

KGC/HCLS 2024 Trip Report Salmon Run · 776d ago

Measuring personal growth Chip Huyen · 797d ago

Diffusion Models for Video Generation Lil'Log · 802d ago

Book Report: Machine Learning for Drug Discovery Salmon Run · 821d ago

Hierarchical (and other) Indexes using LlamaIndex for RAG Content Enrichment Salmon Run · 827d ago

What I learned from looking at 900 most popular open source AI tools Chip Huyen · 831d ago

Predictive Human Preference: From Model Ranking to Model Routing Chip Huyen · 846d ago

Thoughts on using LangChain LCEL with Claude Salmon Run · 849d ago

I Completed The David Goggins Challenge And Asked My Garmin How I Did Damian Bogunowicz - dtransposed · 867d ago

Thinking about High-Quality Human Data Lil'Log · 869d ago

Book Report: Allen B Downey's Probably Overthinking It Salmon Run · 870d ago

Generation configurations: temperature, top-k, top-p, and test time compute Chip Huyen · 889d ago

A Video Game that Pays: Lessons Learned from Working Remotely Damian Bogunowicz - dtransposed · 964d ago

Adversarial Attacks on LLMs Lil'Log · 972d ago

Multimodality and Large Multimodal Models (LMMs) Chip Huyen · 987d ago

Open challenges in LLM research Chip Huyen · 1042d ago

LLM Powered Autonomous Agents Lil'Log · 1096d ago

My Faculty Application Experience Seita's Place · 1103d ago

Generative AI Strategy Chip Huyen · 1112d ago

Generative AI and AI Product Moats Jay Alammar · 1141d ago

Prompt Engineering Lil'Log · 1196d ago

The Transformer Family Version 2.0 Lil'Log · 1243d ago

Large Transformer Model Inference Optimization Lil'Log · 1259d ago

Remaking Old Computer Graphics With AI Image Generation Jay Alammar · 1269d ago

Books Read in 2022 Seita's Place · 1269d ago

Conference on Robot Learning 2022 Seita's Place · 1273d ago

The 2022 Robotics: Science and Systems Conference Seita's Place · 1289d ago

The Illustrated Stable Diffusion Jay Alammar · 1358d ago

Some Math behind Neural Tangent Kernel Lil'Log · 1383d ago

Deep Dive into NeRF (Neural Radiance Fields) Damian Bogunowicz - dtransposed · 1417d ago

The (In-Person) ICRA 2022 Conference in Philadelphia Seita's Place · 1419d ago

Two New Papers: Learning to Fling and Singulate Fabrics Seita's Place · 1425d ago

A Plea to End Harassment Seita's Place · 1437d ago

Generalized Visual Language Models Lil'Log · 1474d ago

The journey of Modernizing TorchVision – Memoirs of a TorchVision developer – 3 Datumbox · 1493d ago

Driving Experimentation Forward through a Working Group (Experimentation Program Series: Guide 03) ML in Production · 1506d ago

My Paper Reviewing Load Seita's Place · 1521d ago

Learning with not Enough Data Part 3: Data Generation Lil'Log · 1529d ago

What is an Experimentation program and Who is Involved? (Experimentation Program Series: Guide 02) ML in Production · 1542d ago

Building An Effective Experimentation Program (Experimentation Program Series: Guide 01) ML in Production · 1555d ago

Deep Neural Nets: 33 years ago and 33 years from now Andrej Karpathy blog · 1562d ago

Applying massive language models in the real world with Cohere Jay Alammar · 1569d ago

I Stand with Ukraine Seita's Place · 1578d ago

Learning with not Enough Data Part 2: Active Learning Lil'Log · 1584d ago

Lessons Learned from Writing Online ML in Production · 1598d ago

The Illustrated Retrieval Transformer Jay Alammar · 1632d ago

Books Read in 2021 Seita's Place · 1634d ago

Learning by Doing - the DeFi Quest (Part 2 out of 2) Damian Bogunowicz - dtransposed · 1642d ago

Learning by Doing - the DeFi Quest (Part 1 out of 2) Damian Bogunowicz - dtransposed · 1655d ago

Learning with not Enough Data Part 1: Semi-Supervised Learning Lil'Log · 1661d ago

A sneak peek at TorchVision v0.11 – Memoirs of a TorchVision developer – 2 Datumbox · 1716d ago

How to Train Really Large Models on Many GPUs? Lil'Log · 1733d ago

New Blog series – Memoirs of a TorchVision developer Datumbox · 1766d ago

What are Diffusion Models? Lil'Log · 1808d ago

A from-scratch tour of Bitcoin in Python Andrej Karpathy blog · 1828d ago

Contrastive Representation Learning Lil'Log · 1849d ago

Explainable AI Cheat Sheet Jay Alammar · 1876d ago

So You Want to Study Computer Science (in Europe)? Damian Bogunowicz - dtransposed · 1879d ago

Mixed Martial Maths - Simple Reasoning Tools For Complex Phenomena Damian Bogunowicz - dtransposed · 1905d ago

Short Story on AI: Forward Pass Andrej Karpathy blog · 1914d ago

Reducing Toxicity in Language Models Lil'Log · 1920d ago

Finding the Words to Say: Hidden State Visualizations for Language Models Jay Alammar · 1981d ago

Notes - Prisoners of Geography (Tim Marshall) Damian Bogunowicz - dtransposed · 1997d ago

Controllable Neural Text Generation Lil'Log · 1998d ago

Interfaces for Explaining Transformer Language Models Jay Alammar · 2014d ago

Newsletter #087 ML in Production · 2029d ago

Newsletter #086 ML in Production · 2037d ago

Newsletter #085 ML in Production · 2043d ago

Newsletter #084 ML in Production · 2050d ago

Newsletter #083 ML in Production · 2057d ago

Newsletter #082 ML in Production · 2064d ago

Robotic Assembly Using Deep Reinforcement Learning Damian Bogunowicz - dtransposed · 2071d ago

How to take S3 backups with DejaDup on Ubuntu 20.10 Datumbox · 2073d ago

Datumbox Machine Learning Framework v0.8.2 released Datumbox · 2147d ago

How GPT3 Works - Visualizations and Animations Jay Alammar · 2157d ago

Biohacking Lite Andrej Karpathy blog · 2203d ago

How to get around Dropbox’s symlink limitations on Linux Datumbox · 2312d ago

Efficient multi-lingual language model fine-tuning fast.ai NLP · 2478d ago

A Recipe for Training Neural Networks Andrej Karpathy blog · 2616d ago

Introducing state of the art text classification with universal language models fast.ai NLP · 2961d ago

The Batch Normalization layer of Keras is broken Datumbox · 2988d ago

5 tips for multi-GPU training with Keras Datumbox · 3074d ago

(started posting on Medium instead) Andrej Karpathy blog · 3075d ago

Software 2.0 Stories by Andrej Karpathy on Medium · 3145d ago

Ubuntu 17.10: a last minute review Datumbox · 3179d ago

Datumbox Machine Learning Framework v0.8.1 released Datumbox · 3217d ago

AlphaGo, in context Stories by Andrej Karpathy on Medium · 3309d ago

ICML accepted papers institution stats Stories by Andrej Karpathy on Medium · 3316d ago

A Peek at Trends in Machine Learning Stories by Andrej Karpathy on Medium · 3363d ago

ICLR 2017 vs arxiv-sanity Stories by Andrej Karpathy on Medium · 3388d ago

Drilling into Spark’s ALS Recommendation algorithm Datumbox · 3404d ago

Getting the GPU usage of NVIDIA cards with the Linux dstat tool Datumbox · 3418d ago

Virtual Reality: still not quite there, again. Stories by Andrej Karpathy on Medium · 3443d ago

Datumbox Machine Learning Framework version 0.8.0 released Datumbox · 3446d ago

Yes you should understand backprop Stories by Andrej Karpathy on Medium · 3472d ago

A Survival Guide to a PhD Andrej Karpathy blog · 3575d ago

Deep Reinforcement Learning: Pong from Pixels Andrej Karpathy blog · 3674d ago

Datumbox Machine Learning Framework 0.7.0 Released Datumbox · 3747d ago

Datumbox Machine Learning Framework 0.6.1 Released Datumbox · 3824d ago

CS183c Assignment #3 Stories by Andrej Karpathy on Medium · 3873d ago

Short Story on AI: A Cognitive Discontinuity. Andrej Karpathy blog · 3873d ago

Datumbox Machine Learning Framework 0.6.0 Released Datumbox · 4068d ago

How to install and use the Datumbox Machine Learning Framework Datumbox · 4243d ago

New open-source Machine Learning Framework written in Java Datumbox · 4264d ago

Clustering with Dirichlet Process Mixture Model in Java Datumbox · 4369d ago

