
AI & Machine Learning News Hub

Research, releases, and applied work in AI & ML

What's New

Top 5 Across All Sources
  1. New chip could help tiny robots traverse complex environments

    MIT News - Artificial intelligence · 5h ago
Latest
Latent.Space · [AINews] SpaceX is already a $28B/yr Neocloud
NVIDIA Blog · NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations
NVIDIA Technical Blog · How Telcos Build Autonomous Networks with Agentic AI
MIT News - Artificial intelligence · New chip could help tiny robots traverse complex environments
MIT News - Machine learning · New chip could help tiny robots traverse complex environments
cs.LG updates on arXiv.org · Towards CSI-Native Foundation Models: A Channel-Adaptive Roadmap for 6G
cs.LG updates on arXiv.org · NeuroShield: A Device-Agnostic Foundation Model for EEG Authentication
cs.LG updates on arXiv.org · Massive Activations Are Architecturally Robust: A Controlled Scratch/Commitment Residual Stream Test
cs.LG updates on arXiv.org · CIExplainer++: Generating Causal and Interpretable Explanations for Graph Neural Networks
cs.LG updates on arXiv.org · Evidential Fusion Network for Multimodal Survival Prediction under Missing Modalities
cs.LG updates on arXiv.org · ELADO: Elliptic PDE Assessment Datasets for Operator Learning
cs.LG updates on arXiv.org · B[FM]$^2$: Brain Foundation Model via Flow Matching with SplitUNet
cs.LG updates on arXiv.org · CELEUS: Certifiable and Efficient LLM Evaluation via E-Processes
cs.LG updates on arXiv.org · Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning
cs.LG updates on arXiv.org · Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study
cs.LG updates on arXiv.org · Understanding Latent Flow Models for Tabular Data Synthesis: Targets, Paths, and Sampling
cs.LG updates on arXiv.org · Temporal Causal Prior-Data Fitted Networks for Panel Data with Learned Reliability Signals
cs.LG updates on arXiv.org · MMGNN: Multi-level, multi-color graph neural networks for molecular property prediction
cs.LG updates on arXiv.org · Physics-Guided Dual-Stream Heterogeneous Graph Neural Network for Predicting Full-Field Structural Response of Stiffened Panels
cs.LG updates on arXiv.org · Short-Term Electricity Demand Forecasting for New England Using a Hybrid Transformer-XGBoost Framework with Weather, Calendar, and COVID-19 Indicators

By Source

Feeds organized so you can skim by site.

Latent.Space
2h ago · 20 items
NVIDIA Blog
3h ago · 18 items
NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations · 3h ago
    At DTW Ignite 2026, NVIDIA and its partners are showcasing the data, models, simulation and secure runtime stack enabling telcos to build more secure agentic workflows across autonomous networks and operations.
At ISC, JUPITER Shows What Exascale Science Looks Like · 20h ago
    Europe’s first exascale supercomputer — running on NVIDIA Grace Hopper Superchips — is mapping the brain, modeling climate, advancing 6G AI and breaking records in quantum computing simulation.
NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure · 20h ago
    The U.S. National Science Foundation’s NAIRR pilot program has driven innovative research across the U.S. for over 700 projects — spanning protein prediction and infectious disease outbreak management.
From Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific Discoveries · 20h ago
    NVIDIA CUDA-X libraries, microservices and reference code accelerate AI for science.
NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory · 20h ago
    Mission, Vision and Veritas supercomputers with Vera CPUs to advance materials simulation, scientific AI agents and molecular design.
Eco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins · 20h ago
    Eco Wave Power, a member of the NVIDIA Inception startup program’s Sustainable Futures initiative, is developing technology — powered by NVIDIA AI infrastructure and digital twins — that converts energy from ocean waves into clean electrici...
Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI’s Biggest Machines · 1d ago
    NVIDIA’s latest AI servers can run on coolant warmer than a hot tub — and that counterintuitive choice is one of the biggest efficiency leaps in data center history.
How FERC’s Large-Load Interconnection Actions Help Address Grid Stress, Improve Affordability · 4d ago
    The U.S. Federal Energy Regulatory Commission’s new actions on energy — a foundational layer of AI — are poised to reduce costs for ratepayers, grow the industrial base and strengthen the nation’s electrical grid.
At Cannes Lions, NVIDIA Partners Reshape Advertising and Marketing With AI · 4d ago
    At Cannes Lions, running June 22-26 in France, industry leaders including Alembic, Amazon Web Services (AWS), Criteo, Higgsfield, KERV.ai and Taboola are showcasing how NVIDIA technologies help unlock greater creativity and enable faster, a...
Sync and Stream: GeForce NOW Connects to Members’ Game Libraries Across Devices · 4d ago
    Learn how the cloud brings top game stores to nearly any screen, save big with GeForce NOW’s summer sale and discover seven new games arriving this week.
NVIDIA Technical Blog
3h ago · 20 items
How Telcos Build Autonomous Networks with Agentic AI · 3h ago
    Telecom operators are adopting AI across network operations, customer care, and back-office workflows, but most are still early in the journey to autonomy.
CCCL Runtime: A Modern C++ Runtime for CUDA · 17h ago
    The NVIDIA CUDA Core Compute Libraries (CCCL) provides delightful and efficient abstractions for CUDA developers in C++ and Python. It features: This post…
Enable Real-Time AI for High-Speed Data Acquisition with DAQIRI · 18h ago
    When AlphaFold2 revolutionized drug discovery in 2020, its success relied entirely on the roughly 170,000 protein structures collected by scientists since 1971…
Inside NVIDIA Halos for Robotics: A Full-Stack Functional Safety System for Physical AI · 20h ago
    Physical AI—robots working autonomously alongside people in factories, warehouses, hospitals, and homes—is arriving faster than most expected.
Building AI Agents for AR Glasses and XR Devices with NVIDIA XR AI · 6d ago
    Developers building for AR glasses and wearable devices face an infrastructure gap. The hardware is ready, but creating AI experiences requires integrating live…
Build Your Own Transaction Foundation Model for Financial Intelligence · 6d ago
    Every swipe, transfer, and payment on a modern financial network encodes a pattern of human behavior. Transaction data is one of the richest signals an…
Build On-Device AI Companions with the NVIDIA ACE Game Agent SDK and Unreal Engine 5 Plugins · 6d ago
    NVIDIA RTX technologies are deeply integrated into Unreal Engine 5 through the NVIDIA RTX Branch of Unreal Engine and the NVIDIA DLSS Unreal Engine plugin.
How to Optimize Transformer-Based Models for Low-Precision Training · 6d ago
    Transformer architectures are the backbone of many modern large language and generative AI models. As these models grow in size, training runs consume more GPU…
NVIDIA Blackwell Tops MLPerf Training 6.0 with Industry-Leading Scale and Performance · 6d ago
    NVIDIA delivered a clean sweep in MLPerf Training v6.0, the latest edition of industry-standard AI training benchmarks developed by the MLCommons consortium.
Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes · 7d ago
    Foundation models are reshaping computational biology. Pretrained on massive corpora of protein or genomic sequences, models such as ESM2 (a protein language…
MIT News - Artificial intelligence
New chip could help tiny robots traverse complex environments · 5h ago
    Gleanmer is a new system that can construct detailed 3D maps of a robot’s environment at high speed while operating at extremely low power. The advance could enable tiny devices to avoid obstacles and safely navigate in the real world.
A better way to model the behavior of metal alloys · 3d ago
    MIT researchers created a technique that captures chemical arrangements across materials to improve predictions of how metal alloys and other complex materials will behave.
MIT in the media: For the future of tech, "Massachusetts can absolutely lead" · 5d ago
    Leaders, faculty across MIT discuss fostering innovation and talent in Greater Boston in special series of articles published alongside the outlet's annual list of 'Tech Power Players'
In game theory, generalists sometimes win out over specialists · 5d ago
    In a new paper, MIT LIDS researchers show that for certain kinds of games, an overlooked class of algorithms performs much better than expected against better-trained opponents.
Could AI tell you where you left your keys? · 6d ago
    A new memory framework known as DAAAM enables a robot to rapidly recall rich descriptions and precise locational information about objects it encountered while exploring its environment. This efficient approach could help an autonomous agen...
MIT’s Initiative for New Manufacturing builds momentum · 6d ago
    MIT’s Initiative for New Manufacturing (INM) is advancing research, workforce development, and industry collaboration to accelerate new manufacturing technologies, strengthen industrial competitiveness, and help shape the future of manufact...
Jinhua Zhao named head of the Department of Urban Studies and Planning · 11d ago
    MIT Professor Jinhua Zhao, a noted scholar and transportation planner, has been appointed head of the MIT Department of Urban Studies and Planning.
When it comes to predicting people’s preferences, it pays to consider “the power of three” · 11d ago
    An MIT team proved that it is impossible to get information about correlations from two-way comparisons alone. Correlations can be discerned, however, when large numbers of people rate three alternatives in their order of preference.
MIT affiliates win 2026 Hertz Foundation Fellowships · 11d ago
    Three MIT students and an incoming graduate student have won 2026 Hertz Foundation Fellowships. The fellowships in applied sciences, engineering, and mathematics recognize doctoral students who are pursuing solutions to the most pressing ch...
Startup’s nuclear-inspired cooling system could make data centers more sustainable · 13d ago
    The startup Ferveret, founded by MIT Associate Professor Matteo Bucci and former MIT postdoc Reza Azizian, reduces the energy required to cool chips in data centers that power AI.
MIT News - Machine learning
5h ago · 20 items
New chip could help tiny robots traverse complex environments · 5h ago
    Gleanmer is a new system that can construct detailed 3D maps of a robot’s environment at high speed while operating at extremely low power. The advance could enable tiny devices to avoid obstacles and safely navigate in the real world.
A better way to model the behavior of metal alloys · 3d ago
    MIT researchers created a technique that captures chemical arrangements across materials to improve predictions of how metal alloys and other complex materials will behave.
In game theory, generalists sometimes win out over specialists · 5d ago
    In a new paper, MIT LIDS researchers show that for certain kinds of games, an overlooked class of algorithms performs much better than expected against better-trained opponents.
Could AI tell you where you left your keys? · 6d ago
    A new memory framework known as DAAAM enables a robot to rapidly recall rich descriptions and precise locational information about objects it encountered while exploring its environment. This efficient approach could help an autonomous agen...
When it comes to predicting people’s preferences, it pays to consider “the power of three” · 11d ago
    An MIT team proved that it is impossible to get information about correlations from two-way comparisons alone. Correlations can be discerned, however, when large numbers of people rate three alternatives in their order of preference.
Startup helps retailers track their products in real-time · 18d ago
    The startup Cartesian is helping retailers keep track of inventory with a technology invented at MIT. Using wireless signals from attached RFID tags, the system finds items’ precise location in a store, from the stockroom to the shop floor.
NSF renews support for MIT-led AI and physics institute, expanding a new model for discovery · 18d ago
    The MIT-led Institute for Artificial Intelligence and Fundamental Interactions (IAIFI) has received renewed support from the National Science Foundation for an additional five years.
Teaching AI agents to ask better questions by playing “Battleship” · 19d ago
    AI models played “Collaborative Battleship” together and struggled to ask informative questions about hidden ships. A Monte Carlo inference strategy helped small agents carefully consider each inquiry to outperform larger systems at a fract...
MIT researchers teach AI models to interpret charts · 20d ago
    Researchers used a novel data generation pipeline to build ChartNet, a large synthetic dataset of chart images paired with corresponding information. They used this training dataset to improve the performance of generative AI models at chal...
MIT economist Whitney Newey awarded Erwin Plein Nemmers Prize in Economics · 32d ago
    MIT economist Whitney Newey was awarded the Erwin Plein Nemmers Prize in Economics, a biennial award from Northwestern University that celebrates lasting contributions to economic scholarship.
cs.LG updates on arXiv.org
5h ago · 20 items
Towards CSI-Native Foundation Models: A Channel-Adaptive Roadmap for 6G · 5h ago
    Abstract page for arXiv paper 2606.20670
NeuroShield: A Device-Agnostic Foundation Model for EEG Authentication · 5h ago
    Abstract page for arXiv paper 2606.20673
Massive Activations Are Architecturally Robust: A Controlled Scratch/Commitment Residual Stream Test · 5h ago
    Abstract page for arXiv paper 2606.20743
CIExplainer++: Generating Causal and Interpretable Explanations for Graph Neural Networks · 5h ago
    Abstract page for arXiv paper 2606.20747
Evidential Fusion Network for Multimodal Survival Prediction under Missing Modalities · 5h ago
    Abstract page for arXiv paper 2606.20757
ELADO: Elliptic PDE Assessment Datasets for Operator Learning · 5h ago
    Abstract page for arXiv paper 2606.20771
B[FM]$^2$: Brain Foundation Model via Flow Matching with SplitUNet · 5h ago
    Abstract page for arXiv paper 2606.20812
CELEUS: Certifiable and Efficient LLM Evaluation via E-Processes · 5h ago
    Abstract page for arXiv paper 2606.20820
Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning · 5h ago
    Abstract page for arXiv paper 2606.20858
Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study · 5h ago
    Abstract page for arXiv paper 2606.20874
stat.ML updates on arXiv.org
5h ago · 20 items
Beyond Importance: Interchange-Sobol Sensitivity Reveals Task-Specific Content Channels in Transformer Components · 5h ago
    Abstract page for arXiv paper 2606.20678
Betting on Moments: Legendre Jumper Martingales for Online Exchangeability Testing · 5h ago
    Abstract page for arXiv paper 2606.20859
Adversarial observations in probabilistic State-Space Models for robust Reinforcement Learning · 5h ago
    Abstract page for arXiv paper 2606.20880
Diffusion-Driven State Space Models · 5h ago
    Abstract page for arXiv paper 2606.21036
Bayesian Model Averaging under Predictor Redundancy via Density-Ratio Posterior Compression · 5h ago
    Abstract page for arXiv paper 2606.21080
Two Layers of Instability in Causal Estimation · 5h ago
    Abstract page for arXiv paper 2606.21185
Orthogonal Discrepancy Kernels for Learning with Partial Physics · 5h ago
    Abstract page for arXiv paper 2606.21199
Subsampling for supervised learning in reproducing kernel Hilbert spaces · 5h ago
    Abstract page for arXiv paper 2606.21260
Finite-Sample Performance of Gradient Descent in Logistic Regression with Gaussian Design · 5h ago
    Abstract page for arXiv paper 2606.21683
Signed Evidence Flow: Conflict-Aware and Stability-Calibrated Data Analysis · 5h ago
    Abstract page for arXiv paper 2606.21875
Hugging Face - Blog
9h ago · 20 items
Shipping huggingface_hub every week with AI, open tools, and a human in the loop · 9h ago
    We’re on a journey to advance and democratize artificial intelligence through open source and open science.
PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters · 19h ago
    A Blog post by PaddlePaddle on Hugging Face
We got local models to triage the OpenClaw repo for FREE!* · 1d ago
MosaicLeaks: Can your research agent keep a secret? · 4d ago
    A Blog post by ServiceNow on Hugging Face
Beyond LoRA: Can you beat the most popular fine-tuning technique? · 5d ago
Is it agentic enough? Benchmarking open models on your own tooling · 5d ago
MolmoMotion: Language-guided 3D motion forecasting · 5d ago
    A Blog post by Ai2 on Hugging Face
From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot · 5d ago
    A Blog post by Amazon on Hugging Face
GLM-5.2: Built for Long-Horizon Tasks · 6d ago
    A Blog post by Z.ai on Hugging Face
Agentic Resource Discovery: Let agents search · 6d ago
Machine Learning
9h ago · 589 items
Non-deterministic Vulnerability Detection Benchmark System [P] · 9h ago
Syntactically robust NLI for semantics of imperfectly generated text? [R] · 11h ago
Recommendations for speech annotation tools [D] · 13h ago
Some new updates to Papers with Code [P] · 18h ago
[ECCV 2026] Paper Decision Appeals Discussion [D] · 1d ago
An Update on Matrix Recurrent Units, an Attention Alternative [R] · 1d ago
Data-centric debugging for teams training neural nets [P] · 1d ago
Best current methods for finetuning whisper on domain specific vocabulary? [P] · 1d ago
EMA on LoRA ? [R] · 1d ago
A slightly improved DVD-JEPA demo [P] · 1d ago
Machine Learning Street Talk
10h ago · 15 items
FA
Apple’s iPhone 18 Pro Features: Everything We Know So Far · 12h ago
    iPhone 18 Pro rumors point to AI upgrades, a 2nm A20 Pro chip, camera changes, a smaller Dynamic Island, and possible pricing shifts.
X Targets ‘Neglected’ Meta Engineers in Public Recruiting Pitch · 12h ago
    X product head Nikita Bier targeted “neglected” Meta employees with a snack-budget hiring pitch as Meta works to improve morale.
Apple Supplier Bets on Robots, AI Servers in $1.1B Hong Kong Listing · 13h ago
    Apple supplier Lingyi iTech seeks a US$1.1B Hong Kong IPO to fund expansion in AI hardware, robotics, smart glasses, and AI servers.
India’s Ultrahuman Launches No-Prescription Glucose Tracking Platform in the US · 14h ago
    Ultrahuman M2 Live brings no-prescription glucose tracking to the US using Abbott’s Lingo CGM, with plans starting at $99 per month.
Mukesh Ambani’s Reliance AI Roadmap Puts Jio CallAgent Inside the Network · 18h ago
    Mukesh Ambani’s Reliance AI roadmap ties Jio CallAgent to telecom-scale AI, Jamnagar compute, local languages, and enterprise governance risks.
Anthropic’s Fable 5 Withdrawal Underscores Importance and Difficulty of ‘sovereign AI’ Strategies · 21h ago
    Anthropic’s abrupt Fable 5 withdrawal due to US export controls underscores the urgent need for UK and European sovereign AI infrastructure.
As the World Claims Tech Sovereignty, Where Does Australia Stand? · 4d ago
    Australian enterprises remain dependent on foreign technology infrastructure, while major economies treat that dependency as a strategic liability.
Top 5 Prompt Engineering Certifications That Are Worth Taking (2026) · 4d ago
    These prompt engineering courses can help you refine and structure natural language requests to get the most out of generative AI.
Alibaba Cloud Bets on France as Europe Seeks More Control Over AI · 4d ago
    Alibaba Cloud opened two Paris availability zones as European enterprises weigh data sovereignty, resilience, and AI infrastructure needs.
Anthropic Adds Brand Controls, Code Sync to Claude Design · 4d ago
    Anthropic updated Claude Design with design system imports, Claude Code syncing, canvas editing, and more export options for enterprise teams.
Artificial Intelligence
15h ago · 20 items
Building pay-per-intelligence for AI agents: How Ampersend uses Amazon Bedrock AgentCore Payments · 15h ago
    In this post, you will learn how Ampersend built a pay-per-intelligence routing layer on top of Amazon Bedrock AgentCore Payments. AI agents autonomously route tasks to the most effective model, pay per request, and operate within spending ...
Embed the world: Multimodal AI for searchable aerial imagery at scale · 16h ago
    In this post, we walk through the problem space, our architecture on Amazon Bedrock and Amazon OpenSearch Serverless, the evaluation methodology we built on OpenStreetMap ground truth, four experiments that compared embedding models, fusion...
Running ComfyUI workflows on Amazon SageMaker AI processing jobs · 16h ago
    In this post, we walk you through how to deploy ComfyUI workflows on Amazon SageMaker AI processing jobs to generate hundreds of high-quality images in a single batch. You learn how to set up the infrastructure using AWS Cloud Development K...
Introducing Web Search on Amazon Bedrock AgentCore · 3d ago
    Web Search on Amazon Bedrock AgentCore is now generally available. In this post, we walk through what makes Web Search on Amazon Bedrock AgentCore different, why it matters, and how to wire it in with a few lines of code.
Accelerate campaign workflow with insights from Adobe Marketing Agent for Amazon Quick · 3d ago
    This post shows how to enable Adobe Marketing Agent for Amazon Quick using a Model Context Protocol (MCP). We walk you through how to configure the integration, authenticate using your Adobe credentials, and get the latest insights in Amazo...
Monitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch · 4d ago
    Amazon SageMaker AI provides fully managed real-time inference hosting for machine learning models. You deploy a model to a SageMaker endpoint backed by one or more compute instances, and SageMaker handles provisioning and scaling. SageMake...
Amazon Bedrock AgentCore harness is now generally available: Go from idea to production-grade agent in minutes · 4d ago
    Today, Amazon Bedrock AgentCore harness is generally available. Two API calls (CreateHarness to define an agent, and InvokeHarness to run it), and you have an agent running in seconds. The agent runs in its own isolated environment with a f...
Amazon SageMaker AI Async Inference now supports inline request payloads · 5d ago
    Today, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload input data to Amazon ...
Get back hours every day with autonomous agents in Amazon Quick · 5d ago
    Today, Quick gets even more powerful: new autonomous agents that work continuously on your behalf, an activity feed that helps you prioritize your most important work, and the ability to find insights across every data source your business ...
Context intelligence for your data and AI agents at scale · 5d ago
    Agents are only as intelligent as the context they can reason over. Today, that context is scattered across data lakes, data warehouses, lakehouses, databases, and streams, and in institutional knowledge that has never been written down. Yo...
Towards Data Science
16h ago · 20 items
Encoding Categorical Data for Outlier Detection · 16h ago
How to Use Claude Code in Your Browser · 18h ago
When RAG Users Ask Vague Questions: Clarify Once, Learn the Default · 19h ago
Neural Networks, Explained for Beginners: Start Here If They’ve Confused You · 21h ago
Tool Calling, Explained: How AI Agents Decide What to Do Next · 1d ago
Reconstructing the Table of Contents a PDF Forgot to Ship, So RAG Can Scope by Section · 1d ago
What Are the Possibilities to Build Date Tables in Self-Service Environments? · 1d ago
7 Crucial Barriers Between Data Teams and Self-Healing Data Architecture · 2d ago
Making a PDF’s Images Searchable for RAG, Without Paying to Read Them All · 2d ago
Materialized Lake Views in Microsoft Fabric: When Your Medallion Fits in a SELECT Statement · 2d ago
Two Minute Papers
17h ago · 15 items
Quanta Magazine
18h ago · 5 items
Interconnects AI
18h ago · 20 items
Import AI
20h ago · 20 items
Import AI 462: Superpersuasion; self-sustaining AI; paths to ASI · 20h ago
Import AI 461: "Alignment is not on track"; FrontierCode; and synthetic research interns · 7d ago
Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing · 14d ago
Import AI 459: AI oversight is difficult; scaling laws for protein folding models; and pricing the extinction risk of AI systems · 21d ago
Import AI 458: Reckoning with the future; and a singularity story · 27d ago
Import AI 457: AI stuxnet; cursed Muon optimizer; and positive alignment · 35d ago
Import AI 456: RSI and economic growth; radical optionality for AI regulation; and a neural computer · 42d ago
Import AI 455: AI systems are about to start building themselves. · 49d ago
Import AI 454: Automating alignment research; safety study of a Chinese model; HiFloat4 · 63d ago
Import AI 453: Breaking AI agents; MirrorCode; and ten views on gradual disempowerment · 70d ago
AIhub
23h ago · 20 items
Engineering Out Loud: S13E1 – How many robots can a single human supervise? · 23h ago
Everything, eco-where, AI at once? · 3d ago
AI is making journalistic language more repetitive and predictable – and it’s a problem for all of us · 5d ago
AIhub monthly digest: June 2026 – biodiversity, resource allocation, and color metaphors · 6d ago
AAAI presidential panel – AI agents · 7d ago
Interview with AAAI Fellow Tanya Berger-Wolf: AI for ecology, biodiversity, and conservation · 11d ago
Statistical or embodied? Comparing people and LLMs in their processing of color metaphors: an interview with Douglas Guilbeault · 13d ago
The Good Robot podcast: the battle over data centres with Tara Merk · 14d ago
Congratulations to the #AAMAS2026 best paper award winners · 17d ago
Interview with AAAI Fellow Sanmay Das: multiagent systems · 18d ago
Linear Digressions
1d ago · 20 items
Agent Economics (The Agents Season, Episode 11) · 1d ago
    What if building more highways made your commute *slower*? That's the paradox at the heart of AI agent economics: even as per-token inference costs have plummeted dramatically over the past two years, total LLM spending keeps climbing. Draw...
Agent Trust, Oversight and Control (The Agents Season, Episode 9) · 8d ago
    Capabilities get all the attention when it comes to AI agents — but what happens when a highly capable agent makes a bad decision in the real world? Trust, oversight, and control are the unglamorous but critically important flip side of the...
Many Agents, Many Problems (The Agents Season, Episode 8) · 15d ago
    Whether you work best solo or thrive in a team, you know collaboration is complicated — and it turns out AI agents face the same tensions. This episode dives into multi-agent systems, exploring how networks of AI agents can overcome the ind...
How Do You Evaluate An AI Agent? (The Agents Season, Episode 7) · 22d ago
    Knowing when an AI agent has failed sounds straightforward — until it isn't. Agents have a frustrating habit of finishing confidently while quietly doing the wrong thing, or looping endlessly without ever crashing in an obvious way. This ep...
AI Agent Failure Modes (The Agents Season, Episode 6) · 28d ago
    Despite what the marketing hype might suggest, AI agents are far from infallible — and if you've ever actually used one, you already know this. Today's episode dives deep into the many, varied, and sometimes surprising ways AI agents can fa...
Agentic Planning (The Agents Season, Episode 5) · 36d ago
    When tackling a complex, multi-step task, even the smartest AI agent can fail without a solid game plan. This episode dives into the research around agentic planning — how agents move beyond simply reacting to what's in front of them and in...
Memory Management for AI Agents (The Agents Season, Episode 4) · 42d ago
    Context windows are powerful — but finite, and surprisingly easy to overwhelm. When an AI agent is tackling a long, complex task, the information it needs has to fit inside that limited real estate, and research shows that anything buried i...
Lost in the Middle (The Agents Season, Episode 3) · 50d ago
    Just like a memorable talk lives or dies by its opening and closing, LLMs have a surprisingly similar quirk: they pay close attention to what's at the beginning and end of their context window — and kind of zone out in the middle. This "los...
ReAct and Tool Usage (The Agents Season, Episode 2) · 57d ago
    Before 2022, there was a wall between AI and the real world — models could reason impressively, but couldn't look anything up, run code, or check whether anything they said was actually true. This episode traces the moment that wall came do...
What's an AI Agent? And Why Is that Hard to Define? (The Agents Season, Episode 1) · 64d ago
    AI agents are having a moment — and unpacking them properly takes more than a single conversation. This episode kicks off a dedicated multi-part season exploring AI agents from every angle, building up a complete picture piece by piece rath...
Will AI spark a scientific renaissance — or a diffuse monoculture? 1d ago Artificial intelligence’s ability to enrich science will depend not only on model capability, but also on whether researchers, reviewers and funders reward originality over speed. Ovo, an open-source ecosystem for de novo protein design 3d ago The protein design field is rapidly advancing, with frequent emergence of new models and pipelines for designing de novo proteins with tailored properties and functions not found in nature. However, the current tool landscape is fragmented,... Memorization in large language models in medicine prevalence characteristics and implications 4d ago Large Language Models (LLMs) have demonstrated significant potential in medicine, with many studies adapting them through continued pretraining or fine-tuning on medical data. However, a key question remains: to what extent do LLMs memorize... A universal deep learning framework for empowering nanopore identification by reinforcing temporal signals 5d ago Nanopore sensing holds transformative potential for revolutionizing protein and glycan sequencing. However, translating this potential into practical, high-fidelity identification is severely bottlenecked by the challenge of processing mass... SplitSeek-Pro: accurate prediction of splittable sites on protein structures 5d ago Understanding protein architecture and predicting its structural tolerance to profound remodeling is pivotal for engineering functional proteins. We present SplitSeek-Pro, a deep learning model that evaluates amino acid-level splittability ... Is AI ruining our skills? Early results are in — and they’re not good 5d ago Reliance on artificial-intelligence tools degrades the abilities of physicians and software engineers, studies show. 
SAMJ: fast image annotation on ImageJ/Fiji via segment anything model 5d ago Accurate image annotation is essential for training artificial intelligence (AI) systems in biomedical image analysis, enabling tasks such as cell detection, tissue quantification, and disease characterization. However, creating pixel-level... Modeling visual memorability assessment with autoencoders reveals characteristics of memorable images 6d ago Image memorability refers to the phenomenon where certain images are more likely to be remembered than others. It is a quantifiable and intrinsic image attribute, defined as the likelihood of an image being remembered upon a single exposure... Reimagining machine vision with optical computing 6d ago A specialized ‘metasurface’ can preprocess incoming scene information on image-generating devices. Multi-omic analysis of deep learning-derived phenotypes links ophthalmic imaging to cardiovascular and neurological traits 7d ago The eye is a recognized source of biomarkers for cardiovascular and neurodegenerative disease risk. Here we characterize the breadth of these associations and identify biological axes that may mediate them. Using UK Biobank data, we develop...
Dwarkesh Patel
1d ago · 15 items
Eugene Yan
2d ago · 20 items
Welch Labs
3d ago · 15 items
sentdex
3d ago · 15 items
Future of Life Institute
3d ago · 20 items
Should AIs be people too? 3d ago Statement: Anthropic warns of AI self-improvement risks, considers a pause 15d ago FLI President on the White House Executive Order 19d ago Magnificent Humanity – The Pope’s First Encyclical Concerns AI 33d ago White House working group on AI – Statement from FLI’s Anthony Aguirre 48d ago FLI’s President and CEO on Trump’s support for an AI ‘kill switch’ 67d ago FLI CEO’s statement on the attack against Sam Altman’s home 73d ago Prominent Scientists, Faith Leaders, Policymakers and Artists Call for a Prohibition on Superintelligence, as Poll Shows Americans Don’t Want It 87d ago Statement: Head of US Policy on the White House AI legislative recommendations 92d ago Governor DeSantis Directs Florida State Agencies to Partner with Future of Life Institute to Shield Families from AI Harm 105d ago
Ai2 Blog
5d ago · 20 items
How Domyn and AISquared built on Ai2's open releases 5d ago Domyn and AISquared show how Ai2’s open releases are helping AI labs build models for regulated industries, where transparency, provenance, licensing, and control are essential for customer trust and compliance. MolmoMotion: Language-guided 3D motion forecasting 6d ago MolmoMotion is an open, language-guided 3D motion forecasting model that predicts how object points will move in the future, enabling stronger motion prediction for robotics, video generation, and other systems that need to reason about wha... olmo-eval: An evaluation workbench for the model development loop 11d ago olmo-eval is an open evaluation workbench that helps model developers add, run, and analyze benchmarks across changing LLM checkpoints, extending OLMES from final-score reproducibility into the day-to-day model development loop. Building accessibility tools on a truly open foundation 33d ago PointCheck, an independent project, uses Molmo, MolmoWeb, and Olmo 3 to test web accessibility the way a keyboard user would—by navigating real pages and inspecting what's actually on screen. OlmoEarth v1.1: A more efficient family of models 35d ago OlmoEarth v1.1 is a more efficient family of remote-sensing models that cuts compute costs by up to 3x while maintaining similar performance, making large-scale satellite mapping faster and cheaper to run. Introducing AIMIP: The AI weather and climate model intercomparison project 41d ago AIMIP is a new open benchmark and dataset for evaluating AI climate models, showing they can match or beat conventional models on some historical climate metrics while still struggling to generalize reliably to long-term warming trends and ... 
Why Artificial Analysis uses Ai2's IFBench instruction-following eval 43d ago Artificial Analysis uses Ai2’s open IFBench eval because it captures a stubborn, real-world capability many benchmarks miss: whether models can reliably follow complex, multi-part user instructions. EMO: Pretraining mixture of experts for emergent modularity 46d ago EMO is a new mixture-of-experts model trained so modular expert groups emerge from data, enabling users to select small task-specific expert subsets while preserving near full-model performance. Open by design: Ai2 brings fully open AI infrastructure online with NSF OMAI 47d ago Ai2 is bringing NSF OMAI compute online to power a fully open AI research ecosystem, turning national infrastructure investment into reusable models, data, methods, and tools that can accelerate scientific discovery. MolmoAct 2: An open foundation for robots that work in the real world 49d ago MolmoAct 2 is a fully open robotics foundation model that brings faster, stronger 3D action reasoning to real-world robot tasks, alongside a major new bimanual manipulation dataset for researchers to study, reproduce, and build on.
AI
5d ago · 20 items
New research shows how AMIE, our medical AI, could help manage health conditions. 5d ago Research in “Nature” shows our conversational AI system matches primary care physicians in complex disease management. We’re strengthening our presence in Alabama through new investments and community support. 7d ago Google has announced a $1.5 billion investment for 2026 and 2027 to expand its data center campus in Jackson County, Alabama. Operating since 2019 on a repurposed former… Our new community investments in Virginia support local jobs and expand energy affordability. 11d ago We’re helping build the state’s next-generation workforce and investing in energy programs. The latest AI news we announced in May 2026 17d ago Here are Google’s latest AI updates from May 2026 5 ways Google Search can level up your thrift and vintage shopping 19d ago Uncover second-hand scores with AI tools in Google Search and Shopping. How we used Gemini to build Google I/O 2026 21d ago Learn how Googlers used AI to produce Google I/O 2026. Take our I/O 2026 quiz, vibe coded in Google AI Studio. 24d ago We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements. 9 demos of Gemini Omni and Gemini 3.5 in action 24d ago Watch 11 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026. Check out real-life AI prototypes from the Futures Lab. 24d ago University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work. Catch up on 12 major I/O 2026 moments 25d ago Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.
Google DeepMind News
6d ago · 20 items
Unlocking UK house-building with AI-accelerated planning 6d ago Google DeepMind is working alongside the UK government to co-develop an AI-powered prototype to help cut application decision times by 50%. Securing the future of AI agents 6d ago Discover our AI Control Roadmap: a defense-in-depth system to securely manage advanced, potentially misaligned AI agents. DiffusionGemma: 4x faster text generation 12d ago An overview of DiffusionGemma, an exceptionally fast text generation model with up to 4x faster speeds. Investing in multi-agent AI safety research 12d ago Google DeepMind and partners are announcing a new technical research funding call of up to $10M for researchers worldwide to strengthen multi-agent safety. Fluid, natural voice translation with Gemini 3.5 Live Translate 13d ago Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet. Introducing Gemma 4 12B: a unified, encoder-free multimodal model 13d ago An overview of Gemma 4 12B, a model designed to bring high-performance multimodal intelligence directly to your laptop. Powering the future of robotics in Europe 13d ago AI has the potential to help solve some of the world’s biggest challenges — not just in the digital realm, but in the physical world, too. Robotics is one of the most exciting frontiers of AI, where advances in language, vision and action m... Measuring the impact of learning with AI in Sierra Leone and beyond 14d ago Google DeepMind shares results from a randomized controlled trial in Sierra Leone, measuring the impact of AI in education on student learning and engagement. We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks 32d ago The Asia-Pacific region is a global engine for economic growth, but it's also highly vulnerable to climate change. 
While green technologies are gaining momentum, a recen… Fast-tracking genetic leads to reverse cellular aging 35d ago Biologists Omar Abudayyeh and Jonathan Gootenberg use Co-Scientist to scan thousands of papers and identify over 20 novel factors that could reverse cellular aging.
3Blue1Brown
6d ago · 15 items
The latest research from Google
6d ago · 20 items
From pixels to planning: Earth AI for nature restoration 6d ago Research into how AI can help users understand skin conditions 10d ago A low-carbon computing platform from your retired phones 10d ago New framework for auditing machine unlearning 12d ago Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG 17d ago Towards passive heart health monitoring via smartphone camera 18d ago The next chapter in flood resilience: Open sourcing Google’s hydrology framework 19d ago A New Era of Discovery: Google Research at I/O 2026 25d ago Private analytics via zero-trust aggregation 26d ago Empirical Research Assistance (ERA): From Nature publication to catalyzing Computational Discovery 34d ago
AI Explained
8d ago · 15 items
David Stutz
8d ago · 10 items
Domain-Specific AI Should Focus on Workflows Rather Than Modeling 8d ago I spent the past few years working on AI for health. Starting with custom multimodal encoders, post-training, and sophisticated multi-agent architectures, I now see modeling work becoming less and less important for domain-specific applicati... AI Evaluation is Becoming an Exciting Standalone Discipline 67d ago Having worked on robustness problems during my PhD, I see many of the characteristics appearing in the evaluation of LLMs and AI systems. Adversarial attacks such as jailbreaks are becoming more relevant, edge cases finally become relevant,... RAISE 2025 Panel Statement on Aligning AI to Clinical Values 261d ago Recently, I attended the Responsible AI for Social and Ethical Healthcare 2025 “2.0” Symposium organized by, among others, Harvard Medical School. The symposium featured various panels on topics surrounding generative AI, in particular mult... Some Lessons on Reviews and Rebuttals 505d ago Writing and responding to reviews is the bread and butter of any academic and especially in AI research, PhD students are confronted with both rather early compared to other disciplines. Unfortunately, I found that drafting reviews and rebu... Thoughts on Watermarking AI-Generated Content 523d ago Watermarking AI-generated content has the potential to address various problems that generative AI threatens to aggravate — misinformation, impersonation, copyright infringement, web pollution, etc. However, it is also controversial with ma... Thoughts and Lessons for Planning Rater Studies in AI 532d ago With the goal of deploying generative AI systems, rater studies are becoming increasingly common and important. This means more and more researchers and engineers face the challenge of actually planning and conducting rater studies for AI s... 
Open-Sourcing Relabeled MedQA and Dermatology DDx Datasets 587d ago Dealing with rater disagreement is becoming more important in AI, especially for LLMs and in specialized domains such as health. In the past year, I helped open source two datasets allowing to study rater disagreement in the health domain: ... Thinking About Research Ideas vs. Technology 589d ago In this article, I want to share some thoughts on the difference between research ideas and technology, particularly in machine learning. This distinction is one I have been contemplating since starting my PhD. After joining Google DeepMind and b... The Importance of Effectively Experimenting in an AI PhD 714d ago Engineering and running experiments are a key component of most PhDs in AI. While there are plenty of more theoretical topics that are often limited to smaller scale experimentation, the trend has definitely been to scale up models, dataset... FAQ for our Monte Carlo Conformal Prediction 771d ago Over the past months, I have given several talks about Monte Carlo conformal prediction and the problem of calibrating with uncertain ground truth, for example, stemming from annotator disagreement. Each time, the audience had great questio...
Microsoft Research
10d ago · 10 items
Ire identifies another LOTUSLITE specimen 10d ago Project Ire examined a timely malware sample and determined its intent through reverse engineering—identifying LOTUSLITE characteristics even as most major EDR tools did not detect it. Data Formulator 0.7: AI-powered data analytics for enterprise data 25d ago Data Formulator introduces AI-powered analytics for enterprise data workflows. Data teams can easily bring enterprise data into an AI-ready workspace where users can explore, analyze, and visualize data with AI agents to turn raw data into ... Extending Human Intelligence Through AI 26d ago Understanding AI as an extension of human intelligence—not a replacement for it—offers a more grounded path for building trustworthy AI systems. MagenticLite, MagenticBrain, Fara1.5: An agentic experience optimized for small models 32d ago MagenticLite is an agentic system for small models that works across the browser and local file system in a single workflow. It combines specialized models and orchestration to support efficient agentic performance on everyday tasks: Vega: Zero-knowledge proofs for digital identity in the age of AI 32d ago Vega turns a full credential into a single proof, sharing only what is needed and nothing more, with performance that works in real apps. Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability 38d ago Our recent paper, “LLMs Corrupt Your Documents When You Delegate”, has generated discussion about the reliability of AI systems in delegated workflows. We appreciate the interest in this work and want to clarify several important points abo... mimalloc: A new, high-performance, scalable memory allocator for the modern era 40d ago mimalloc is an open-source, modern, scalable memory allocator that is a drop-in replacement for malloc and free. It is relatively small (~12K lines), with clear internal data structures, and is easy to build and integrate into other project... 
GridSFM: A new, small foundation model for the electric grid 40d ago Introducing GridSFM, a small foundation model that can predict AC optimal power flow in milliseconds, boosting efficiency and unlocking cost savings. Learn how GridSFM gives grid operators direct visibility into congestion, stability, and s... Advancing AI for materials with MatterSim: experimental synthesis, faster simulation, and multi-task models 41d ago MatterSim is expanding what AI can do for materials science—from faster large-scale simulations to MatterSim-MT, a new multi-task model for simulating properties beyond potential energy surfaces alone. SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests 42d ago Using SocialReasoning Bench, we observed a stable pattern across models—agents execute competently, but fail to consistently improve the user’s position, even with explicit instructions to optimize for user interest.
Learn Machine Learning
11d ago · 2235 items
We are open-sourcing a vision-language interaction model and system 11d ago Everyone's been talking about Thinking Machines' "interaction model" as a concept. We went and built one — 8B, vision-driven, decides on its own when to speak, and when to delegate to agent — and we're open-sourcing all of it 11d ago Math Focused Learning Resources 11d ago I built a FIFA World Cup 2026 Predictive Oracle & Bracket Simulator live on the web! 11d ago How is CampusX One Membership courses ?? worth it? 11d ago GitHub Autopilot — Open Source GitHub App for Repository Automation 11d ago Day 20 of Reviewing 1 free AI, ML, or data certification every day, so you don’t have to waste time with bad courses. 11d ago
Artificial Intelligence (AI)
11d ago · 1197 items
Visa and OpenAI Let AI Agents Shop on Your Behalf Using Visa's Global Network 11d ago The productivity gap between "AI user" and "AI agent user" is bigger than I expected 11d ago If AI could comfort you perfectly, would you still want parts of yourself left unread? 11d ago How To Get Web Design Clients 11d ago Which AI agent are you? 11d ago Do you think AI is becoming normal faster than people expected? 11d ago By 2050, we may see AI assistants in every home, personalized learning for every student, advanced medical treatments, smart cities, and even human-AI collaboration on a massive scale. 11d ago The gap between decision and execution 11d ago OpenAI Filed for IPO at $852B as Anthropic Beats It to Market and Price Cuts Loom 11d ago
Deep Learning
11d ago · 610 items
SenseNova U1 training code and dataset are open-sourced. How is it different from other text-to-image models? 11d ago JudgeOS V5.7 / EBH — The Governance Firewall Above AI, Robots, Agents, and Autonomous Workflows 11d ago “GenalShift (my activation function) has outperformed ReLU on CIFAR-10 when training a ResNet18 from scratch: 92.33% vs 92.07% (+0.26%). Open source on GitHub. #IAsoberana #DeepLearning” 11d ago I spent a year applying information geometry to LLM behavioral monitoring. Here’s what the math shows about multi-turn attacks. 11d ago Nobody sent a memo. Nobody made an announcement. But somewhere in the last two years, marketing quietly changed forever. 11d ago Need help with implementation of transformer-decoder model 11d ago Plot twist: your future killer already has a USB port 11d ago Running Gemma 4 QAT 12B on an 8GB GPU at 16k context — measured the KV-cache tradeoffs 12d ago Request for critique: deterministic governance boundary for AI agent actions before execution 12d ago Analysis of the results of the "Transforming autoencoders" architecture mentioned by Hinton, for my dissertation. 12d ago
Reinforcement Learning
11d ago · 254 items
highway-v0 env is too slow 11d ago Fair Reinforcement Learning 11d ago Korrel: turn one agent eval into a verifiers or OpenEnv RL environment, with a fidelity proof against tau2-bench 12d ago Do you ever get to the point of mental breakdown? 12d ago Roast my resume 13d ago Optimizing an RL Training Pipeline: Memory, Sampling, and Copy Elimination 13d ago Reasoning LLMs make RL agent learn Faster 13d ago I Built a Reinforcement Learning AI That Runs on an Arduino Mega 13d ago Previous Claude models struggled to play Pokémon Fire even with harnesses that gave them additional helpful tools, but Fable 5 beat FireRed with a minimal, vision-only harness. 13d ago Testing the stability of my new walking gait (x0.25) 13d ago
Data Science
12d ago · 161 items
Is this AgenticAI Ragebait? 12d ago How to stop shipping low-quality RL environments, with examples 12d ago Is my tech stack becoming a liability for future job prospects? 12d ago AI Overuse Follow-up 13d ago How do you measure the performance / accuracy of a recommender system? 13d ago How do you put a price on a healthy work environment and a good manager? 13d ago How Earnings Impact My Momentum Strategy - A Backtest Across Two XGBoost Models 13d ago What Data Structures and Algorithms topics actually come up in technical interviews? 13d ago Weekly Entering & Transitioning - Thread 08 Jun, 2026 - 15 Jun, 2026 15d ago Open and closed models are on different exponentials 15d ago
ONLY FOR A LIMITED TIME 12d ago Designing a Universal AI Safety Framework 12d ago Analog Neuromorphic letter recognition circuit 12d ago How one engineer at Spotify solved the recommendations of music by building an open source library ANNOY 12d ago Built native iOS/macOS ONNX model analyzer (inspired by Netron) (Looking for feedback) 12d ago Optimizing an RL Training Pipeline: Memory, Sampling, and Copy Elimination 13d ago Personalization Yo-Yo: A Ruler-Based Mechanism for Non-Sticky Long-Term Personalization 14d ago Object detection Using Detection Transformer (Detr) for Bone fraction dataset 15d ago I built an MNIST classifier from scratch in pure Python (no NumPy) to actually understand backprop 17d ago dataset and architecture 18d ago
Amazon Science homepage
12d ago · 20 items
EC2’s formally verified “isolation engine” provides mathematical assurance of virtual-machine isolation 12d ago 330,000 lines of machine-checked proofs in Isabelle/HOL verify that the Nitro Isolation Engine correctly enforces confidentiality, integrity, and memory safety between EC2 virtual machines on Graviton5. Graviton5’s improved design increases speed and energy efficiency — beyond Moore’s law 12d ago Graviton5's four-chiplet architecture, custom die-to-die connectivity, three-nanometer process, and 192 megabytes of L3 cache deliver up to 35% faster performance for web applications and ML inference. Real-world grounding in agentic AI 14d ago Physics-guided learning, calibrated uncertainty, numerical precision, and formal verification help AI agents avoid hallucinations and operate safely in warehouses, factories, and other physical systems. Bridging intent and execution in agentic systems 14d ago The harnesses that mediate between models and tools in agentic systems are becoming their own performance bottleneck, but a few simple design principles can fix what ails them. Ground truth is a process, not a dataset 19d ago A new audit-then-score protocol improves benchmark accuracy from 60.8% to 90.9% for evaluating AI fact-checkers on deep-research reports. How flat is replacing fat in AWS data center networks 25d ago A new network architecture called RNG uses passive optical ShuffleBoxes and quasi-random wiring to cut routers by 69%, boost throughput by up to 33%, and reduce network energy consumption by 40% — and it's now the default for most new AWS d... Amazon Research Awards recipients announced 26d ago Awardees represent more than 49 universities in 11 countries. Recipients have access to Amazon public datasets, along with AWS AI/ML services and tools. 
Diverse reasoning traces teach LLMs to make better decisions 27d ago Amazon researchers introduce set-supervised fine-tuning (SSFT) and global forking policy optimization (GFPO) to train language models that generate diverse reasoning paths — boosting single-shot accuracy on AIME 2025 and LiveCodeBench bench... Making LLMs faster without sacrificing accuracy 38d ago By treating hidden size, MLP-to-attention ratio, and grouped-query attention as first-class variables in a Chinchilla-style scaling law, researchers identify model architectures that match LLaMA-3.2 accuracy while significantly improving se... Promptimus: Improving already good LLM prompts with zero manual engineering 39d ago By focusing on specific failure points and suggesting targeted solutions, a new automated prompt-engineering framework improves prompt performance without compromising existing functionality.
One Useful Thing
13d ago · 20 items
Apple Machine Learning Research
15d ago · 10 items
Introducing the Third Generation of Apple’s Foundation Models 15d ago Our next generation of Apple Intelligence is centered around our users, integrated deeply into our operating systems, and powered by a bold… IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026 26d ago Apple is presenting new research at the annual IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), which takes place in… VSAS-Bench: Real-Time Evaluation of Visual Streaming Assistant Models 32d ago Streaming vision-language models (VLMs) continuously generate responses given an instruction prompt and an online stream of input frames… EpiCache: Episodic KV Cache Management for Long-Term Conversation on Resource-Constrained Environments 35d ago Modern large language models (LLMs) extend context lengths to millions of tokens, enabling coherent, personalized responses grounded in long… BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning 43d ago Image captioning is one of the most fundamental tasks in computer vision. Owing to its open-ended nature, it has received significant… Large-Scale High-Quality 3D Gaussian Head Reconstruction from Multi-View Captures 46d ago We propose HeadsUp, a scalable feed-forward method for reconstructing high-quality 3D Gaussian heads from large-scale multi-camera setups… RVPO: Risk-Sensitive Alignment via Variance Regularization 46d ago Current critic-less RLHF methods aggregate multi-objective rewards via an arithmetic mean, leaving them vulnerable to constraint neglect:… Apple Workshop on Privacy-Preserving Machine Learning & AI 2026 46d ago At Apple, we believe privacy is a fundamental human right. 
As AI capabilities increase and become more integrated into people’s daily… Velox: Learning Representations of 4D Geometry and Appearance 46d ago We introduce a framework for learning latent representations of 4D objects which are descriptive, faithfully capturing object geometry and… What Matters in Practical Learned Image Compression 47d ago One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be…
Ahead of AI
16d ago · 20 items
NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale 19d ago New NVIDIA Research breakthroughs show how training at scale — across gripper types, driving scenarios and virtual worlds — creates AI that generalizes to diverse applications. NVIDIA Enables the Next Era Of Physical AI Research With Agent Skills For Autonomous Vehicles, Robotics And Vision AI 19d ago New physical AI agent skills, powered by NVIDIA Cosmos 3, help researchers accelerate data generation, simulation, policy training and evaluation for autonomous system development. NVIDIA Research Advances Robotics From Simulation to the Real World 25d ago Featured at the International Conference on Robotics and Automation, eight new NVIDIA Research papers show how robots trained in simulation are moving into the real world. NVIDIA Launches Earth-2 Family of Open Models — the World’s First Fully Open, Accelerated Set of Models and Tools for AI Weather 147d ago NVIDIA Earth-2 makes weather AI accessible worldwide at every stage — from processing initial observation data to generating 15-day global forecasts or local storm forecasts. At NeurIPS, NVIDIA Advances Open Model Development for Digital and Physical AI 203d ago NVIDIA releases new AI tools for speech, safety and autonomous driving — including NVIDIA DRIVE Alpamayo-R1, the world’s first open industry-scale reasoning vision language action model for mobility — and a new independent benchmark recogni... How Do You Teach an AI Model to Reason? With Humans 299d ago NVIDIA’s data factory team creates the foundation for AI models like Cosmos Reason, which today topped the physical reasoning leaderboard on Hugging Face. NVIDIA Research Shapes Physical AI 315d ago AI and graphics research breakthroughs in neural rendering, 3D generation and world simulation power robotics, autonomous vehicles and content creation. 
NVIDIA Research Showcases the Future of Robotics at RSS 367d ago At this year’s Robotics: Science and Systems conference, NVIDIA Research is presenting work that advances robot learning across simulation, real-world transfer and decision-making. NVIDIA Scores Consecutive Win for End-to-End Autonomous Driving Grand Challenge at CVPR 376d ago NVIDIA was today named an Autonomous Grand Challenge winner at the Computer Vision and Pattern Recognition (CVPR) conference, held this week in Nashville, Tennessee. The announcement was made at the Embodied Intelligence for Autonomous Syst... NVIDIA Research Casts New Light on Scenes With AI-Powered Rendering for Physical AI Development 376d ago DiffusionRenderer introduces a neural rendering technique that can be applied to content generation and editing for creative fields — and to synthetic data generation for autonomous vehicles and robotics development.
Lex Fridman Podcast
24d ago · 20 items
#497 – Biggest Mysteries in Physics: Antimatter, Dark Energy & ToE – Don Lincoln 24d ago Don Lincoln is a particle physicist at Fermilab who has spent decades working at the frontiers of high energy physics. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep497-sc See below for timestamps, and ... #496 – FFmpeg: The Incredible Technology Behind Video on the Internet 47d ago Jean-Baptiste Kempf is lead developer of VLC and president of VideoLAN. Kieran Kunhya is a longtime FFmpeg contributor, codec engineer, and the person behind the now-infamous FFmpeg account on X. Thank you for listening ❤ Check out our spon... #495 – Vikings, Ragnar, Berserkers, Valhalla & the Warriors of the Viking Age 74d ago Lars Brownworth is a historian, teacher, podcaster, and author specializing in Viking history, medieval Europe, and the Byzantine Empire. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep495-sc See below f... #494 – Jensen Huang: NVIDIA – The $4 Trillion Company & the AI Revolution 91d ago Jensen Huang is the co-founder and CEO of NVIDIA, the world’s most valuable company and the engine powering the AI computing revolution. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep494-sc See below fo... #493 – Jeff Kaplan: World of Warcraft, Overwatch, Blizzard, and Future of Gaming 103d ago Jeff Kaplan is a legendary Blizzard game designer of World of Warcraft and Overwatch, now preparing to launch a new game, The Legend of California, from his new studio Kintsugiyama – available to wishlist on Steam today, with alpha later in... #492 – Rick Beato: Greatest Guitarists of All Time, History & Future of Music 114d ago Rick Beato is a music educator, interviewer, producer, songwriter, and a true multi-instrument musician, playing guitar, bass, cello & piano. His incredible YouTube channel celebrates great musicians & musical ideas, and helps millions of p... 
#491 – OpenClaw: The Viral AI Agent that Broke the Internet – Peter Steinberger 131d ago Peter Steinberger is the creator of OpenClaw, an open-source AI agent framework that’s the fastest-growing project in GitHub history. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep491-sc See below for t... #490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI 142d ago Nathan Lambert and Sebastian Raschka are machine learning researchers, engineers, and educators. Nathan is the post-training lead at the Allen Institute for AI (Ai2) and the author of The RLHF Book. Sebastian Raschka is the author of Build ... #489 – Paul Rosolie: Uncontacted Tribes in the Amazon Jungle 160d ago Paul Rosolie is a naturalist, explorer, author of a new book titled Junglekeeper, and is someone who has dedicated his life to protecting the Amazon rainforest. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponso... #488 – Infinity, Paradoxes that Broke Mathematics, Gödel Incompleteness & the Multiverse – Joel David Hamkins 173d ago Joel David Hamkins is a mathematician and philosopher specializing in set theory, the foundations of mathematics, and the nature of infinity, and he’s the #1 highest-rated user on MathOverflow. He is also the author of several books, includ...
Lex Fridman
24d ago · 15 items
Biggest Mysteries in Physics: Antimatter, Dark Energy & ToE - Don Lincoln | Lex Fridman Podcast #497 24d ago FFmpeg: The Incredible Technology Behind Video on the Internet | Lex Fridman Podcast #496 47d ago Vikings, Ragnar, Berserkers, Valhalla & the Warriors of the Viking Age | Lex Fridman Podcast #495 74d ago Jensen Huang: NVIDIA - The $4 Trillion Company & the AI Revolution | Lex Fridman Podcast #494 91d ago Jeff Kaplan: World of Warcraft, Overwatch, Blizzard, and Future of Gaming | Lex Fridman Podcast #493 103d ago Lex trains w/ Khabib Nurmagomedov | Exclusive Footage at UFC PI 111d ago Rick Beato: Greatest Guitarists of All Time, History & Future of Music | Lex Fridman Podcast #492 114d ago Khabib vs Lex: Training with Khabib | FULL EXCLUSIVE FOOTAGE 117d ago OpenClaw: The Viral AI Agent that Broke the Internet - Peter Steinberger | Lex Fridman Podcast #491 131d ago State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 142d ago
deepcognition.ai
34d ago · 10 items
Yannic Kilcher
108d ago · 15 items
inFERENCe
117d ago · 15 items
The Future of Software 117d ago The world of software is undergoing a shift not seen since the advent of compilers in the 1970s. Compilers were the original vibe coding: they automatically generate complex machine code that human programmers had to manually write before. ... Deep Learning is Powerful Because It Makes Hard Things Easy - Reflections 10 Years On 142d ago Ten years ago this week, I wrote a post called "Deep Learning is Easy - Learn Something Harder". The post blew up, top spot on HackerNews. Needless to say, it didn't age well. Discrete Diffusion: Continuous-Time Markov Chains 396d ago A tutorial explaining some intuitions behind continuous time Markov chains for machine learners interested in discrete diffusion models. We may finally crack Maths. But should we? 1110d ago Automating mathematical theorem proving has been a long standing goal of artificial intelligence and indeed computer science. It's one of the areas I became very interested in recently. This is because I feel we may have the ingredients nee... Mortal Komputation: On Hinton's argument for superhuman AI. 1119d ago Last week in Cambridge was Hinton bonanza. He visited the university town where he was once an undergraduate in experimental psychology, and gave a series of back-to-back talks, Q&A sessions, interviews, dinners, etc. He was stopped on the ... Autoregressive Models, OOD Prompts and the Interpolation Regime 1180d ago A few years ago I was very much into maximum likelihood-based generative modeling and autoregressive models (see this, this or this). More recently, my focus shifted to characterising inductive biases of gradient-based optimization focussin... We May be Surprised Again: Why I take LLMs seriously. 1188d ago "Deep Learning is Easy, Learn something Harder" - I proclaimed in one of my early and provocative blog posts from 2016. While some observations were fair, that post is now evidence that I clearly underestimated the impact simple techniques ... 
Implicit Bayesian Inference in Large Language Models 1572d ago This intriguing paper kept me thinking long enough for me to decide it's time to resurrect my blogging (I started writing this during ICLR review period, and realised it might be a good idea to wait until that's concluded) * Sang Michael ... Eastern European Guide to Writing Reference Letters 1575d ago Excruciating. One phrase I often use to describe what it's like to read reference letters for Eastern European applicants to PhD and Master's programs in Cambridge. Even objectively outstanding students often receive dull, short, factual, a... Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation Models 1838d ago This post is written with my PhD student and now guest author Patrik Reizinger [https://twitter.com/rpatrik96] and is part 4 of a series of posts on causal inference: * Part 1: Intro to causal inference and do-calculus [https://www.inferenc...
Damian Bogunowicz - dtransposed
123d ago · 10 items
The Gradient
124d ago · 15 items
Andrej Karpathy blog
131d ago · 10 items
Salmon Run
141d ago · 20 items
Book Review: Software Engineering for Data Scientists 141d ago As a Software Engineer (backend Web Development then Search) turned Data Scientist, I was particularly interested in what the book Software ... Book Review: Transformers In Action 163d ago The Attention Is All You Need paper proposed the Transformer Architecture as an improvement to the dominant encoder-decoder models of the ... Trip Report: PyData Global 2025 178d ago I attended PyData Global 2025 earlier this month. I had hoped to write this up earlier, but I've been busy, so only now getting the time Ch... Book Review: Time Series Forecasting using Foundation Models 253d ago As someone who primarily works in NLP and Search in the Health Domain, I don't have much use for Time Series. However, while exploring the F... Book Review: Statistics every Programmer Needs 275d ago I recently read Statistics every Programmer Needs by Gary Sutton. I am probably a good target audience for the book since I used to be a so... Book Review: Hands-On Artificial Intelligence for IoT 359d ago For those in similar professional circles as I am in, i.e. looking forward into the Generative AI space, yet with one foot pragmatically and... Book Review: Essential Graph RAG 372d ago Coming from a background of Knowledge Graph (KG) backed Medical Search, I don't need to be convinced about the importance of manually curate... Packaging ML Pipelines from Experiment to Deployment 538d ago As an ML Engineer, we are generally tasked with solving some business problem with technology. Typically it involves leveraging data assets ... Trip Report - PyData Global 2024 561d ago I attended PyData Global 2024 last week. It's a virtual conference, so I was able to attend it from the comfort of my home, although presenta... Using Knowledge Graphs to enhance Retrieval Augmented Generation 625d ago Retrieval Augmented Generation (RAG) has become a popular approach to harness LLMs for question answering using your own corpus of data. Typ...
VITALab
168d ago · 10 items
Towards Brain MRI Foundation Models for the Clinic: Findings from the FOMO25 Challenge 168d ago 1. Motivation Brain Latent Progression Individual-based spatiotemporal disease progression on 3D Brain MRIs via latent diffusion 299d ago This article aims at reviewing an Alzheimer’s spatiotemporal disease progression predictive model called Brain Latent Progression (BrLP). All in all, this is ... A Survey of popular LLM Evaluation Metrics 307d ago Large Language Models (LLMs) are increasingly applied to critical domains such as medical report generation, where accuracy and trust are essential. Evaluati... Open-Source Large Language Models in Radiology: A Review and Tutorial for Practical Research and Clinical Deployment 316d ago Open-Source Large Language Models in Radiology MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation 385d ago MemSAM Simplifying Deep Temporal Difference Learning 442d ago tl;dr The authors propose PQN, a simplified deep online Q-Learning that uses very small replay buffers. Normalization and parallelized sampling from vectoriz... EchoPrime: Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation 456d ago Objective EchoPrime is a foundation model designed for comprehensive echocardiographic interpretation. Unlike previous models that use single views or static... DeepSeek-V3 Technical Report 497d ago DeepSeek-V3 Variational Autoencoders for Generating Synthetic Tractography-Based Bundle Templates in a Low-Data Setting 525d ago Highlights Implicit neural representations 553d ago Implicit neural networks
JMLR
173d ago · 20 items
Transformers Can Overcome the Curse of Dimensionality: A Theoretical Study from an Approximation Perspective 173d ago Online Bernstein-von Mises theorem 173d ago Covariate-dependent Hierarchical Dirichlet Processes 173d ago DCatalyst: A Unified Accelerated Framework for Decentralized Optimization 173d ago Boosted Control Functions: Distribution Generalization and Invariance in Confounded Models 173d ago Contrasting Local and Global Modeling with Machine Learning and Satellite Data: A Case Study Estimating Tree Canopy Height in African Savannas 173d ago A Symplectic Analysis of Alternating Mirror Descent 173d ago Two-way Node Popularity Model for Directed and Bipartite Networks 173d ago Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization 173d ago Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood 173d ago
Lil'Log
418d ago · 20 items
Why We Think 418d ago Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute (Graves et al. 2016, Ling, et al. 2017, Cobbe et al. 2021) and Chain-of-thought (CoT) (Wei et al. 2022, Nye et al. 2021), ... Reward Hacking in Reinforcement Learning 572d ago Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing the intended task. Reward hacking exists because RL enviro... Extrinsic Hallucinations in LLMs 716d ago Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucination has been somewhat generalized to cases when the model makes mistakes. Here,... Diffusion Models for Video Generation 802d ago Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video generation. The task itself is a superset of the image case, since an ima... Thinking about High-Quality Human Data 869d ago [Special thank you to Ian Kivlichan for many useful pointers (E.g. the 100+ year old Nature paper “Vox populi”) and nice feedback. 🙏 ] High-quality data is the fuel for modern deep learning model training. Most of the task-specific lab... Adversarial Attacks on LLMs 972d ago The use of large language models in the real world has been strongly accelerated by the launch of ChatGPT. We (including my team at OpenAI, shoutout to them) have invested a lot of effort to build default safe behavior into the model during the ... LLM Powered Autonomous Agents 1096d ago Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concept demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The potentiality of LLM extends beyond genera...
Prompt Engineering 1196d ago Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empirical science and the effect of prompt eng... The Transformer Family Version 2.0 1243d ago Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a big refactoring and enrichment of that 2020 post — restructure the hierarchy of sections an... Large Transformer Model Inference Optimization 1259d ago [Updated on 2023-01-24: add a small section on Distillation.] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. They are powerful but very expensive to train and use. The extremely high inferenc...
Datumbox
422d ago · 20 items
Jay Alammar
454d ago · 10 items
Moving To Substack 454d ago I’m freezing this blog and starting to post on my Substack instead. The authoring experience is much more convenient for me there. Please follow me there, and check out The Illustrated DeepSeek R-1 if you haven’t yet. And check out our How ... Generative AI and AI Product Moats 1141d ago Here are eight observations I’ve shared recently on the Cohere blog and videos that go over them: Article: What’s the big deal with Generative AI? Is it the future or the present? Article: AI is Eating The World Remaking Old Computer Graphics With AI Image Generation 1269d ago Can AI Image generation tools make re-imagined, higher-resolution versions of old video game graphics? Over the last few days, I used AI image generation to reproduce one of my childhood nightmares. I wrestled with Stable Diffusion, Dall-E ... The Illustrated Stable Diffusion 1358d ago Translations: Chinese, Vietnamese. (V2 Nov 2022: Updated images for more precise description of forward diffusion. A few more images in this version) AI image generation is the most recent AI capability blowing people’s minds (mine included... Applying massive language models in the real world with Cohere 1569d ago A little less than a year ago, I joined the awesome Cohere team. The company trains massive language models (both GPT-like and BERT-like) and offers them as an API (which also supports finetuning). Its founders include Google Brain alums in... The Illustrated Retrieval Transformer 1632d ago Discussion: Discussion Thread for comments, corrections, or any feedback. Translations: Korean, Russian Summary: The latest batch of language models can be much smaller yet achieve GPT-3 like performance by being able to query a database or... Explainable AI Cheat Sheet 1876d ago Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their predictions.
I introduce the cheat sheet in this brief video: Finding the Words to Say: Hidden State Visualizations for Language Models 1981d ago By visualizing the hidden state between a model's layers, we can get some clues as to the model's Interfaces for Explaining Transformer Language Models 2014d ago Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries generated by a language model Tap or hover over the output tokens: Explorable #2: ... How GPT3 Works - Visualizations and Animations 2157d ago Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russian, Turkish The tech world is abuzz with GPT3 hype. Massive language models (lik...
Andrej Karpathy
480d ago · 15 items
Chip Huyen
523d ago · 10 items
