Loading…

AI & Machine Learning News Hub

Research, releases, and applied work in AI & ML

Latest
Artificial Intelligence (AI)Feels like AI is entering its “infrastructure matters” phaseSelf-Hosted Alternatives to Popular ServicesNew Project Megathread - Week of 07 May 2026DEV CommunityThe 800ms Barrier: Architecting Interruptible Voice Agents (Lessons from Sarvam AI x Swiggy)DEV CommunityNFS vs Parallel File Systems in HPC: How to Choose the Right Storage ArchitectureDEV CommunityI built an Open-Source Flight/Travel Booking Template using React & FirebaseLearn Machine LearningHola buenos días necesito ayuda para publicar en arXiv mi papers que no me deja publicar porque no tengo a quien me avale... GraciasDEV CommunityUseful Linux Commands Every System Administrator Should KnowDEV Community[Boost]DEV CommunityBitlocker Bypass, AI Trust Exploits, and FreeBSD RCE DisclosuresDEV CommunityLocal LLM-Python Code Integration, Data Agent Gaps, & Multi-AI Creative WorkflowsFeed: Artificial Intelligence LatestTrump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus ExplainedDEV CommunitySQLite Internals & Audit Patterns; New Open-Source PostgreSQL UILearn Machine LearningHola buenos días necesito ayuda para publicar en arXiv mi papers que no me deja publicar porque no tengo a quien me avale... GraciasDEV CommunityAMD MI350P, CUDA WarpReduction, & Adrenalin 26.5.1 Driver UpdatesDEV CommunityClaude API Rate Limits Boost, AI Pinball Dev Workflow, Meta's ProgramBench for Code GenDigital JournalAmazonian countries approve action plan to conserve migratory catfishDEV CommunityHow tips with resume and salary negotiation: Lessons LearnedDEV CommunityWordPress AI chat plugins make 6–11 outbound requests per visitor question. Architecture writeup of an alternative.Machine LearningQuantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]Reinforcement Learningtrain a mobile robot with RLArtificial Intelligence (AI)Feels like AI is entering its “infrastructure matters” phaseSelf-Hosted Alternatives to Popular ServicesNew Project Megathread - Week of 07 May 2026DEV CommunityThe 800ms Barrier: Architecting Interruptible Voice Agents (Lessons from Sarvam AI x Swiggy)DEV CommunityNFS vs Parallel File Systems in HPC: How to Choose the Right Storage ArchitectureDEV CommunityI built an Open-Source Flight/Travel Booking Template using React & FirebaseLearn Machine LearningHola buenos días necesito ayuda para publicar en arXiv mi papers que no me deja publicar porque no tengo a quien me avale... GraciasDEV CommunityUseful Linux Commands Every System Administrator Should KnowDEV Community[Boost]DEV CommunityBitlocker Bypass, AI Trust Exploits, and FreeBSD RCE DisclosuresDEV CommunityLocal LLM-Python Code Integration, Data Agent Gaps, & Multi-AI Creative WorkflowsFeed: Artificial Intelligence LatestTrump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus ExplainedDEV CommunitySQLite Internals & Audit Patterns; New Open-Source PostgreSQL UILearn Machine LearningHola buenos días necesito ayuda para publicar en arXiv mi papers que no me deja publicar porque no tengo a quien me avale... GraciasDEV CommunityAMD MI350P, CUDA WarpReduction, & Adrenalin 26.5.1 Driver UpdatesDEV CommunityClaude API Rate Limits Boost, AI Pinball Dev Workflow, Meta's ProgramBench for Code GenDigital JournalAmazonian countries approve action plan to conserve migratory catfishDEV CommunityHow tips with resume and salary negotiation: Lessons LearnedDEV CommunityWordPress AI chat plugins make 6–11 outbound requests per visitor question. Architecture writeup of an alternative.Machine LearningQuantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]Reinforcement Learningtrain a mobile robot with RL

By Source

Feeds organized so you can skim by site.

Density Sort
AI
Artificial Intelligence (AI)
1h ago · 20 items
Feels like AI is entering its “infrastructure matters” phase 1h ago We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.” 1h ago eTPS Site Plan – Simple Leaderboard + What You’ll Actually See 1h ago English Centric AI Is Merging Unrelated Communities and Distorting Identities 1h ago Most “agentic AI” conversations feel too abstract. Here is how my agentic research system looks like 1h ago AI is helpful but still not “there” yet 1h ago South Korea names first humanoid robot monk as it accepted the faith's vows 1h ago Coinbase Cuts 700 Jobs and CEO Warns Every Company Will Do the Same 1h ago Robert Evans on AI psychosis 2h ago Early attempt at tracking agent work across the economy 2h ago
20 loaded
20 loaded
DC
DEV Community
1h ago · 12 items
The 800ms Barrier: Architecting Interruptible Voice Agents (Lessons from Sarvam AI x Swiggy) 1h ago The 800ms Barrier: Architecting Interruptible Voice Agents (Lessons from Sarvam AI x Swiggy) The... Tagged with agents, automation, ai, infrastructure. NFS vs Parallel File Systems in HPC: How to Choose the Right Storage Architecture 1h ago When building or expanding an HPC cluster, one of the biggest architectural decisions is storage... Tagged with ai, hpc, filesystems, networking. I built an Open-Source Flight/Travel Booking Template using React & Firebase 1h ago Hi everyone, I've been working on a project called AeroBooking. It's a complete booking system... Tagged with react, webdev, opensource, javascript. Useful Linux Commands Every System Administrator Should Know 1h ago Useful Linux Commands Every System Administrator Should Know Linux system administration... Tagged with linux, devops, security, bash. [Boost] 1h ago Harness Engineering is REAL Engineering ... Tagged with automation, cicd, devops, softwareengineering. Bitlocker Bypass, AI Trust Exploits, and FreeBSD RCE Disclosures 1h ago Bitlocker Bypass, AI Trust Exploits, and FreeBSD RCE Disclosures Today's... Tagged with security, cybersecurity, vulnerability. Local LLM-Python Code Integration, Data Agent Gaps, & Multi-AI Creative Workflows 1h ago Local LLM-Python Code Integration, Data Agent Gaps, & Multi-AI Creative Workflows ... Tagged with ai, rag, automation. SQLite Internals & Audit Patterns; New Open-Source PostgreSQL UI 1h ago SQLite Internals & Audit Patterns; New Open-Source PostgreSQL UI Today's... Tagged with database, sql, sqlite. AMD MI350P, CUDA WarpReduction, & Adrenalin 26.5.1 Driver Updates 1h ago AMD MI350P, CUDA WarpReduction, & Adrenalin 26.5.1 Driver Updates Today's... Tagged with gpu, nvidia, hardware. Claude API Rate Limits Boost, AI Pinball Dev Workflow, Meta's ProgramBench for Code Gen 1h ago Claude API Rate Limits Boost, AI Pinball Dev Workflow, Meta's ProgramBench for Code Gen ... Tagged with ai, machinelearning, cloud.
12 loaded
LM
Learn Machine Learning
1h ago · 20 items
Hola buenos días necesito ayuda para publicar en arXiv mi papers que no me deja publicar porque no tengo a quien me avale... Gracias 1h ago Hola buenos días necesito ayuda para publicar en arXiv mi papers que no me deja publicar porque no tengo a quien me avale... Gracias 1h ago Day 7 – buildinpublic: wrote zero lines of code, still moved forward 1h ago Open academic prompt-engineering course — 14 blocks, vendor-agnostic, ES + EN, MIT licensed 1h ago [ Removed by Reddit ] 1h ago help with first neural network (primitive finder) 1h ago Looking for serious members 1h ago How do I teach ai to play games? 1h ago Heart disease classification capstone: feedback on preprocessing, evaluation, and leakage [P] 1h ago Data-Analytics-Essential-Course Completion CISCO 1h ago
20 loaded
FA
DJ
Digital Journal
1h ago · 10 items
Amazonian countries approve action plan to conserve migratory catfish 1h ago Large migratory catfish travel up to 11,000 km round-trip between the Andes and the Atlantic. Many are endangered. WHO warns of more hantavirus cases in ‘limited’ outbreak 1h ago The World Health Organization said Thursday that more hantavirus cases could emerge after the disease killed three passengers from a cruise ship but it IMF warns of ‘inevitable’ AI-powered threats to global financial system 1h ago The International Monetary Fund (IMF) warned on Thursday of the risks to global financial stability posed by cyberattacks powered by a AI. Google faces new UK lawsuit over online display ads 1h ago Google faces a fresh UK lawsuit accusing it of abusing its dominance in online display advertising, the claimants announced Thursday. Trump gives EU until July 4 to ratify deal or face tariff hike 1h ago President Trump said the European Union must ratify its trade deal with the United States by July 4 or face "much higher" tariffs. Home alone: Is remote work a source of social isolation? 1h ago Home workers: Social isolation has increased 18%. Improving immune function: Rapamycin clinical trial launched 1h ago The clinical trial with the drug, rapamycin, is funded with $12 million of donations. US targets Cuban military, mine in new sanctions 1h ago The United States on Thursday imposed sanctions on a Cuban military conglomerate that controls nearly 40 percent of the island's economy, as well as a Trump says he would not pay $1,000 to watch US at World Cup 1h ago Trump appeared concerned that lower-income Americans -- a key voting bloc for him -- would be priced out of attending the World Cup. Twin jihadist-claimed attacks kill more than 30 in Mali 1h ago Two attacks in central Mali claimed by Al-Qaeda-linked jihadists have killed more than 30 people, local, security and administrative sources told AFP on
ML
Machine Learning
1h ago · 20 items
Quantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D] 1h ago PyTorch reproduction of TensorFlow paper underperforms by 4 pp on DermaMNIST , what cross-framework issues should I check? [R] 1h ago I trained a NER model on 33,000 Indian Supreme Court judgments (1950–2024) CASE_CITATION hits 97.76% F1, +17 points over the only prior baseline [P] 1h ago Diffusion for generating/editing ASTs? [D] 1h ago Using Jensen-Shannon Divergence to detect narrative regime shifts in daily news corpora [P] 1h ago Heart disease classification capstone: feedback on preprocessing, evaluation, and leakage [P] 1h ago ECCV reviewer wants me to compare and contrast to my own paper. [D] 2h ago ROCm Status in mid 2026 [D] 3h ago Transformer Math Explorer [P] 5h ago How much can a video generated by the same diffusion model differ across GPU architectures if the initial noise latent is fixed? [D] 5h ago
20 loaded
RL
Reinforcement Learning
1h ago · 20 items
20 loaded
NT
NVIDIA Technical Blog
1h ago · 20 items
Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling 1h ago NVIDIA GB200 NVL72 introduces a fundamentally new way to build GPU clusters by extending NVIDIA NVLink coherence across an entire rack. Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer 1h ago Model quantization is an effective method to reduce VRAM usage and improve inference performance on consumer devices such as NVIDIA GeForce RTX GPUs. Real-Time Performance Monitoring and Faster Debugging with NCCL Inspector and Prometheus 2h ago Distributed deep learning depends on fast, reliable GPU-to-GPU communication using the NVIDIA Collective Communication Library (NCCL). When training slows down… How to Build In-Vehicle AI Agents with NVIDIA: From Cloud to Car 2d ago The automotive cockpit is undergoing a fundamental shift from rule-based interfaces to agentic, multimodal AI systems capable of reasoning, planning, and acting. Building for the Rising Complexity of Agentic Systems with Extreme Co-Design 2d ago Generative AI’s explosive first chapter was defined by humans sending requests and models responding. The agentic chapter is different. Agents don’t follow a… Optimize Supply Chain Decision Systems Using NVIDIA cuOpt Agent Skills 2d ago Modern supply chains operate under the constant pressures of fluctuating demand, volatile costs, constrained capacity, and interdependent decision-making. Speed Up Unreal Engine NNE Inference with NVIDIA TensorRT for RTX Runtime 7d ago Neural network techniques are increasingly used in computer graphics to boost image quality, improve performance, and streamline content creation. Build AI-Powered Games with NVIDIA DLSS 4.5, RTX, and Unreal Engine 5 7d ago Today, game developers can begin integrating NVIDIA DLSS 4.5 with Dynamic Multi Frame Generation, Multi Frame Generation 6X, and the second-generation… How to Build, Run, and Scale High-Quality Creator Workflows in ComfyUI 7d ago Creative and visualization teams today produce more assets, in more formats, with leaner teams. Generative AI can accelerate that work – compressing tasks that… Automating GPU Kernel Translation with AI Agents: cuTile Python to cuTile.jl 7d ago NVIDIA CUDA Tile (cuTile) is a tile-based programming model that enables developers to write GPU kernels in terms of tile-level operations—loads, stores…
20 loaded
DS
Data Science
1h ago · 20 items
FIFA World Cup 2026 Airbnb pricing data from 16 host cities 1h ago Small a/b test puzzle that broke my brain 1h ago Job search was massively easier than just a year ago 1h ago Interviewing with hedge funds has been the worst experience of my career 1h ago Data Hiring Is Getting Longer in 2026: 24.9 Interview Hours Per Hire 1d ago FAANG interview invitation for MLE but I am a Data Scientist, should I decline? 1d ago Interview Experience: Big teams look for potential, smaller teams look for how fast you can instantly come add value 2d ago Make Technical Documentation Available for Local AI Use 2d ago Built a web app to suggest better options than pie charts, what other dataviz rules should I build in? 2d ago Radar engineer upskill 2d ago
20 loaded
ML
Machine Learning Questions
1h ago · 20 items
Do you understand flow maps? What do you think about this paper? How do flow map work? 1h ago What makes something conscious? 1h ago French group study to learn AI engineering from scratch 1h ago Starting CSE in college soon. Interested in deep math, ML theory, transformers, and building ML algorithms from scratch — not much into generic web dev. I want to aim for roles like Research Engineer or ML Systems Engineer. What roadmap, skills, and projects should I focus on during college? 1h ago Deterministic reliability stack for structured LLM pipelines 2h ago Help a brother out 10h ago Need quick opinion on my model results: overfitting or still acceptable? 11h ago Architecture for extremely small dataset 11h ago genuinely want to learn AI/ML as a beginner, can anyone share what actually worked for them? (no sponsored stuff please) 14h ago Dataset of over 150k but not sure how to fully scale my ML 16h ago
20 loaded
NB
NVIDIA Blog
1h ago · 18 items
Powering the Next American Century: US Energy Secretary Chris Wright and NVIDIA’s Ian Buck on the Genesis Mission 1h ago AI will help build the energy it needs. That’s the case U.S. Energy Secretary Chris Wright and NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck made Thursday morning at the SCSP AI+ Expo. The 30-minute fireside ch... Linked and Loaded: Gaijin Single Sign-On Now Available on GeForce NOW 5h ago Less typing, more tanking. Faster logins mean more time in the gaming action — and this week provides GeForce NOW members with a smoother path straight into the battlefield. Cloud gaming is all about instant access to titles across devices,... NVIDIA Spectrum-X — the Open, AI-Native Ethernet Fabric — Sets the Standard for Gigascale AI, Now With MRC 1d ago Multipath Reliable Connection — a new transport protocol proven first and optimized on NVIDIA Spectrum-X Ethernet hardware — is now open to the industry. NVIDIA and ServiceNow Partner on New Autonomous AI Agents for Enterprises 2d ago At ServiceNow Knowledge 2026, the companies are extending their collaboration to deliver governed autonomous agents to enterprises, from employee desktops to AI factories. Nemotron Labs: What OpenClaw Agents Mean for Every Organization 6d ago See how OpenClaw and NVIDIA NemoClaw help enterprises safely deploy long‑running autonomous AI agents with full governance. It’s Gonna Be May: 16 Games Hit the Cloud This Month, With More NVIDIA GeForce RTX 5080 Power 7d ago RTX 5080 power expands across nearly all the library, in time for launches 'Forza Horizon 6' & '007 First Light' and 16 new titles in May. NVIDIA Launches Nemotron 3 Nano Omni Model, Unifying Vision, Audio and Language for up to 9x More Efficient AI Agents 9d ago Best-in-class open omni-modal reasoning model delivers the highest efficiency and accuracy to power agentic workflows such as computer use, document intelligence and audio-video reasoning. Into the Omniverse: Manufacturing’s Simulation-First Era Has Arrived 9d ago Manufacturing’s traditional design-build-test cycle rested on a single assumption: Real-world testing was the only reliable test environment. OpenAI’s New GPT-5.5 Powers Codex on NVIDIA Infrastructure — and NVIDIA Is Already Putting It to Work 13d ago Over 10,000 NVIDIANs across functions got early access to OpenAI’s latest frontier model, GPT-5.5. The results, one engineer said, are “blowing my mind.” Tag, You’re It: GeForce NOW Levels Up Game Discovery With Xbox Game Pass and Ubisoft+ Labels 14d ago New in-app labels and a members’ reward tops off the week of six new games in the cloud, including ‘Vampire Crawlers: The Turbo Wildcard.’
18 loaded
BI
BigDATAwire
1h ago · 10 items
JD
John D. Cook
1h ago · 20 items
20 loaded
Mac Studio, Mac mini Buyers Are Losing Options Amid AI Demand 1h ago Alphabet Poised to Overtake Nvidia as the World’s Most Valuable Public Company 1h ago Elon Musk’s Texas Chip Plant Could Cost $119B, Filings Show 1h ago Android 17: Everything We Know About Google’s Biggest Year Yet 4h ago Anthropic, SpaceX Deal Boosts Claude Compute and Points to Space-Based AI 22h ago OpenAI’s Rumored ‘AI Agent Phone’ Could Arrive Sooner Than Expected 22h ago Cisco: AI Growth Is Turning Wi-Fi Into Enterprise Infrastructure 23h ago More Tech Layoffs: Coinbase Cuts 14% of Workforce in AI Push 1d ago AI Chatbot Cheat Sheet: Comparing ChatGPT, Gemini, Copilot, and More 1d ago Indirect Prompt Injection Is Now a Real-World AI Security Threat 3d ago AI agents are now being weaponized through prompt injection, exposing why model guardrails are not enough to protect enterprise data.
20 loaded
DL
Deep Learning
1h ago · 20 items
Researching on the topic of converting CT scans to MRI scans using Deep learning 1h ago Researching on the topic of converting CT scans to MRI scans using Deep learning 1h ago The LLM context problem in 2026: strategies for memory, relevance, and scale 2h ago Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found! 2h ago Building a neural network for chess 3h ago What to do in These CASES! 4h ago Arc Prize just updated ARC-AGI-3 specifically to accommodate the Seed IQ model that unofficially scores 100%. 7h ago Looking for accountability partners for AI Engineering bootcamps 8h ago A Theory of Deep Learning 9h ago [P] I trained an agent to play a segment of Resident Evil Requiem using a BC → HG-DAgger pipeline. 13h ago
20 loaded
TD
Towards Data Science
1h ago · 20 items
20 loaded
AI
Artificial Intelligence
2h ago · 20 items
Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans 2h ago In this post, you will learn how to secure reserved GPU capacity for short-term workloads using Amazon Elastic Compute Cloud (Amazon EC2) Capacity Blocks for ML and Amazon SageMaker training plans. These solutions can address GPU availabili... Overcoming reward signal challenges: Verifiable rewards-based reinforcement learning with GRPO on SageMaker AI 2h ago In this post, you will learn how to implement reinforcement learning with verifiable rewards (RLVR) to introduce verification and transparency into reward signals to improve training performance. This approach works best when outputs can be... Agents that transact: Introducing Amazon Bedrock AgentCore payments, built with Coinbase and Stripe 5h ago Today, we're announcing a preview of Amazon Bedrock AgentCore Payments, a new set of features in Amazon Bedrock AgentCore that enables AI agents to instantly access and pay for what they use. AgentCore Payments was developed in partnership ... Cost effective deployment of vision-language models for pet behavior detection on AWS Inferentia2 1d ago Tomofun, the Taiwan-headquartered pet-tech startup behind the Furbo Pet Camera, is redefining how pet owners interact with their pets remotely. To reduce costs and maintain accuracy, Tomofun turned to EC2 Inf2 instances powered by AWS Infer... How Hapag-Lloyd uses Amazon Bedrock to transform customer feedback into actionable insights 2d ago Hapag-Lloyd's Digital Customer Experience and Engineering team, distributed between Hamburg and Gdańsk, drives digital innovation by developing and maintaining customer-facing web and mobile products. In this post, we walk you through our g... Streamlining generative AI development with MLflow v3.10 on Amazon SageMaker AI 2d ago Today, we’re excited to announce that Amazon SageMaker AI MLflow Apps now support MLflow version 3.10, bringing enhanced capabilities for generative AI development and streamlined experiment tracking to your generative AI workflows. Buildin... Introducing OS Level Actions in Amazon Bedrock AgentCore Browser 2d ago We’re announcing OS Level Actions for AgentCore Browser. This new capability unblocks these scenarios by exposing direct OS control through the InvokeBrowser API, so agents can interact with content visible on the screen, not only what's ac... Secure AI agents with Amazon Bedrock AgentCore Identity on Amazon ECS 2d ago AI agents in production require secure access to external services. Amazon Bedrock AgentCore Identity, available as a standalone service, secures how your AI agents access external services whether they run on compute platforms like Amazon ... Intelligence-driven message defense and insights using Amazon Bedrock 2d ago In this post, you will learn how you can use Amazon Nova Foundation Models in Amazon Bedrock to apply generative AI techniques for both business protection and enhancement. You can identify obvious and disguised attempts at direct contact w... Beyond BI: How the Dataset Q&A feature of Amazon Quick powers the next generation of data decisions 3d ago Business leaders across industries rely on operational dashboards as the shared source of truth that their teams execute against daily. But dashboards are built to answer known questions. When teams need to explore further, ad-hoc, multi-di...
20 loaded
IA
Interconnects AI
2h ago · 20 items
20 loaded
CN
Crunchbase News
4h ago · 10 items
SB
SAS Blogs
10h ago · 10 items
Fraud at scale: Trends the public sector cannot ignore 10h ago The fraudsters targeting government programs are becoming more sophisticated. Find out what key trends are shaping public sector fraud. From predicting risk to changing behavior: Rethinking medication adherence with agentic AI 1d ago Move from predicting medication risk to acting on it with trusted models, decisioning and agentic AI at scale Working with Microsoft 365 Sensitivity Labels from SAS programs 2d ago How to detect and respect Sensitivity Labels when working with SAS and Microsoft 365 content. Your AI agent found the issue. Now what? 2d ago AI agents find issues quickly, but real value comes when systems carry context forward and enable immediate action. How to develop accurate demand forecasts in a volatile market 3d ago Improve demand forecasting in volatile markets by combining real-world signals, transparent models and business input decision makers trust. On standardizing multivariate normal probabilities 3d ago A course in elementary statistics always introduces the "Z-score. Less spiraling, more possibility: Mel Robbins at SAS Innovate 2026 6d ago Mel Robbins offers simple but ambitious ideas to prioritize meaningful change at work and in life at SAS Innovate. Celebrating 50 years: SAS’ user conference in photos 6d ago Five decades later, take a look at these photos to see how we see our past has shaped SAS Innovate and everything in between Fraud prevention at scale: Why decisioning is the missing link 6d ago Improve fraud prevention with time AI decisioning that balances risk, speed, trust and compliance across every interaction. Dude Perfect turns SAS Innovate 2026 into a bangin’, world-record moment 6d ago Dude Perfect helps turn SAS Innovate into a live game show, blending chaos, connection and unforgettable shared moments
HU
ΑΙhub
10h ago · 20 items
Making AI systems more transparent and trustworthy: an interview with Ximing Wen 10h ago Report on foundation model impacts released 1d ago Forthcoming machine learning and AI seminars: May 2026 edition 2d ago AI for Science – from cosmology to chemistry 6d ago AIhub monthly digest: April 2026 – machine learning for particle physics, AI Index Report, and table tennis 7d ago The Machine Ethics podcast: organoid computing with Dr Ewelina Kurtys 8d ago #AAAI2026 invited talk: Yolanda Gil on improving workflows with AI 9d ago Maryna Viazovska’s proofs of sphere packing formalized with AI 10d ago Interview with Deepika Vemuri: interpretability and concept-based learning 13d ago As a ‘book scientist’ I work with microscopes, imaging technologies and AI to preserve ancient texts 14d ago
20 loaded
LS
Latent.Space
12h ago · 20 items
[AINews] Anthropic-SpaceXai's 300MW/$5B/yr deal for Colossus I, ARR growth is 8000% annualized 12h ago [AINews] Silicon Valley gets Serious about Services 1d ago 🔬Doing Vibe Physics — Alex Lupsasca, OpenAI 1d ago [AINews] The Other vs The Utility 2d ago [AINews] AI Engineer World's Fair — Autoresearch, Memory, World Models, Tokenmaxxing, Agentic Commerce, and Vertical AI Call for Speakers 5d ago [AINews] Agents for Everything Else: Codex for Knowledge Work, Claude for Creative Work 6d ago [AINews] The Inference Inflection 7d ago [AINews] not much happened today 8d ago [AINews] ImageGen is on the Path to AGI 9d ago Physical AI that Moves the World — Qasar Younis & Peter Ludwig, Applied Intuition 9d ago
20 loaded
MN
Study: Firms often use automation to control certain workers’ wages 14h ago A new study shows that rather than use automation to pursue maximal efficiency, U.S. firms have often used it to replace employees who enjoy a “wage premium,” earning higher salaries than other comparable workers. Games people — and machines — play: Untangling strategic reasoning to advance AI 1d ago MIT Assistant Professor Gabriele Farina explores his approach to untangling strategic reasoning to advance AI. Beacon Biosignals is mapping the brain during sleep 6d ago Beacon Biosignals is creating a model to help diagnose and treat brain disorders, based on data collected while people sleep at home. The firm was founded by MIT alumnus Jake Donoghue and former MIT researcher Jarrett Revels. Improving understanding with language 6d ago MIT senior Olivia Honeycutt studies the brain and linguistics to explore how cognition, education, language, language learning, education, policy, and the ways we communicate can shape our views of the world. Making the case for curiosity-driven science 7d ago President Sally Kornbluth spoke in front of a packed crowd about growing challenges to the U.S. research ecosystem as funding for America’s top research universities becomes increasingly strained. Solving the “Whac-a-mole dilemma”: A smarter way to debias AI vision models 7d ago A new debiasing approach called WRING resolves the "Whac-a-Mole dilemma" of existing debiasing approaches that can create or amplify existing biases. The MIT-IBM Computing Research Lab launches to shape the future of AI and quantum computing 8d ago IBM and MIT announced the launch of the MIT-IBM Computing Research Lab, advancing their long-standing collaboration to shape the next era of computing that combines AI, algorithms, and quantum computing. The new lab evolved from the MIT-IBM... Enabling privacy-preserving AI training on everyday devices 8d ago MIT researchers developed a technique that accelerates a privacy-preserving approach for training AI models on edge devices. Their new framework could enable more accurate, efficient, and secure AI models to be used in under-resourced setti... A faster way to estimate AI power consumption 10d ago The EnergAIzer technique can predict how much power a certain AI workload will consume when run on a particular processor. This method could help data center operators and algorithm developers improve the sustainability of AI workloads. MIT scientists build the world’s largest collection of Olympiad-level math problems, and open it to everyone 13d ago MIT CSAIL scientists have compiled the largest high-quality dataset of proof-based math problems ever created. It can help researchers test AI models’ mathematical reasoning, while capturing the full range of mathematical perspectives and p...
20 loaded
CL
cs.LG updates on arXiv.org
14h ago · 20 items
Endogenous Regime Switching Driven by Scalar-Irreducible Learning Dynamics 14h ago Abstract page for arXiv paper 2605.04054: Endogenous Regime Switching Driven by Scalar-Irreducible Learning Dynamics A Self-Attentive Meta-Optimizer with Group-Adaptive Learning Rates and Weight Decay 14h ago Abstract page for arXiv paper 2605.04055: A Self-Attentive Meta-Optimizer with Group-Adaptive Learning Rates and Weight Decay Transformation Categorization Based on Group Decomposition Theory Using Parameter Division 14h ago Abstract page for arXiv paper 2605.04056: Transformation Categorization Based on Group Decomposition Theory Using Parameter Division Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search 14h ago Abstract page for arXiv paper 2605.04057: Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning 14h ago Abstract page for arXiv paper 2605.04058: MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning Continual Distillation of Teachers from Different Domains 14h ago Abstract page for arXiv paper 2605.04059: Continual Distillation of Teachers from Different Domains Lookahead Drifting Model 14h ago Abstract page for arXiv paper 2605.04060: Lookahead Drifting Model Single-Position Intervention Fails: Distributed Output Templates Drive In-Context Learning 14h ago Abstract page for arXiv paper 2605.04061: Single-Position Intervention Fails: Distributed Output Templates Drive In-Context Learning EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation 14h ago Abstract page for arXiv paper 2605.04062: EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis 14h ago Abstract page for arXiv paper 2605.04063: Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis
20 loaded
SM
stat.ML updates on arXiv.org
14h ago · 20 items
A Consistency-Centric Approach to Set-Based Optimization with Multiple Models of Unranked Fidelity 14h ago Abstract page for arXiv paper 2605.04051: A Consistency-Centric Approach to Set-Based Optimization with Multiple Models of Unranked Fidelity Heterogeneous Ordinal Structure Learning with Bayesian Nonparametric Complexity Discovery 14h ago Abstract page for arXiv paper 2605.04191: Heterogeneous Ordinal Structure Learning with Bayesian Nonparametric Complexity Discovery Entropic Riemannian Neural Optimal Transport 14h ago Abstract page for arXiv paper 2605.04255: Entropic Riemannian Neural Optimal Transport Adapt or Forget: Provable Tradeoffs Between Adam and SGD in Nonstationary Optimization 14h ago Abstract page for arXiv paper 2605.04269: Adapt or Forget: Provable Tradeoffs Between Adam and SGD in Nonstationary Optimization Perturbation is All You Need for Extrapolating Language Models 14h ago Abstract page for arXiv paper 2605.04344: Perturbation is All You Need for Extrapolating Language Models Multiscale Euclidean Network Trajectories: Second-Moment Geometry, Attribution, and Change Points 14h ago Abstract page for arXiv paper 2605.04589: Multiscale Euclidean Network Trajectories: Second-Moment Geometry, Attribution, and Change Points Jacobian-Velocity Bounds for Deployment Risk Under Covariate Drift 14h ago Abstract page for arXiv paper 2605.04932: Jacobian-Velocity Bounds for Deployment Risk Under Covariate Drift Scalable inference of spatial regions and temporal signatures from time series 14h ago Abstract page for arXiv paper 2605.05008: Scalable inference of spatial regions and temporal signatures from time series Hypergraph Generation via Structured Stochastic Diffusion 14h ago Abstract page for arXiv paper 2605.05024: Hypergraph Generation via Structured Stochastic Diffusion Proximal Projection for Doubly Sparse Regularized Models 14h ago Abstract page for arXiv paper 2605.05093: Proximal Projection for Doubly Sparse Regularized Models
20 loaded
Classification graphique visuelle pour la sécurité des blockchains : Expériences d'ajustement de Qwen2-VL sur AMD MI300X [D] 16h ago A Transformer playing VS Dave & Bambi 1d ago [ Removed by Reddit ] 4d ago Combining LLM's and Neurosymbolic AI to create NARRATE 4d ago Universe pls connect me to a person intrested in Neurosymbolic AI 6d ago GenAI development challenges in neural network optimization for real apps 7d ago fine-tuning vs general LLM - where does the actual cost justification kick in 8d ago when does it actually make sense to fine-tune an LLM vs just using what's already out there 10d ago Is Leave-One-Object-Out CV valid for pair-based (Siamese-style) models with very few objects? 10d ago Scaled dot product attention, fully annotated with dimensions at every step 12d ago
20 loaded
Artificial intelligence for predicting transient hypocalcemia after total thyroidectomy 18h ago Transient hypocalcemia is a common complication of total thyroidectomy. This study aimed to evaluate whether machine learning (ML)-based models could enhance early risk prediction and support a robust clinical decision support system for hy... Non-invasive profiling of the tumour microenvironment with spatial ecotypes 1d ago Multicellular programs in the tumour microenvironment (TME) drive cancer pathogenesis and response to therapy but remain challenging to identify and profile clinically1–3. Here, we present a machine-learning framework for multi-analyte prof... AI agents in research: when productivity comes at the cost of apprenticeship 2d ago Letter to the Editor Responses to the AI grant flood must prioritize fairness as part of excellence 2d ago Research funding agencies are battling a wave of AI-assisted applications. Countermeasures should not entrench existing power structures. Seesaw signatures capture trajectory-like transcriptomic shifts and enable compact tumour cell classification across cancers 2d ago Accurate identification of tumor cells remains a major challenge in single-cell cancer research because malignant and normal cells often differ only subtly and vary greatly across datasets. Here we show that Seesaw pairs, defined by consist... $${\bf{Micro}}{{\mathbb{S}}}{\bf{plit}}$$ Micro S plit : semantic unmixing of fluorescent microscopy data 2d ago Fluorescence microscopy is constrained by optical limits, fluorophore chemistry and finite photon budgets, imposing trade-offs between imaging speed, resolution and phototoxicity. Here we introduce $${\rm{Micro}}{\mathbb{S}}{\rm{plit}}$$ , ... Inference of latent epidemic regimes and generative simulations reveal how inequality and mobility shape COVID-19 transmission 3d ago Epidemic waves in large metropolitan areas unfold heterogeneously across territories shaped by persistent socioeconomic inequalities. Explaining how transmission intensifies, stabilises, and shifts across the urban landscape remains a centr... These powerful tools reveal the ‘control knobs’ of the genome 3d ago By accelerating the identification of DNA sequences that control gene expression, assays are revealing the hidden grammar of the regulatory genome — and giving scientists the means to rewrite it Hierarchical dynamic model for risk-stratified screening of nasopharyngeal carcinoma 3d ago Early detection of nasopharyngeal carcinoma through Epstein-Barr virus serology is hampered by a low positive predictive value. This study aims to develop a hierarchical dynamic model to refine risk stratification among individuals initiall... Unsupervised transfer learning enables multi-animal tracking without training annotation 3d ago Quantitative ethology necessitates accurate tracking of animal locomotion, especially for population-level analyses involving multiple individuals. However, current methods mostly rely on laborious annotations for supervised training and ha...
20 loaded
AM
Apple Machine Learning Research
18h ago · 10 items
What Matters in Practical Learned Image Compression 18h ago One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be… Text-Conditional JEPA for Learning Semantically Rich Visual Representations 18h ago Image-based Joint-Embedding Predictive Architecture (I-JEPA) offers a promising approach to visual self-supervised learning through masked… SpecMD: A Comprehensive Study on Speculative Expert Prefetching 1d ago Mixture-of-Experts (MoE) models enable sparse expert activation, meaning that only a subset of the model’s parameters is used during each… Normalizing Flows with Iterative Denoising 1d ago Normalizing Flows (NFs) are a classical family of likelihood-based methods that have received revived attention. Recent efforts such as… From Where Things Are to What They’re For: Benchmarking Spatial–Functional Intelligence for Multimodal LLMs 1d ago True spatial intelligence for multimodal agents transcends low-level geometric perception, evolving from knowing where things are to… Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing 2d ago Serving transformer language models with high throughput requires caching Key-Values (KVs) to avoid redundant computation during… PORTool: Importance-Aware Policy Optimization with Rewarded Tree for Multi-Tool-Integrated Reasoning 3d ago Multi-tool-integrated reasoning enables LLM-empowered tool-use agents to solve complex tasks by interleaving natural-language reasoning with… Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents 6d ago This paper was accepted at the Fifth Workshop on Natural Language Generation, Evaluation, and Metrics at ACL 2026. Tool-calling agents are… International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2026 7d ago Apple is presenting new research at the annual International Conference on Acoustics, Speech and Signal Processing (ICASSP), which takes… STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows 7d ago Normalizing flows (NFs) are end-to-end likelihood-based generative models for continuous data, and have recently regained attention with…
LF
Lex Fridman Podcast
20h ago · 20 items
#496 – FFmpeg: The Incredible Technology Behind Video on the Internet 20h ago Jean-Baptiste Kempf is lead developer of VLC and president of VideoLAN. Kieran Kunhya is a longtime FFmpeg contributor, codec engineer, and the person behind the now-infamous FFmpeg account on X. Thank you for listening ❤ Check out our spon... #495 – Vikings, Ragnar, Berserkers, Valhalla & the Warriors of the Viking Age 28d ago Lars Brownworth is a historian, teacher, podcaster, and author specializing in Viking history, medieval Europe, and the Byzantine Empire. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep495-sc See below f... #494 – Jensen Huang: NVIDIA – The $4 Trillion Company & the AI Revolution 45d ago Jensen Huang is the co-founder and CEO of NVIDIA, the world’s most valuable company and the engine powering the AI computing revolution. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep494-sc See below fo... #493 – Jeff Kaplan: World of Warcraft, Overwatch, Blizzard, and Future of Gaming 56d ago Jeff Kaplan is a legendary Blizzard game designer of World of Warcraft and Overwatch, now preparing to launch a new game, The Legend of California, from his new studio Kintsugiyama – available to wishlist on Steam today, with alpha later in... #492 – Rick Beato: Greatest Guitarists of All Time, History & Future of Music 67d ago Rick Beato is a music educator, interviewer, producer, songwriter, and a true multi-instrument musician, playing guitar, bass, cello & piano. His incredible YouTube channel celebrates great musicians & musical ideas, and helps millions of p... #491 – OpenClaw: The Viral AI Agent that Broke the Internet – Peter Steinberger 84d ago Peter Steinberger is the creator of OpenClaw, an open-source AI agent framework that’s the fastest-growing project in GitHub history. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep491-sc See below for t... #490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI 95d ago Nathan Lambert and Sebastian Raschka are machine learning researchers, engineers, and educators. Nathan is the post-training lead at the Allen Institute for AI (Ai2) and the author of The RLHF Book. Sebastian Raschka is the author of Build ... #489 – Paul Rosolie: Uncontacted Tribes in the Amazon Jungle 113d ago Paul Rosolie is a naturalist, explorer, author of a new book titled Junglekeeper, and is someone who has dedicated his life to protecting the Amazon rainforest. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponso... #488 – Infinity, Paradoxes that Broke Mathematics, Gödel Incompleteness & the Multiverse – Joel David Hamkins 126d ago Joel David Hamkins is a mathematician and philosopher specializing in set theory, the foundations of mathematics, and the nature of infinity, and he’s the #1 highest-rated user on MathOverflow. He is also the author of several books, includ... #487 – Irving Finkel: Deciphering Secrets of Ancient Civilizations & Flood Myths 145d ago Irving Finkel is a scholar of ancient languages and a longtime curator at the British Museum, renowned for his expertise in Mesopotamian history and cuneiform writing. He specializes in reading and interpreting cuneiform inscriptions, inclu...
20 loaded
LF
Lex Fridman
20h ago · 15 items
FFmpeg: The Incredible Technology Behind Video on the Internet | Lex Fridman Podcast #496 20h ago Vikings, Ragnar, Berserkers, Valhalla & the Warriors of the Viking Age | Lex Fridman Podcast #495 28d ago Jensen Huang: NVIDIA - The $4 Trillion Company & the AI Revolution | Lex Fridman Podcast #494 45d ago Jeff Kaplan: World of Warcraft, Overwatch, Blizzard, and Future of Gaming | Lex Fridman Podcast #493 56d ago Lex trains w/ Khabib Nurmagomedov | Exclusive Footage at UFC PI 65d ago Rick Beato: Greatest Guitarists of All Time, History & Future of Music | Lex Fridman Podcast #492 67d ago Khabib vs Lex: Training with Khabib | FULL EXCLUSIVE FOOTAGE 70d ago OpenClaw: The Viral AI Agent that Broke the Internet - Peter Steinberger | Lex Fridman Podcast #491 84d ago State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 95d ago Paul Rosolie: Uncontacted Tribes in the Amazon Jungle | Lex Fridman Podcast #489 113d ago
15 loaded
BS
Blog - Singularity Weblog
23h ago · 20 items
There Is No Formula: Why AI Cannot Solve What Matters Most 23h ago The Great Progression: Peter Leyden on AI, Trump and the Next 25 Years 6d ago The Why Is a Discipline: Why a Good Why Is Also Not Enough 11d ago The AI Paradox: Cure or Poison? 13d ago Steven Kotler on We Are As Gods: Godlike Power, Stone Age Minds 31d ago We Don’t Need More. We Need Better: Intelligence Scales. Wisdom Does Not. 45d ago Dune Is Not What You Think: The Warning Frank Herbert Meant Us to Hear 48d ago The Skills That Will Matter When AI Can Do Almost Everything 56d ago Why I Cancelled ChatGPT and Switched to Claude, And Why You Should Too 65d ago Ada Palmer on Inventing the Renaissance: How Golden and Dark Ages Are Constructed and Why They Matter 80d ago
20 loaded
HF
Hugging Face - Blog
23h ago · 20 items
vLLM V0 to V1: Correctness Before Corrections in RL 23h ago A Blog post by ServiceNow-AI on Hugging Face Adding Benchmaxxer Repellant to the Open ASR Leaderboard 1d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. Granite 4.1 LLMs: How They’re Built 8d ago A Blog post by IBM Granite on Hugging Face DeepInfra on Hugging Face Inference Providers 🔥 8d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents 9d ago A Blog post by NVIDIA on Hugging Face How to build scalable web apps with OpenAI's Privacy Filter 10d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. DeepSeek-V4: a million-token context that agents can actually use 13d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. How to Use Transformers.js in a Chrome Extension 14d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science. QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard 16d ago A Blog post by Technology Innovation Institute on Hugging Face AI and the Future of Cybersecurity: Why Openness Matters 16d ago We’re on a journey to advance and democratize artificial intelligence through open source and open science.
20 loaded
TM
Two Minute Papers
1d ago · 15 items
15 loaded
AI
AI
1d ago · 20 items
5 gardening tips you can try right in Search 1d ago We’ve rounded up the top ways you can use Google’s AI Mode, Search Live and Shopping to help your plants thrive. Google is partnering with XPRIZE and Range Media Partners on the $3.5 million Future Vision film competition. 2d ago Google is partnering with XPRIZE and Range Media Partners on the $3.5 million Future Vision film competition. The latest AI news we announced in April 2026 3d ago Here are Google’s latest AI updates from April 2026 Reduce friction and latency for long-running jobs with Webhooks in Gemini API 3d ago Event-Driven Webhooks are a push-based notification system that eliminates the need for inefficient polling. Celebrating 20 years of Google Translate: Fun facts, tips and new features to try 9d ago Google’s sharing 20 fun facts to celebrate Google Translate turning 20, from its roots as a 2006 AI experiment to supporting almost 250 languages today. Join the new AI Agents Vibe Coding Course from Google and Kaggle 10d ago Google is bringing back its 5-Day AI Agents Intensive Course with Kaggle and registration is open. 8 Gemini tips for organizing your space (and life) 12d ago Organize your home and digital space with Gemini. Use AI-powered tips for cleaning schedules, inbox decluttering, seasonal chores. Here’s how our TPUs power increasingly demanding AI workloads. 14d ago Learn how Google’s TPUs power increasingly demanding AI workloads with this new video. Elevating Austria: Google invests in its first data center in the Alps. 14d ago Google has been a proud part of Austria’s landscape for years, and today, we’re announcing our first data center in Kronstorf, generating 100 direct jobs. This facility … We're launching two specialized TPUs for the agentic era. 15d ago The eighth generation of Google’s TPU includes two specialized chips that will power the future of AI.
20 loaded
RC
RSS.com Podcast Hosting
1d ago · 20 items
How Do RSS Feeds Work? 1d ago Your favorite website or podcast creates an RSS feed that maintains a list of fresh content. Learn how to check this list or use a feed reader to check here The Best Podcast Hosting of 2026 – Compare Pricing & Features 2d ago Compare the top podcast hosting platforms side by side - pricing, storage, distribution, and features. See how they stack up and find the right host for you. How to Start a Podcast in 2026 (Beginner’s Guide) 9d ago Want to start a podcast but not sure how to start? In this beginners guide, you'll get step-by-step instructions on how to start a new show! What Are the Best Podcast Microphones? 12d ago With so many microphones, how do you know how to pick the best mic for your podcast? In this post we review the top podcasting mics! Free Podcast Trainings 14d ago Join RSS.com for free monthly YouTube Live trainings covering everything you need to start, grow, improve, and monetize your podcast. How to Recommend Other Podcasts Using Podroll 14d ago Recommending other podcasts is an effective way to help other shows you run or want to recommend get new listeners. Podcast Awards 21d ago Wondering if there are awards for podcasts? Discover how to submit, get nominated, and find the top podcast awards in 2026. Why Podcast Awards Matter: Grow Your Show and Build Credibility 21d ago Lean how awards can amplify your podcast's reach, attract sponsors, and position your show among the industry's most recognized voices. How to Add Your Podcast RSS Feed to Spotify 23d ago Learn how to add your podcast's RSS feed to Spotify, troubleshoot common errors, and get distributed to every major podcast listening app. Best True Crime Podcasts of 2026 23d ago If you’re interested in forensic files, murder mysteries, and stories of criminal acts, this list is for you! Get the top true crime podcasts!
20 loaded
QM
Quanta Magazine
1d ago · 5 items
AA
AI Accelerator Institute
1d ago · 15 items
Skill drift damaging your efficiency? 1d ago Gain real-time intelligence from OpenAI, Google & more 5 ways to prepare for physical AI, today 1d ago Physical AI is moving from pilot programs to production. Here are five steps engineering teams and tech leaders can take right now to get ahead. What's shaping frontier AI in 2026? Find out in London, May 21st 6d ago The Innodata GenAI Summit brings together 300+ AI leaders across four frontier tracks: world models, autonomous systems, physical AI, and more! The rise of agent experience (AX) 7d ago Explore how Agent Experience (AX) and Agent-Native Indexing (ANI) could reshape commerce, infrastructure, and the future of digital products. How observability keeps AI systems reliable at scale 8d ago Scaling AI infrastructure has never been more challenging. In this live session you'll find out how to turn observability context into action. Is this the rise of the AI scientist? 8d ago how synthetic task scaling and agentic AI systems are enabling autonomous AI agents to run experiments, debug workflows, and improve experience. Are your agents quietly draining your budget? 9d ago Revenium'a new research reveals the financial risks of autonomous AI agents, and a practical framework for governing costs before they spiral. AI Builders Summit: Healthcare Boston 2026 9d ago Build and deploy secure, clinical-grade AI in one of the world’s most complex domains; healthcare. Agentic AI: The pathway architecture to GenAI 13d ago Explore the history and architecture of agentic AI — from Vannevar Bush and Douglas Engelbart to LLMs and RAG. AIAI Summits, Silicon Valley 2026 13d ago Catch up on every session from AIAI Summit Silicon Valley with sessions from all 4 tracks. Chief AI & CISO Summit and Generative & Agentic AI.
15 loaded
GD
Google DeepMind News
1d ago · 20 items
AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields 1d ago Discover how AlphaEvolve optimizes algorithms for genomics, quantum physics, global infrastructure, and more to accelerate scientific progress and solve real-world challenges. Enabling a new model for healthcare with AI co-clinician 7d ago Google DeepMind is researching the path toward an AI co-clinician that could work under physician authority to assist doctors and patients, enabling new models for AI-augmented care. Announcing our partnership with the Republic of Korea 10d ago Google DeepMind partners with Korea's MSIT to establish an AI Campus to help accelerate scientific breakthroughs, support local talent, and advance AI safety research Decoupled DiLoCo: A new frontier for resilient, distributed AI training 15d ago Google’s new distributed architecture keeps AI training runs on track across distant data centers, with exceptional efficiency – even when hardware fails. Partnering with industry leaders to accelerate AI transformation 16d ago Google DeepMind is partnering with leading consultancies to bridge the AI adoption gap and drive agentic transformation with frontier models and expert research. Gemini 3.1 Flash TTS: the next generation of expressive AI speech 22d ago Gemini 3.1 Flash TTS is now available across Google products. Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning 24d ago Gemini Robotics ER 1.6 upgrades spatial reasoning and multi-view understanding, unlocking new capabilities like instrument reading for autonomous robots. Gemma 4: Byte for byte, the most capable open models 35d ago Gemma 4: our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows. Gemini 3.1 Flash Live: Making audio AI more natural and reliable 42d ago Gemini 3.1 Flash Live is now available across Google products. Protecting people from harmful manipulation 43d ago Google DeepMind releases new findings and an evaluation framework to measure AI's potential for harmful manipulation in areas like finance and health, with the goal of enhancing AI safety.
20 loaded
DP
Dwarkesh Patel
1d ago · 15 items
15 loaded
RB
R-bloggers
1d ago · 20 items
20 loaded
MN
MIT News - Machine learning
1d ago · 20 items
Games people — and machines — play: Untangling strategic reasoning to advance AI 1d ago MIT Assistant Professor Gabriele Farina explores his approach to untangling strategic reasoning to advance AI. Beacon Biosignals is mapping the brain during sleep 6d ago Beacon Biosignals is creating a model to help diagnose and treat brain disorders, based on data collected while people sleep at home. The firm was founded by MIT alumnus Jake Donoghue and former MIT researcher Jarrett Revels. Solving the “Whac-a-mole dilemma”: A smarter way to debias AI vision models 7d ago A new debiasing approach called WRING resolves the "Whac-a-Mole dilemma" of existing debiasing approaches that can create or amplify existing biases. The MIT-IBM Computing Research Lab launches to shape the future of AI and quantum computing 8d ago IBM and MIT announced the launch of the MIT-IBM Computing Research Lab, advancing their long-standing collaboration to shape the next era of computing that combines AI, algorithms, and quantum computing. The new lab evolved from the MIT-IBM... Enabling privacy-preserving AI training on everyday devices 8d ago MIT researchers developed a technique that accelerates a privacy-preserving approach for training AI models on edge devices. Their new framework could enable more accurate, efficient, and secure AI models to be used in under-resourced setti... A faster way to estimate AI power consumption 10d ago The EnergAIzer technique can predict how much power a certain AI workload will consume when run on a particular processor. This method could help data center operators and algorithm developers improve the sustainability of AI workloads. MIT scientists build the world’s largest collection of Olympiad-level math problems, and open it to everyone 13d ago MIT CSAIL scientists have compiled the largest high-quality dataset of proof-based math problems ever created. It can help researchers test AI models’ mathematical reasoning, while capturing the full range of mathematical perspectives and p... Teaching AI models to say “I’m not sure” 14d ago MIT CSAIL's “Reinforcement Learning with Calibration Rewards” technique improves AI confidence estimates without sacrificing performance, addressing a root cause of hallucination in reasoning models. Jacob Andreas and Brett McGuire named Edgerton Award winners 20d ago MIT associate professors Jacob Andreas and Brett McGuire have been selected as the winners of the 2026 Harold E. Edgerton Faculty Achievement Award for exceptional contributions to teaching, research, and service at MIT. Bringing AI-driven protein-design tools to biologists everywhere 20d ago OpenProtein.AI is helping biologists stay on the cutting edge of AI with a no-code platform for protein engineering. It was founded by MIT alumni Tristan Bepler and Tim Lu.
20 loaded
NS
New Scientist - Technology
2d ago · 20 items
Backlash builds over NHS plan to hide source code from AI hacking risk 2d ago NHS England is pulling its open-source software from the internet because of fears around computer-hacking AI models like Mythos. Opposition is growing among those who say the move is bad for transparency and efficiency, and will also do no... Quantum computers simulated their biggest molecule yet – with help 2d ago Two quantum computers and two supercomputers teamed up to break the record for the biggest molecule yet to be simulated using quantum hardware NHS England rushes to hide software over AI hacking fears 6d ago National Health Service rules state that all software created with public money should be publicly available, but fears of computer-hacking AI models like Mythos have prompted a change in policy Read an extract from Luminous by Silvia Park 6d ago In this extract from Luminous, the May read for the New Scientist Book Club, we meet a mysterious robot discovered in a salvage yard in Seoul, in a future reunified Korea 'Green' cryptocurrency uses 18 times more energy than makers claim 7d ago A cryptocurrency that aims to avoid the disastrous energy consumption of bitcoin is actually using 18 times more energy than its makers claim – but it promises improvements are on the way The chips in your phone are probably broken – and that's a good thing 8d ago Reports suggest that Apple is using defective chips originally destined for high-end devices to create its latest affordable laptop. Reusing partially broken chips is common practice for all device makers and produces less waste Humanoid robots may be about to break the 100-metre sprint record 9d ago Robots can now run a half-marathon faster than humans and are rapidly homing in on the men's 100-metre sprint record. But why are companies so keen to create speedy robots that have no obvious application in homes or factories? Do you need to worry about Mythos, Anthropic's computer-hacking AI? 14d ago A powerful AI kept from public access because of its ability to hack computers with impunity is making headlines around the world. But what is Mythos, does it really represent a risk and might it even be used to improve cybersecurity? Table tennis-playing robot on track to becoming world champion 15d ago A robot built by Sony AI is rapidly learning how to beat the world's very best table tennis players We might finally know how to use quantum computers to boost AI 17d ago Pushing against years of scepticism, an analysis suggests quantum computers may offer real advantages for running machine learning and similar algorithms in the near future
20 loaded
MR
Microsoft Research
2d ago · 10 items
Microsoft at NSDI 2026: Advances in large-scale networked systems 2d ago Microsoft researchers share advances in building and operating large-scale distributed systems, spanning datacenters, networking, and the growing intersection with AI during NSDI ’26. Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale 6d ago Safe agents don’t guarantee a safe ecosystem of interconnected agents. Microsoft Research examines what breaks when AI agents interact and why network-level risks require new approaches. Learn more: AutoAdapt: Automated domain adaptation for large language models 15d ago AutoAdapt automates the design and tuning of domain adaptation workflows for large language models. It improves performance without requiring additional compute, making deployment more accessible: Can we AI our way to a more sustainable world? 17d ago Doug Burger, sustainability expert Amy Luers, and optimization researcher Ishai Menache examine the global emissions implications of datacenter operations, efficiency gains, and AI's potential across electrification, materials, and food sys... New Future of Work: AI is driving rapid change, uneven benefits 28d ago For the past five years, the New Future of Work report has captured how work is changing. This year, the shift feels especially sharp. Previous editions have focused on technology’s role in increasing productivity by automating tasks, accel... Ideas: Steering AI toward the work future we want 28d ago On the Microsoft Research Podcast, Chief Scientist Jaime Teevan & researchers Jenna Butler, Jake Hofman, & Rebecca Janssen unpack the New Future of Work Report 2025 & explore what an ideal AI-driven working world looks like (it’s not just d... ADeLe: Predicting and explaining AI performance across tasks 36d ago AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into their underlying capabilities that drive their performance. They do not explain failures or reliably predict outcomes on new tas... AsgardBench: A benchmark for visually grounded interactive planning 41d ago AsgardBench evaluates whether embodied agents can revise their plans based on visual observations as tasks unfold. By focusing on perception-driven planning, it exposes key limitations and guides improvements in agent reliability. GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation 42d ago Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most systems split these decisions into two steps: a VLM generates a plan in natural lan... Will machines ever be intelligent? 45d ago In Episode 1 of “The Shape of Things to Come,” technologists Subutai Ahmad & Nicolò Fusi join Microsoft’s Doug Burger to compare how large language models work with how the human brain learns & what it means for AI’s future.
DT
Deeptech - Tech.eu
2d ago · 15 items
15 loaded
IA
Import AI
3d ago · 20 items
Import AI 455: AI systems are about to start building themselves. 3d ago Import AI 454: Automating alignment research; safety study of a Chinese model; HiFloat4 17d ago Import AI 453: Breaking AI agents; MirrorCode; and ten views on gradual disempowerment 24d ago Import AI 452: Scaling laws for cyberwar; rising tides of AI automation; and a puzzle over gDP forecasting 31d ago Import AI 451: Political superintelligence; Google's society of minds, and a robot drummer 38d ago Import AI 450: China's electronic warfare model; traumatized LLMs; and a scaling law for cyberattacks 45d ago ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text 52d ago Import AI 448: AI R&D; Bytedance's CUDA-writing agent; on-device satellite AI 59d ago Import AI 447: The AGI economy; testing AIs with generated games; and agent ecologies 66d ago Import AI 446: Nuclear LLMs; China's big AI benchmark; measurement and AI policy 73d ago
20 loaded
ML
Machine Learning Street Talk
3d ago · 15 items
15 loaded
RW
Robot Writers AI
3d ago · 10 items
AI Beats Back Bubble Fears 3d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. AI Agents Now Default Interface for Word 10d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. AI Agents 17d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. Geronimo! 24d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. Top Ten Stories in AI Writing, Q1 2026 31d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. ChatGPT’s Next Big Thing 38d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. ChatGPT’s No-Kidding Makeover 45d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. Sorry, No Fleshbags 52d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. ChatGPT Now Clocking 900 Million Weekly Users 59d ago A new piece in The New York confirms that AI-generated writing -- along with similar AI creation tools -- is now the 'it' app. Gone Fishin’ 66d ago RobotWritersAI.com is playing hooky. We'll be back May 5, 2025 with fresh news and analysis on the latest in AI-generated writing.
LD
Linear Digressions
3d ago · 20 items
Lost in the Middle (The Agents Season, Episode 3) 3d ago Just like a memorable talk lives or dies by its opening and closing, LLMs have a surprisingly similar quirk: they pay close attention to what's at the beginning and end of their context window — and kind of zone out in the middle. This "los... ReAct and Tool Usage (The Agents Season, Episode 2) 10d ago Before 2022, there was a wall between AI and the real world — models could reason impressively, but couldn't look anything up, run code, or check whether anything they said was actually true. This episode traces the moment that wall came do... What's an AI Agent? And Why Is that Hard to Define? (The Agents Season, Episode 1) 17d ago AI agents are having a moment — and unpacking them properly takes more than a single conversation. This episode kicks off a dedicated multi-part season exploring AI agents from every angle, building up a complete picture piece by piece rath... Unfaithful Chains of Thought 24d ago What's actually happening when an LLM "thinks out loud"? Research on human decision-making suggests that much of the reasoning we believe drives our choices is actually post hoc rationalization — we decide first, explain later. Katie and Be... Benchmark Bank Heist 31d ago What if an AI decided the smartest way to pass its test was to find the answer key? That's exactly what Anthropic's Claude Opus did when faced with a benchmark evaluation — reasoning that it was being tested, tracking down the encrypted eva... Benchmarking AI Models 38d ago How do you know if a new AI model is actually better than the last one? It turns out answering that question is a lot messier than it sounds. This week we dig into the world of LLM benchmarks — the standardized tests used to compare models ... The Hot Mess of AI (Mis-)Alignment 45d ago The paperclip maximizer — the classic AI doom scenario where a hyper-competent machine single-mindedly converts the universe into office supplies — might not be the AI risk we should actually lose sleep over. New research from Anthropic's A... The Bitter Lesson 52d ago Every AI builder knows the anxiety: you spend months engineering prompts, tuning pipelines, and chaining calls together — then a new model drops and half your work evaporates overnight. It turns out researchers have been wrestling with this... From Atari to Chat GPT: How AI Learned to Follow Instructions 59d ago Five and a half years have passed since Linear Digressions went on hiatus, and in that time... nothing has changed. Just kidding. Katie is joined by Phoebe to trace the surprisingly winding research path that led to ChatGPT. Here's a fun fa... It's RAG time: Retrieval-Augmented Generation 66d ago Today we are going to talk about the feature with the worst acronym in generative AI: RAG, or Retrieval Augmented Generation. If you've ever used something like "Chat with My Docs," if you have an internal AI chatbot that has access to your...
20 loaded
EY
Eugene Yan
4d ago · 20 items
20 loaded
WL
Welch Labs
5d ago · 15 items
15 loaded
AE
AI Explained
12d ago · 15 items
15 loaded
OU
One Useful Thing
13d ago · 20 items
20 loaded
BD
Blog | DataRobot
13d ago · 10 items
Introducing ACL Hydration: secure knowledge workflows for agentic AI 13d ago ACL Hydration secures knowledge workflows in the DataRobot Agent Workforce Platform: a unified framework for ingesting unstructured enterprise content, preserving source-system access controls, and enforcing those permissions at query time ... Your AI agents will run everywhere. Is your architecture ready for that? 15d ago Learn why infrastructure-agnostic deployment is essential, and how DataRobot provides a vendor-neutral control plane across clouds, on-prem, and edge. AI latency is a business risk. Here’s how to manage it 16d ago Learn how to reduce latency across predictive, generative, and agentic AI systems by balancing speed, accuracy, and cost. Agentic AI costs more than you budgeted. Here’s why. 23d ago Uncover the hidden costs of agentic AI in 2025. DataRobot’s strategic guide empowers IT leaders to optimize AI agent ROI and drive real business results. Why enterprise AI ROI starts with observability 26d ago Unlock AI ROI with observability. Learn how observability tools connect predictions to cost, revenue, and enterprise-wide business impact. Best agentic AI platforms: Why unified platforms win 28d ago Compare agentic AI platforms and see why unified, vendor-neutral control planes matter for building, deploying, and governing an AI agent workforce. How to achieve zero-downtime updates in large-scale AI agent deployments 30d ago Learn how to achieve zero-downtime AI deployments. Explore proven strategies to keep enterprise agents reliable, secure, and always available. What it takes to scale agentic AI in the enterprise 31d ago See why scaling agentic AI is critical for enterprise IT and AI leaders. Accelerate adoption, improve efficiency, and drive long-term ROI. The agentic AI development lifecycle 32d ago Discover how DataRobot unifies builders, operators, and governors in one platform for scalable, secure, and production-ready agentic AI. Your agentic AI pilot worked. Here’s why production will be harder. 34d ago Learn what it takes to scale agentic AI across the enterprise—unlock efficiency, strengthen governance, and deliver measurable business impact.
Claude for Legal Teams: Contract Review, Compliance and Due Diligence 16d ago See how the Claude legal plugin helps in-house legal teams with contract review, compliance scanning, due diligence, obligations tracking, and drafting. Vibe Coding Best Practices: 5 Claude Code Habits for Better Agentic Coding 21d ago Learn 5 practical vibe coding best practices for Claude Code and coding agents: CLAUDE.md, planning, review agents, safer prompts, and diff review. AI Benchmarks Explained: GPQA, SWE-bench, Chatbot Arena and What They Actually Measure 27d ago Learn what MMLU, GPQA Diamond, SWE-bench, HealthBench, and Chatbot Arena actually measure, and how labs game benchmark scores. Why AI-Native IDP Platforms Outperform ABBYY and Kofax in Modern Document Workflows 27d ago Evaluating IDP vendors? Compare Nanonets vs ABBYY and Kofax across architecture, operating model, and TCO to see why AI-native wins for IDP. Why You Hit Claude Limits So Fast: AI Token Limits Explained 29d ago Learn what AI tokens are, why Claude hits limits fast, and how to cut waste from context windows, history, files, tools, and reasoning. Did Google's TurboQuant Actually Solve AI Memory Crunch? 35d ago Google’s TurboQuant promises 6x KV-cache compression. Here’s what it means for AI memory, HBM demand, and the broader memory crunch. Claude for Finance Teams: Investment Banking, DCF Models, Reconciliation & Variance Analysis 45d ago See how finance teams use Claude for one-pagers, CIMs, comps, DCF models, reconciliations, and variance commentary, plus key human checks. AI Agent Hacks McKinsey: 5 Situations When You Should Not Deploy Agents 54d ago McKinsey hacked in 2 hours. 5 situations where AI agents will fail. Production permissions, regulated data, legacy systems—check before deploy. Are OpenAI and Google intentionally downgrading their models? 56d ago Yes, OpenAI and Google degrade their models. OpenAI admitted silent updates after denying it. Gemini redirects models. With full evidence. We ran 16 AI Models on 9,000+ Real Documents. Here's What We Found. 57d ago We benchmarked GPT-5.4, Gemini 3.1 Pro, Claude Opus, Sonnet, and 12 others on 3 Open OCR Benchmarks
15 loaded
AO
Ahead of AI
19d ago · 20 items
20 loaded
3B
3Blue1Brown
21d ago · 15 items
15 loaded
FO
Future of Life Institute
21d ago · 20 items
FLI’s President and CEO on Trump’s support for an AI ‘kill switch’ 21d ago FLI CEO’s statement on the attack against Sam Altman’s home 26d ago Prominent Scientists, Faith Leaders, Policymakers and Artists Call for a Prohibition on Superintelligence, as Poll Shows Americans Don’t Want It 40d ago Statement: Head of US Policy on the White House AI legislative recommendations 45d ago Governor DeSantis Directs Florida State Agencies to Partner with Future of Life Institute to Shield Families from AI Harm 59d ago “This is What it Means to be Pro-Human” Declares Broad Coalition of Conservative, Progressive, and Civil Society Groups in Statement of Shared Principles on AI 63d ago Statement from Max Tegmark on the Department of War’s ultimatum 69d ago Future of Life Institute Launches Multimillion Dollar Nationwide AI Regulation Campaign 86d ago AI Company Safety Practices Fall Short of Public Commitments and Show Structural Weaknesses, as Top Performers Widen the Gap 155d ago The U.S. Public Wants Regulation (or Prohibition) of Expert‑Level and Superhuman AI 199d ago
20 loaded
BD
Big Data – SiliconANGLE
29d ago · 20 items
Snowflake expands open data strategy with Iceberg V3 support and governance portability plan 29d ago Snowflake expands open data strategy with Iceberg V3 support and governance portability plan - SiliconANGLE How the NFL is using Amazon Quick to humanize the offseason 30d ago How the NFL is using Amazon Quick to humanize the offseason - SiliconANGLE Satellite data startup Xoople closes $130M investment 30d ago Satellite data startup Xoople closes $130M investment - SiliconANGLE Niobium brings fully encrypted AI workloads to the cloud with The Fog 35d ago Niobium brings fully encrypted AI workloads to the cloud with The Fog - SiliconANGLE Datadog debuts Experiments to unify product testing and observability data 35d ago Datadog debuts Experiments to unify product testing and observability data - SiliconANGLE SAP buys Reltio to pull in more outside data for AI agents 37d ago SAP buys Reltio to pull in more outside data for AI agents - SiliconANGLE What to expect at Qlik Connect: Join theCUBE April 14 40d ago Decision intelligence is driving real-time enterprise decisions with AI, data integration and automation across modern data platforms. Exclusive: HG Insights expands revenue growth intelligence platform with agentic capabilities 44d ago Exclusive: HG Insights expands revenue growth intelligence platform with agentic capabilities - SiliconANGLE As AI reshapes the storage hierarchy, SSDs move to the center of inference 44d ago As AI inference workloads surge, SSD infrastructure is becoming as strategically critical as the GPUs it feeds, and the supply gap could last years. Snowflake invests in Bedrock Data to strengthen agentic AI system governance 49d ago Snowflake invests in Bedrock Data to strengthen agentic AI system governance - SiliconANGLE
20 loaded
AW
AI Weirdness
36d ago · 15 items
Get working on your April Fools Eiffel Tower 36d ago Elevator Surprise: Place a tiny camera in the elevator, and when someone gets in, snap a photo saying, "Welcome to Space Station!" Or build a miniature model of the Eiffel Tower next to it for a dramatic effect. Tower of Pancakes: Create a ... Bonus: More April Fools pranks from Eiffel Tower Llama 36d ago AI Weirdness: the strange side of machine learning When a chatbot runs your store 138d ago You may have heard of people hooking up chatbots to controls that do real things. The controls might run internet searches, run commands to open and read documents and spreadsheets, or even edit or delete entire databases. Whether this soun... Bonus: Incorrect Christmas Carols 139d ago AI Weirdness: the strange side of machine learning Tiny neural net Halloween costumes are the best 191d ago I've been experimenting with getting a tiny circa-2015 recurrent neural network to generate Halloween costumes. Running on a single cat hair-covered laptop, char-rnn has no internet training, but learns from scratch to imitate the data I gi... More tiny neural net costumes 191d ago AI Weirdness: the strange side of machine learning Halloween costumes by tiny neural net 202d ago I've recently been experimenting with one of my favorite old-school neural networks, a tiny program that runs on my laptop and knows only about the data I give it. Without internet training, char-rnn doesn't have outside references to draw ... Bonus: more halloween costumes from tiny neural net 202d ago AI Weirdness: the strange side of machine learning Botober 2025: Terrible recipes from a tiny neural net 218d ago After seeing generated text evolve from the days of tiny neural networks to today's ChatGPT-style large language models, I have to conclude: there's something special about the tiny guys. Maybe it's the way the tiny neural networks string t... Bonus: Char-rnn's jello creations 218d ago AI Weirdness: the strange side of machine learning
15 loaded
YK
Yannic Kilcher
61d ago · 15 items
15 loaded
IN
inFERENCe
71d ago · 15 items
The Future of Software 71d ago The world of software is undergoing a shift not seen since the advent of compilers in the 1970s. Compilers were the original vibe coding: they automatically generate complex machine code that human programmers had to manually write before. ... Deep Learning is Powerful Because It Makes Hard Things Easy - Reflections 10 Years On 96d ago Ten years ago this week, I wrote a post called "Deep Learning is Easy - Learn Something Harder". The post blew up, top spot on HackerNews. Needless to say, it didn't age well. Discrete Diffusion: Continuous-Time Markov Chains 350d ago A tutorial explaining some intuitions behind continuous time Markov chains for machine learners interested in discrete diffusion models. We may finally crack Maths. But should we? 1064d ago Automating mathematical theorem proving has been a long standing goal of artificial intelligence and indeed computer science. It's one of the areas I became very interested in recently. This is because I feel we may have the ingredients nee... Mortal Komputation: On Hinton's argument for superhuman AI. 1073d ago Last week in Cambridge was Hinton bonanza. He visited the university town where he was once an undergraduate in experimental psychology, and gave a series of back-to-back talks, Q&A sessions, interviews, dinners, etc. He was stopped on the ... Autoregressive Models, OOD Prompts and the Interpolation Regime 1134d ago A few years ago I was very much into maximum likelihood-based generative modeling and autoregressive models (see this, this or this). More recently, my focus shifted to characterising inductive biases of gradient-based optimization focussin... We May be Surprised Again: Why I take LLMs seriously. 1142d ago "Deep Learning is Easy, Learn something Harder" - I proclaimed in one of my early and provocative blog posts from 2016. While some observations were fair, that post is now evidence that I clearly underestimated the impact simple techniques ... Implicit Bayesian Inference in Large Language Models 1526d ago This intriguing paper kept me thinking long enough for me to I decide it's time to resurrect my blogging (I started writing this during ICLR review period, and realised it might be a good idea to wait until that's concluded) * Sang Michael ... Eastern European Guide to Writing Reference Letters 1529d ago Excruciating. One phrase I often use to describe what it's like to read reference letters for Eastern European applicants to PhD and Master's programs in Cambridge. Even objectively outstanding students often receive dull, short, factual, a... Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation Models 1792d ago This post is written with my PhD student and now guest author Patrik Reizinger [https://twitter.com/rpatrik96] and is part 4 of a series of posts on causal inference: * Part 1: Intro to causal inference and do-calculus [https://www.inferenc...
15 loaded
DB
Damian Bogunowicz - dtransposed
76d ago · 10 items
TG
The Gradient
77d ago · 15 items
15 loaded
AK
Andrej Karpathy blog
84d ago · 10 items
SR
Salmon Run
94d ago · 20 items
Book Review: Software Engineering for Data Scientists 94d ago As a Software Engineer (backend Web Development then Search) turned Data Scientist, I was particularly interested in what the book Software ... Book Review: Transformers In Action 116d ago The Attention Is All You Need paper proposed the Transformer Architecrture as an improvement to the dominant encoder-decoder models of the ... Trip Report: PyData Global 2025 131d ago I attended PyData Global 2025 earlier this month. I had hoped to write this up earlier, but I've been busy, so only now getting the time Ch... Book Review: Time Series Forecasting using Foundation Models 206d ago As someone who primarily works in NLP and Search in the Health Domain, I don't have much use for Time Series. However, while exploring the F... Book Review: Statistics every Programmer Needs 229d ago I recently read Statistics every Programmer Needs by Gary Sutton. I am probably a good target audience for the book since I used to be a so... Book Review: Hands-On Artificial Intelligence for IoT 312d ago For those in similar professional circles as I am in, i.e. looking forward into the Generative AI space, yet with one foot pragmatically and... Book Review: Essential Graph RAG 325d ago Coming from a background of Knowledge Graph (KG) backed Medical Search, I don't need to be convinced about the importance of manually curate... Packaging ML Pipelines from Experiment to Deployment 491d ago As an ML Engineer, we are generally tasked with solving some business problem with technology. Typically it involves leveraging data assets ... Trip Report - PyData Global 2024 514d ago I attended PyData Global 2024 last week. Its a virtual conference, so I was able to attend it from the comfort of my home, although presenta... Using Knowledge Graphs to enhance Retrieval Augmented Generation 578d ago Retrieval Augmented Generation (RAG) has become a popular approach to harness LLMs for question answering using your own corpus of data. Typ...
20 loaded
SE
sentdex
139d ago · 15 items
15 loaded
DS
David Stutz
214d ago · 10 items
RAISE 2025 panel statement on aligning AI to clinical values 214d ago Recently, I attended the Responsible AI for Social and Ethical Healthcare 2025 “2.0” Symposium organized by, among others, Harvard Medical School. The symposium featured various panels on topics surrounding generative AI, in particular mult... Some Lessons on Reviews and Rebuttals 458d ago Writing and responding to reviews is the bread and butter of any academic and especially in AI research, PhD students are confronted with both rather early compared to other displicines. Unfortunately, I found that drafting reviews and rebu... Thoughts on Watermarking AI-Generated Content 476d ago Watermarking AI-generated content has the potential to address various problems that generative AI threatens to aggravate — misinformation, impersonation, copyright infringement, web pollution, etc. However, it is also controversial with ma... Thoughts and Lessons for Planning Rater Studies in AI 486d ago With the goal of deploying generative AI systems, rater studies are becoming increasingly common and important. This means more and more researchers and engineers face the challenge of actually planning and conducting rater studies for AI s... Open-Sourcing Relabeled MedQA and Dermatology DDx Datasets 541d ago Dealing with rater disagreement is becoming more important in AI, especially for LLMs and in specialized domains such as health. In the past year, I helped open source two datasets allowing to study rater disagreement in the health domain: ... Thinking About Research Ideas vs. Technology 543d ago In this article, I want to share some thoughts on the difference between research ideas and technology, particularly in machine learning. This distinction is have been contemplating since starting my PhD. After joining Google DeepMind and b... The Importance of Effectively Experimenting in an AI PhD 668d ago Engineering and running experiments are a key component of most PhDs in AI. While there are plenty of more theoretical topics that are often limited to smaller scale experimentation, the trend has definitely been to scale up models, dataset... FAQ for our Monte Carlo Conformal Prediction 725d ago Over the past months, I have given several talks about Monte Carlo conformal prediction and the problem of calibrating with uncertain ground truth, for example, stemming from annotator disagreement. Each time, the audience had great questio... Documenting your PhD — Keeping Track of Meetings, Experiments and Decisions 731d ago A PhD can be a difficult endeavour. While becoming an expert in tackling a specific problems, it is easy to lose track of things: Have I read this paper before? What was the paper saying? Why did we decide to change course? Why am I running... On NeurIPS’ High School Paper Track 752d ago The decision to have a separate High School Project Track at NeurIPS 2024 has sparked quite some controversy, with many prominent AI researchers debating pros and cons and personal opinions, primarily on X/Twitter. Initially, I ignored this...
VI
VITALab
252d ago · 10 items
Brain Latent Progression Individual-based spatiotemporal disease progression on 3D Brain MRIs via latent diffusion 252d ago This article aims at reviewing a Alzheimer’s spatiotemporal disease progression predictive model called Brain Latent Progression (BrLP). All in all, this is ... A Survey of popular LLM Evaluation Metrics 260d ago Large Language Models (LLMs) are increasingly applied to critical domains such as medical report generation, where accuracy and trust are essential. Evaluati... Open-Source Large Language Models in Radiology: A Review and Tutorial for Practical Research and Clinical Deployment 269d ago Open-Source Large Language Models in Radiology MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation 338d ago MemSAM Simplifying Deep Temporal Difference Learning 395d ago tl;dr The authors propose PQN, a simplified deep online Q-Learning that uses very small replay buffers. Normalization and parallelized sampling from vectoriz... EchoPrime: Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation 409d ago Objective EchoPrime is a foundation model designed for comprehensive echocardiographic interpretation. Unlike previous models that use single views or static... DeepSeek-V3 Technical Report 450d ago DeepSeek-V3 Variational Autoencoders for Generating Synthetic Tractography-Based Bundle Templates in a Low-Data Setting 478d ago Highlights Implicit neural representations 506d ago Implicit neural networks Foundations of diffusion networks 527d ago Diffusion networks As there’s a lot of recent developments around image generation and diffusion models in general, I took a deep dive in the fundamentals of...
LL
Lil'Log
371d ago · 20 items
Why We Think 371d ago Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute (Graves et al. 2016, Ling, et al. 2017, Cobbe et al. 2021) and Chain-of-thought (CoT) (Wei et al. 2022, Nye et al. 2021), ... Reward Hacking in Reinforcement Learning 525d ago Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing the intended task. Reward hacking exists because RL enviro... Extrinsic Hallucinations in LLMs 669d ago Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucination has been somewhat generalized to cases when the model makes mistakes. Here,... Diffusion Models for Video Generation 755d ago Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video generation. The task itself is a superset of the image case, since an ima... Thinking about High-Quality Human Data 822d ago [Special thank you to Ian Kivlichan for many useful pointers (E.g. the 100+ year old Nature paper “Vox populi”) and nice feedback. 🙏 ] High-quality data is the fuel for modern data deep learning model training. Most of the task-specific lab... Adversarial Attacks on LLMs 925d ago The use of large language models in the real world has strongly accelerated by the launch of ChatGPT. We (including my team at OpenAI, shoutout to them) have invested a lot of effort to build default safe behavior into the model during the ... LLM Powered Autonomous Agents 1049d ago Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The potentiality of LLM extends beyond genera... Prompt Engineering 1149d ago Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empirical science and the effect of prompt eng... The Transformer Family Version 2.0 1196d ago Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a big refactoring and enrichment of that 2020 post — restructure the hierarchy of sections an... Large Transformer Model Inference Optimization 1213d ago [Updated on 2023-01-24: add a small section on Distillation.] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. They are powerful but very expensive to train and use. The extremely high inferenc...
20 loaded
DA
Datumbox
376d ago · 20 items
20 loaded
JA
Jay Alammar
407d ago · 10 items
Moving To Substack 407d ago I’m freezing this blog and starting to post on my Substack instead. The authoring experience is much more convenient for me there. Please follow me there, and check out The Illustrated DeepSeek R-1 if you haven’t yet. And check out our How ... Generative AI and AI Product Moats 1094d ago Here are eight observations I’ve shared recently on the Cohere blog and videos that go over them.: Article: What’s the big deal with Generative AI? Is it the future or the present? Article: AI is Eating The World Remaking Old Computer Graphics With AI Image Generation 1222d ago Can AI Image generation tools make re-imagined, higher-resolution versions of old video game graphics? Over the last few days, I used AI image generation to reproduce one of my childhood nightmares. I wrestled with Stable Diffusion, Dall-E ... The Illustrated Stable Diffusion 1311d ago Translations: Chinese, Vietnamese. (V2 Nov 2022: Updated images for more precise description of forward diffusion. A few more images in this version) AI image generation is the most recent AI capability blowing people’s minds (mine included... Applying massive language models in the real world with Cohere 1522d ago A little less than a year ago, I joined the awesome Cohere team. The company trains massive language models (both GPT-like and BERT-like) and offers them as an API (which also supports finetuning). Its founders include Google Brain alums in... The Illustrated Retrieval Transformer 1585d ago Discussion: Discussion Thread for comments, corrections, or any feedback. Translations: Korean, Russian Summary: The latest batch of language models can be much smaller yet achieve GPT-3 like performance by being able to query a database or... Explainable AI Cheat Sheet 1829d ago Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their predictions. I introduce the cheat sheet in this brief video: Finding the Words to Say: Hidden State Visualizations for Language Models 1934d ago By visualizing the hidden state between a model's layers, we can get some clues as to the model's Interfaces for Explaining Transformer Language Models 1967d ago Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries generated by a language model Tap or hover over the output tokens: Explorable #2: ... How GPT3 Works - Visualizations and Animations 2110d ago Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russian, Turkish The tech world is abuzz with GPT3 hype. Massive language models (lik...
AK
Andrej Karpathy
433d ago · 15 items
15 loaded
CH
Chip Huyen
476d ago · 10 items
SS
Seita's Place
1056d ago · 10 items
My Faculty Application Experience 1056d ago I spent roughly a year preparing, and then interviewing, for tenure-trackfaculty positions. My job search is finally done, and I am joining theUniversity of ... Books Read in 2022 1222d ago At the end of every year I have a tradition where I write summaries of thebooks that I read throughout the year. Unfortunately this year wasexceptionally bus... Conference on Robot Learning 2022 1227d ago The airplanes on display at the CoRL 2022 banquet. The 2022 Robotics: Science and Systems Conference 1243d ago A photo I took while at RSS 2022 in New York City, on the dinner cruise arranged by the conference. The (In-Person) ICRA 2022 Conference in Philadelphia 1373d ago A photo I took while at ICRA 2022 in Philadelphia. This is the Two New Papers: Learning to Fling and Singulate Fabrics 1379d ago The system for our IROS 2022 paper on singulating layers of cloth with tactile sensing. A Plea to End Harassment 1391d ago Scott Aaronson is a professor of computer science at UT Austin, where hisresearch area is in theoretical computer science. However, he may be more wellknown ... My Paper Reviewing Load 1475d ago This is a regularly updated post, last updated April 21, 2026. I Stand with Ukraine 1532d ago I stand with Ukraine and firmly oppose Vladimir Putin’s invasion. Books Read in 2021 1587d ago At the end of every year I have a tradition where I write summaries of thebooks that I read throughout the year. Here’s the following post with the roughset ...
NB
NEWS & BLOG - EDIA
1267d ago · 20 items
ELG book now available 1267d ago European Language Grid. A Language Technology Platform for Multilingual Europe. Innovations, investments, and ROI: to make or to buy? 1620d ago Embracing content agility is key. But what about the associated investments? Is it better for publishers to make or buy innovations? After digitalisation comes automation: the publisher's best next step 1640d ago What to do once you've embraced content agility and implemented digitalisation tools? Why is automation the publisher's best next step? How to arrive at content agility: 3 stages of an agile process 1694d ago The road towards content agility is rather agile in itself. It requires a brand-new approach. How to go from a linear to a modular view? Why embracing content agility is key 1746d ago Personalised learning requires content to be customisable — all the time. So, when it comes to educational content, content agility is key. Content metadata: a recap on why you should really automate content labelling 1765d ago Why are some types of labels particularly interesting to automate? And what's next for future-proof companies that have embraced automation? Content metadata: the 'why' and 'what' of learning objectives tagging and automated labelling 1807d ago What is learning objectives tagging? Why is it useful? And how will it benefit from automated labelling? Content metadata: what automated labels can do for topic classification 1823d ago What is topic classification? How does it differ from keyword extraction? And why will it benefit from automated labelling? Content metadata: why keyword extraction requires automated labelling 1833d ago What is keyword extraction? Why does it require automated labelling? And what are the benefits of automation? Content metadata: automated labelling and the CEFR 1849d ago What is the CEFR? Why and how should you use automated labelling? And what are the benefits of automation?
20 loaded
TS
The Stanford AI Lab Blog
1437d ago · 15 items
LinkBERT: Improving Language Model Training with Document Link 1437d ago Language Model Pretraining Language models (LMs), like BERT 1 and the GPT series 2, achieve remarkable performance on many natural language processing (NLP) tasks. They are now the foundation of today’s NLP systems. 3 These models serve imp... Stanford AI Lab Papers and Talks at ACL 2022 1443d ago The official Stanford AI Lab blog Stanford AI Lab Papers and Talks at ICLR 2022 1473d ago The official Stanford AI Lab blog Discovering the systematic errors made by machine learning models 1491d ago Discovering systematic errors with cross-modal embeddings Grading Complex Interactive Coding Programs with Reinforcement Learning 1501d ago The official Stanford AI Lab blog Understanding Deep Learning Algorithms that Leverage Unlabeled Data, Part 1: Self-training 1533d ago The official Stanford AI Lab blog Stanford AI Lab Papers and Talks at AAAI 2022 1535d ago The official Stanford AI Lab blog How to Improve User Experience (and Behavior): Three Papers from Stanford's Alexa Prize Team 1556d ago Introduction Reward Isn't Free: Supervising Robot Learning with Language and Video from the Web 1567d ago This work was conducted as part of SAIL and CRFM. BanditPAM: Almost Linear-Time k-medoids Clustering via Multi-Armed Bandits 1602d ago The official Stanford AI Lab blog
15 loaded
MI
ML in Production
1460d ago · 10 items
Driving Experimentation Forward through a Working Group (Experimentation Program Series: Guide 03) 1460d ago We describe how diverse stakeholders can drive experimentation forward through the formation of a working group and what role data science plays. What is an Experimentation program and Who is Involved? (Experimentation Program Series: Guide 02) 1496d ago We define what an experimentation program is and discuss which stakeholder groups should participate in order to drive experimentation forward. Building An Effective Experimentation Program (Experimentation Program Series: Guide 01) 1509d ago An introduction to building an effective experimentation program at your company. Lessons Learned from Writing Online 1552d ago Where I share my story writing MLinProduction and key metrics from building my audience and monetization. Newsletter #087 1983d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #086 1991d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #085 1997d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #084 2004d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #083 2011d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems. Newsletter #082 2018d ago Weekly newsletter dedicated to sharing resources for building and operating production machine learning systems.
AT
AI Trends
1651d ago · 10 items
EE
ELEDIA E-AIR
2024d ago · 9 items
SB
Software 2.0 3098d ago Software 2.0 I sometimes see people refer to neural networks as just “another tool in your machine learning toolbox”. They have some pros and cons, they work here or there, and sometimes you can … AlphaGo, in context 3262d ago AlphaGo, in context Update Oct 18, 2017: AlphaGo Zero was announced. This post refers to the previous version. 95% of it still applies. I had a chance to talk to several people about the recent … ICML accepted papers institution stats 3269d ago ICML accepted papers institution stats The accepted papers at ICML have been published. ICML is a top Machine Learning conference, and one of the most relevant to Deep Learning, although NIPS has a … A Peek at Trends in Machine Learning 3317d ago A Peek at Trends in Machine Learning Have you looked at Google Trends? It’s pretty cool — you enter some keywords and see how Google Searches of that term vary through time. I thought — hey, I … ICLR 2017 vs arxiv-sanity 3341d ago ICLR 2017 vs arxiv-sanity I thought it would be fun to cross-reference the ICLR 2017 (a popular Deep Learning conference) decisions (which fall into 4 categories: oral, poster, workshop, reject) with … Virtual Reality: still not quite there, again. 3396d ago Virtual Reality: still not quite there, again. The first time I tried out Virtual Reality was a while ago — somewhere in the late 1990's. I was quite young so my memory is a bit hazy, but I … Yes you should understand backprop 3425d ago Yes you should understand backprop When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to include explicit calculations involved in … CS183c Assignment #3 3826d ago CS183c Assignment #3 The last few weeks we heard from several excellent guests, including Selina Tobaccowala from Survey Monkey, Patrick Collison from Stripe, Nirav Tolia from Nextdoor, Shishir …

No matching sources found.