AI Feed

HF Daily Papers Papers 1 day ago

Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning

The composition of training data, governed by the diversity of sources and their mixing strategy, is a cornerstone of Large Language Model (LLM) pre-training. Online Data…

HF Daily Papers Papers 1 day ago

Are Text-to-Image Models Inductivist Turkeys? A Counterfactual Benchmark for Causal Reasoning

Text-to-image (T2I) generation models have achieved remarkable progress in producing visually realistic images from natural language prompts. Yet it remains unclear whether their success reflects genuine…

X · @teortaxesTex X / Twitter 1 day ago

Just realized that this is GRPO-brained, generally ORM-brained dense process reward signal, in theory, would let you progress even if you do not have …

Just realized that this is GRPO-brained, generally ORM-brained dense process reward signal, in theory, would let you progress even if you do not have "positive trajectories". Of…

HF Daily Papers Papers 1 day ago

World Value Models for Robotic Manipulation

Generalist value models play a pivotal role in scaling robotic policy learning from large-scale, mixed-quality data. Mathematically, accurate value estimation demands deep temporal understanding, requiring models…

HF Daily Papers Papers 1 day ago

Qwen-AgentWorld: Language World Models for General Agents

A world model predicts environment dynamics based on current observations and actions, serving as a core cognitive mechanism for reasoning and planning. In this work, we…

HF Daily Papers Papers 1 day ago

MemGUI-Agent: An End-to-End Long-Horizon Mobile GUI Agent with Proactive Context Management

MLLM-based mobile GUI agents have made substantial progress on short-horizon tasks, yet remain unreliable on long-horizon tasks that require retaining intermediate facts across many steps and…

HF Daily Papers Papers 1 day ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Agentic language models dramatically expand the applications of AI, yet little is publicly known about how to curate training data for broadly capable agents. Existing open…

Luma News (scraped) Generative Media 1 day ago

Introducing Luma Skills: Build a Creative Workflow Once, Run It Forever

HF Daily Papers Papers 1 day ago

FLUX3D: High-Fidelity 3D Gaussian Generation with Diffusion-Aligned Sparse Representation

Sparse voxel representation has emerged as a scalable foundation for image-to-3D Gaussian Splatting (3DGS) generation, yet current methods struggle to preserve high-frequency visual details of input…

Luma News (scraped) Generative Media 1 day ago

Luma Introduces Ray3.2 Model & API: Complete Creative Control for Video Generation

X · @ylecun X / Twitter 1 day ago

RT Lawfare: Re https://www.lawfaremedia.org/article/tulsi-gabbard-s-fauci-files-don-t-prove-what-she-says-they-prove


HF Daily Papers Papers 1 day ago

MobileForge: Annotation-Free Adaptation for Mobile GUI Agents with Hierarchical Feedback-Guided Policy Optimization

MLLM-based mobile GUI agents have made substantial progress in UI understanding and action execution, but adapting them to real target apps remains costly because mobile apps…

Cohere Blog (scraped) Frontier Labs 1 day ago

North Mini Code: Agentic coding model, built for practical software engineering

Mistral News (scraped) Frontier Labs 1 day ago

Introducing Mistral OCR 4

State-of-the-art document intelligence model. Mistral AI, Research, June 23, 2026.

HF Daily Papers Papers 1 day ago

DiffusionBench: On Holistic Evaluation of Diffusion Transformers

Diffusion transformer (DiT) research on image generation has converged to a single evaluation setup: class-conditional generation on ImageNet. While methods improve the FID and related metrics,…

OpenAI News (scraped) Frontier Labs 1 day ago

OpenAI and Broadcom unveil LLM-optimized inference chip

ElevenLabs Blog (scraped) Generative Media 1 day ago

ElevenLabs to expand in California creating 173 high-paying jobs

Luma News (scraped) Generative Media 1 day ago

Luma Announces The Open Physical AI Lab: An Open Science Effort to Solve Generalization in Physical AI

Microsoft Research Frontier Labs 1 day ago

Talos: Scaling rare disease diagnosis with automated, iterative genomic reanalysis

Talos was built to help resolve a major bottleneck in genomic medicine: human review time. The open-source system recovered 90% of in-scope diagnoses while surfacing just…

Mistral News (scraped) Frontier Labs 1 day ago

Remote agents in Vibe. Powered by Mistral Medium 3.5.

Introducing Mistral Medium 3.5, remote coding agents in Vibe, plus new Work mode in Le Chat for complex tasks. Mistral AI, Product, May 22, 2026.


TechCrunch - AI Generative Media 1 day ago

3 days left to save up to $190 on your TechCrunch Founder Summit 2026 pass

You have just 3 days left to save up to $190 on your pass to TechCrunch Founder Summit 2026 before Early Bird rates end on June…

HF Daily Papers Papers 1 day ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

Long-context reasoning is an essential capability for large language models, particularly when they are deployed as autonomous agents that must reason over lengthy trajectories. Reinforcement learning…

OpenAI News (scraped) Frontier Labs 1 day ago

How GPT-5 helped immunologist Derya Unutmaz solve a 3-year-old mystery

ElevenLabs Blog (scraped) Generative Media 1 day ago

ElevenLabs partners with the UK Government to bring voice AI to public services, as it expands London HQ
