AI Feed

arXiv cs.LG Papers 2 days ago

The Degeneracy Distillery

arXiv:2606.23838v1 Announce Type: new Abstract: When two or more parameters or labels produce similar data, they are degenerate, or hard to distinguish. Degeneracies render both label…

arXiv cs.LG Papers 2 days ago

HyMaTE: A Hybrid Mamba and Transformer Model for EHR Representation Learning

arXiv:2509.24118v2 Announce Type: replace Abstract: Electronic health Records (EHRs) have become a cornerstone in modern-day healthcare. They are a crucial part for analyzing the progression of…

arXiv cs.LG Papers 2 days ago

A Survey on Federated Causal Discovery and Inference

arXiv:2606.23741v1 Announce Type: new Abstract: Causal reasoning, which encompasses the discovery of causal structures and the inference of causal effects, is fundamental to data-driven decision making.…

arXiv cs.LG Papers 2 days ago

Low-power analogue neural networks with trainable nonlinear connections for continuous control

arXiv:2606.23742v1 Announce Type: new Abstract: Physical neural networks promise low-power machine learning by computing directly with analogue device physics, but most architectures force nonlinear device responses…

arXiv cs.LG Papers 2 days ago

Synergizing Physically Constrained MCMC and Chemical-Informed Gaussian Processes for Reaction Network Discovery

arXiv:2606.23757v1 Announce Type: new Abstract: Extracting interpretable governing equations from sparse, noisy chemical time-series data remains difficult because discrete reaction topology and continuous kinetic parameters are…

arXiv cs.LG Papers 2 days ago

One Ruler: A Same-Hands Re-Evaluation of Bivariate Causal Direction on Tuebingen, with a Parameter-Free Compression Baseline

arXiv:2606.23767v1 Announce Type: new Abstract: Headline accuracies on the Tuebingen cause-effect pairs are routinely compared across papers even though each is measured under its authors' own…

arXiv cs.LG Papers 2 days ago

Exploring Dualistic Meta-Learning to Enhance Domain Generalization in Open Set Scenarios

arXiv:2606.23758v1 Announce Type: new Abstract: Domain generalization learns from multiple source domains to generalize to unseen target domains. However, it often neglects the realistic case of…

arXiv cs.LG Papers 2 days ago

Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America

arXiv:2606.23833v1 Announce Type: new Abstract: Terrestrial water storage (TWS) integrates snow, soil moisture, surface water, and groundwater and is a key indicator of how climate variability…

arXiv cs.LG Papers 2 days ago

Deciphering Fingerprints of 3D Molecular Surfaces for Accurate Epitope Prediction

arXiv:2606.23830v1 Announce Type: new Abstract: Molecular surfaces encode the geometric and physicochemical patterns that determine antibody-antigen recognition, central to epitope prediction. However, existing methods rely on…

arXiv cs.LG Papers 2 days ago

Machine-Learning Emulation of Satellite Greenhouse Gas Retrievals: Stability over Time

arXiv:2606.09313v2 Announce Type: replace Abstract: Retrieval algorithms are used to estimate atmospheric concentrations of greenhouse gases (GHGs), such as carbon dioxide (CO2) and methane (CH4), by…

arXiv cs.LG Papers 2 days ago

Multi-agent imitation learning with function approximation: Linear Markov games and beyond

arXiv:2602.22810v2 Announce Type: replace Abstract: In this work, we present the first theoretical analysis of multi-agent imitation learning (MAIL) in linear Markov games where both the…

arXiv cs.LG Papers 2 days ago

SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling

arXiv:2506.14293v4 Announce Type: replace-cross Abstract: We present Sleeping-DISCO 9M, a large-scale pre-training dataset for music and song. To the best of our knowledge, there are no…

arXiv cs.LG Papers 2 days ago

On the Position Bias of On-Policy Distillation

arXiv:2606.22600v2 Announce Type: replace Abstract: On-Policy Distillation (OPD) improves the learning efficiency of standard reinforcement learning through dense, token-level supervision from teachers. In the standard KL…

arXiv cs.LG Papers 2 days ago

Teaching Diffusion to Speculate Left-to-Right

arXiv:2606.11552v2 Announce Type: replace-cross Abstract: Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference…

arXiv cs.LG Papers 2 days ago

Event-Grounded Question Answering over Long Audio via Structured Retrieval

arXiv:2602.14612v4 Announce Type: replace-cross Abstract: Answering natural-language questions over multi-hour audio requires both event recognition and temporal grounding. Current large audio-language models perform well on short…

arXiv cs.LG Papers 2 days ago

Weight-Space Geometry of Offline Reasoning Training

arXiv:2606.23740v1 Announce Type: new Abstract: Offline reinforcement-learning losses (RFT, RIFT, DFT, Offline GRPO, DPO) are widely used to distill reasoning from large teachers into smaller students,…

arXiv cs.CL Papers 2 days ago

RASC+: Retrieval-Constrained LLM Adjudication for Clinical Value Set Authoring

arXiv:2606.23992v1 Announce Type: new Abstract: Clinical value sets define the standardized terminology codes used in quality measurement, phenotyping, cohort construction, and clinical decision support. The recently…

arXiv cs.CL Papers 2 days ago

Business as Rulesual: A Benchmark and Framework for Business Rule Flow Modeling with LLMs

arXiv:2505.18542v4 Announce Type: replace Abstract: Extracting structured procedural knowledge from unstructured business documents is a critical yet unresolved bottleneck in process automation. While prior work has…

arXiv cs.CL Papers 2 days ago

Faithful by Construction: Claim-Anchored Attribution for Multi-Document Summarization

arXiv:2606.23989v1 Announce Type: new Abstract: End-to-end large language models (LLMs) produce fluent multi-document summaries but remain prone to hallucination, and the attributions they offer are typically…

arXiv cs.CL Papers 2 days ago

AdversaBench: Automated LLM Red-Teaming with Multi-Judge Confirmation and Cross-Model Transferability

arXiv:2606.24589v1 Announce Type: cross Abstract: Scaling adversarial evaluation of large language models requires both a method for generating hard inputs and a reliable way to confirm…

arXiv cs.CL Papers 2 days ago

Does My Embedding Reflect That $A = B$? Evaluating Mathematical Equivalence in Embedding Models

arXiv:2606.23959v1 Announce Type: new Abstract: Because mathematics is highly abstract, a single statement can take very different forms depending on what subfield it is framed in.…

arXiv cs.CL Papers 2 days ago

Co-occurring associated retained concepts in Diffusion Unlearning

arXiv:2606.24192v1 Announce Type: cross Abstract: Unlearning has emerged as a key technique to mitigate harmful content generation in diffusion models. However, existing methods often remove not…

arXiv cs.CL Papers 2 days ago

Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English

arXiv:2606.23948v1 Announce Type: new Abstract: Self-supervised and supervised speech models are increasingly used to investigate which linguistic information their internal representations encode, and at what level…

arXiv cs.CL Papers 2 days ago

VieSpeaker: A Large-Scale Vietnamese Speaker Recognition Dataset Beyond Visual Dependency

arXiv:2606.24066v1 Announce Type: cross Abstract: Speaker recognition has advanced rapidly with large-scale training datasets, yet Vietnamese remains under-resourced, with existing corpora limited in scale and acoustic…

Latest

The Degeneracy Distillery

HyMaTE: A Hybrid Mamba and Transformer Model for EHR Representation Learning

A Survey on Federated Causal Discovery and Inference

Low-power analogue neural networks with trainable nonlinear connections for continuous control

Synergizing Physically Constrained MCMC and Chemical-Informed Gaussian Processes for Reaction Network Discovery

One Ruler: A Same-Hands Re-Evaluation of Bivariate Causal Direction on Tuebingen, with a Parameter-Free Compression Baseline

Exploring Dualistic Meta-Learning to Enhance Domain Generalization in Open Set Scenarios

Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America

Deciphering Fingerprints of 3D Molecular Surfaces for Accurate Epitope Prediction

Machine-Learning Emulation of Satellite Greenhouse Gas Retrievals: Stability over Time

Multi-agent imitation learning with function approximation: Linear Markov games and beyond

SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling

On the Position Bias of On-Policy Distillation

Teaching Diffusion to Speculate Left-to-Right

Event-Grounded Question Answering over Long Audio via Structured Retrieval

Weight-Space Geometry of Offline Reasoning Training

RASC+: Retrieval-Constrained LLM Adjudication for Clinical Value Set Authoring

Business as Rulesual: A Benchmark and Framework for Business Rule Flow Modeling with LLMs

Faithful by Construction: Claim-Anchored Attribution for Multi-Document Summarization

AdversaBench: Automated LLM Red-Teaming with Multi-Judge Confirmation and Cross-Model Transferability

Does My Embedding Reflect That $A = B$? Evaluating Mathematical Equivalence in Embedding Models

Co-occurring associated retained concepts in Diffusion Unlearning

Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English

VieSpeaker: A Large-Scale Vietnamese Speaker Recognition Dataset Beyond Visual Dependency

Browse by category