AI & ML

313 stories from 28 sources

Dark Reading · 6d ago

Microsoft, Salesforce Patch AI Agent Data Leak Flaws

Microsoft, Salesforce Patch AI Agent Data Leak Flaws Two recently fixed prompt injections in Salesforce Agentforce and Microsoft Copilot would have enabled an external attacker to leak sensitive data.

impact 39
arXiv Security · 5h ago

Prompt to Pwn: Automated Exploit Generation for Smart Contracts

Prompt to Pwn: Automated Exploit Generation for Smart Contracts arXiv:2508.01371v3 Announce Type: replace Abstract: Smart contracts are important for digital finance, yet they are hard to patch once deployed. Prior work…

impact 26
Microsoft · 5w ago

Announcing Copilot leadership update

Announcing Copilot leadership update Satya Nadella, Chairman and CEO, and Mustafa Suleyman, Executive Vice President and CEO of Microsoft AI, shared the below communications with Microsoft employees this morning. SATYA…

impact 26
SecurityWeek · just now

Claude Mythos Finds 271 Firefox Vulnerabilities

Claude Mythos Finds 271 Firefox Vulnerabilities All the flaws could have also been found by an elite human researcher, according to Mozilla. The post Claude Mythos Finds 271 Firefox Vulnerabilities appeared first on Sec…

impact 24
TheHackerNews · just now

Toxic Combinations: When Cross-App Permissions Stack into Risk

Toxic Combinations: When Cross-App Permissions Stack into Risk On January 31, 2026, researchers disclosed that Moltbook, a social network built for AI agents, had left its database wide open, exposing 35,000 email addre…

impact 20
MIT Tech Review · just now

AI needs a strong data fabric to deliver business value

AI needs a strong data fabric to deliver business value Artificial intelligence is moving quickly in the enterprise, from experimentation to everyday use. Organizations are deploying copilots, agents, and predictive sys…

impact 16
arXiv AI · 5h ago

AI scientists produce results without reasoning scientifically

AI scientists produce results without reasoning scientifically arXiv:2604.18805v1 Announce Type: new Abstract: Large language model (LLM)-based systems are increasingly deployed to conduct scientific research autonomous…

impact 16
arXiv AI · 5h ago

Human-Guided Harm Recovery for Computer Use Agents

Human-Guided Harm Recovery for Computer Use Agents arXiv:2604.18847v1 Announce Type: new Abstract: As LM agents gain the ability to execute actions on real computer systems, we need ways to not only prevent harmful acti…

impact 16
arXiv AI · 5h ago

Error-free Training for MedMNIST Datasets

Error-free Training for MedMNIST Datasets arXiv:2604.18916v1 Announce Type: new Abstract: In this paper, we introduce a new concept called Artificial Special Intelligence by which Machine Learning models for the classif…

impact 16
arXiv AI · 5h ago

AutomationBench

AutomationBench arXiv:2604.18934v1 Announce Type: new Abstract: Existing AI benchmarks for software automation rarely combine cross-application coordination, autonomous API discovery, and policy adherence. Real business…

impact 16
arXiv AI · 5h ago

On Accelerating Grounded Code Development for Research

On Accelerating Grounded Code Development for Research arXiv:2604.19022v1 Announce Type: new Abstract: A major challenge for niche scientific and technical domains in leveraging coding agents is the lack of access to up…

impact 16
arXiv AI · 5h ago

Learning Lifted Action Models from Unsupervised Visual Traces

Learning Lifted Action Models from Unsupervised Visual Traces arXiv:2604.19043v1 Announce Type: new Abstract: Efficient construction of models capturing the preconditions and effects of actions is essential for applying…

impact 16
arXiv AI · 5h ago

Reasoning-Aware AIGC Detection via Alignment and Reinforcement

Reasoning-Aware AIGC Detection via Alignment and Reinforcement arXiv:2604.19172v1 Announce Type: new Abstract: The rapid advancement and widespread adoption of Large Language Models (LLMs) have elevated the need for rel…

impact 16
arXiv AI · 5h ago

Explicit Trait Inference for Multi-Agent Coordination

Explicit Trait Inference for Multi-Agent Coordination arXiv:2604.19278v1 Announce Type: new Abstract: LLM-based multi-agent systems (MAS) show promise on complex tasks but remain prone to coordination failures such as g…

impact 16
arXiv AI · 5h ago

SimDiff: Depth Pruning via Similarity and Difference

SimDiff: Depth Pruning via Similarity and Difference arXiv:2604.19520v1 Announce Type: new Abstract: Depth pruning improves the deployment efficiency of large language models (LLMs) by identifying and removing redundant…

impact 16
arXiv AI · 5h ago

Revac: A Social Deduction Reasoning Agent

Revac: A Social Deduction Reasoning Agent arXiv:2604.19523v1 Announce Type: new Abstract: Social deduction games such as Mafia present a unique AI challenge: players must reason under uncertainty, interpret incomplete a…

impact 16
arXiv AI · 5h ago

Detecting Data Contamination in Large Language Models

Detecting Data Contamination in Large Language Models arXiv:2604.19561v1 Announce Type: new Abstract: Large Language Models (LLMs) utilize large amounts of data for their training, some of which may come from copyrighte…

impact 16
arXiv AI · 5h ago

Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic

Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic arXiv:2604.19567v1 Announce Type: new Abstract: Reinforcement learning (RL) as post-training is crucial for enhancing the reasoning ability of large languag…

impact 16
arXiv AI · 5h ago

Time Series Augmented Generation for Financial Applications

Time Series Augmented Generation for Financial Applications arXiv:2604.19633v1 Announce Type: new Abstract: Evaluating the reasoning capabilities of Large Language Models (LLMs) for complex, quantitative financial tasks…

impact 16
arXiv AI · 5h ago

Two-dimensional early exit optimisation of LLM inference

Two-dimensional early exit optimisation of LLM inference arXiv:2604.18592v1 Announce Type: cross Abstract: We introduce a two-dimensional (2D) early exit strategy that coordinates layer-wise and sentence-wise exiting fo…

impact 16
arXiv AI · 5h ago

ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants

ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants arXiv:2604.18616v1 Announce Type: cross Abstract: LLM-based coding agents can generate functionally correct GPU kernels, yet their performance remains far b…

impact 16

All Categories