DoubleAgents: Fine-Tuning LLMs for Covert Malicious Tool Calls

(pub.aimind.so)

93 points | by grumblemumble 3 days ago ago

30 comments