arxiv:2604.10577
Taiwei Shi
MaksimSTW
AI & ML interests
reinforcement learning, alignment, human-AI collaboration, and computational social science
Recent Activity
authored a paper 11 days ago
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents liked a dataset 12 days ago
lime-nlp/OS-Blind