06/23 12:51 HAKARI-Bench: A Lightweight Benchmark for Comparing Retrieval Architectures and Efficiency Settings under Unified Conditions 
06/15 14:09 Learning by Chatting? Investigating the Impact of Generative AI on Information Seeking and Learning 
06/04 23:40 One Developer Is All You Need: A Case Study of an AI-Augmented One-Person Squad in a Brownfield Enterprise 
06/04 08:53 MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation 
05/25 21:25 The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity 
05/25 19:45 [PDF] Shojaee+ (2025) The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity 