2026-02-19 14:57 |
[DKFZ-2026-00392]
Journal Article
Liu, Y. ; Carrero, Z. I. ; Jiang, X. ; et al.
Benchmarking large language model-based agent systems for clinical decision tasks.
Agentic artificial intelligence (AI) systems, designed to autonomously reason, plan, and invoke tools, have shown promise in healthcare, yet systematic benchmarking of their real-world performance remains limited. In this study, we evaluate two such systems: the open-source OpenManus, built on Meta's Llama-4 and extended with medically customized agents; and Manus, a proprietary agent system employing a multistep planner-executor-verifier architecture. [...]