Ontology highlight
ABSTRACT:
SUBMITTER: Hou W
PROVIDER: S-EPMC10054955 | biostudies-literature | 2023 Mar
REPOSITORIES: biostudies-literature

Shang Xinyi X Liao Xu X Ji Zhicheng Z Hou Wenpin W
bioRxiv : the preprint server for biology 20250912
Large language models (LLMs) show promise in biomedical research, but their effectiveness for genomic inquiry remains unclear. We developed GeneTuring, a benchmark consisting of 16 genomics tasks with 1,600 curated questions, and manually evaluated 48,000 answers from ten LLM configurations, including GPT-4o (via API, ChatGPT with web access, and a custom GPT setup), GPT-3.5, Claude 3.5, Gemini Advanced, GeneGPT (both slim and full), BioGPT, and BioMedLM. A custom GPT-4o configuration integrated ...[more]