
This is an umbrella project to collect the experimental acitivites going on in the institute around using large language models to help particle physicists work. This includes coding, attending conferences, reading and comprehending papers, et.c
hep-data-llm
a plot agent that experiments with taking theadl-benchmark-index
questions and hint files so that a LLM will generate and run the code. Complete with fairly complete evaluation metrics and ~20 open source and commercial models tested. Preceded by theatlas-plot-agent
project.- cmspiolot a plot agent that uses RAG techniques to target very small LLM’s. Fellow Project
- azure-light-rag RAG designed to work with very large corpuses of text (e.g. all the European Union stratigic update, or Snowmass documents). Uses RAG techniques plus entity extraction. Shows all the various problems that traditional RAG shows when working with very large amounts of data. Designed to run in the cloud, and be invoked as a tool from OpenAI’s chat-gpt tool.
- abstract-rankder - given a list of the users preferences will rank abstracts submitted to a conference and generate a spreadsheet can be used to navigate a large confernece like ICHEP or CHEP.
Team
- Gordon Watts
- David Lange
- Peter Elmer
- rrutaa