FACTS Benchmark Suite: Systematically evaluating the factuality of large language models
Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.