researchr
explore
Tags
Journals
Conferences
Authors
Profiles
Groups
calendar
New Conferences
Events
Deadlines
search
search
You are not signed in
Sign in
Sign up
Links
Filter by Year
OR
AND
NOT
1
2024
Filter by Tag
Filter by Author
[+]
OR
AND
NOT
1
Alessio M. Pacces
Arthur Câmara
Bhashithe Abeysinghe
Bhaskar Mitra 0001
Chaneon Park
Charles L. A. Clarke
Clemencia Siro
Emine Yilmaz
Evangelos Kanoulas
Gabriel de Jesus
Guglielmo Faggioli
Honggu Lee
Hongyi Zhu
Hossein A. Rahmani
Hyunwoo Kim
Jakub Zavrel
Jheng-Hong Yang
Jia-Hong Huang
Jimmy Lin
Jin Young Kim
Filter by Top terms
[+]
OR
AND
NOT
1
10th
18
2024
analysis
answerability
applications
approaches
automated
evaluating
evaluation
framework
judgments
language
large
llm
llms
models
relevance
retrieval
using
LLM4Eval@SIGIR (llm4eval)
Editions
Publications
Viewing Publication 1 - 9 from 9
2024
The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches
Bhashithe Abeysinghe
,
Ruhan Circi
.
llm4eval 2024
:
4-18
[doi]
Exploring Large Language Models for Relevance Judgments in Tetun
Gabriel de Jesus
,
Sérgio Sobral Nunes
.
llm4eval 2024
:
19-30
[doi]
EXAM++: LLM-based Answerability Metrics for IR Evaluation
Naghmeh Farzi
,
Laura Dietz
.
llm4eval 2024
:
31-50
[doi]
A Novel Evaluation Framework for Image2Text Generation
Jia-Hong Huang
,
Hongyi Zhu
,
Yixian Shen
,
Stevan Rudinac
,
Alessio M. Pacces
,
Evangelos Kanoulas
.
llm4eval 2024
:
51-65
[doi]
Using LLMs to Investigate Correlations of Conversational Follow-up Queries with User Satisfaction
Hyunwoo Kim
,
Yoonseo Choi
,
Taehyun Yang
,
Honggu Lee
,
Chaneon Park
,
Yongju Lee
,
Jin Young Kim
,
Juho Kim
.
llm4eval 2024
:
66-91
[doi]
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework
Zackary Rackauckas
,
Arthur Câmara
,
Jakub Zavrel
.
llm4eval 2024
:
92-112
[doi]
LLMJudge: LLMs for Relevance Judgments
Hossein A. Rahmani
,
Emine Yilmaz
,
Nick Craswell
,
Bhaskar Mitra 0001
,
Paul Thomas 0001
,
Charles L. A. Clarke
,
Mohammad Aliannejadi
,
Clemencia Siro
,
Guglielmo Faggioli
.
llm4eval 2024
:
1-3
[doi]
Proceedings of The First Workshop on Large Language Models for Evaluation in Information Retrieval (LLM4Eval 2024) co-located with 10th International Conference on Online Publishing (SIGIR 2024), Washington D.C., USA, July 18, 2024
Clemencia Siro
,
Mohammad Aliannejadi
,
Hossein A. Rahmani
,
Nick Craswell
,
Charles L. A. Clarke
,
Guglielmo Faggioli
,
Bhaskar Mitra 0001
,
Paul Thomas 0001
,
Emine Yilmaz
, editors,
Volume 3752 of
CEUR Workshop Proceedings
, CEUR-WS.org,
2024.
[doi]
Toward Automatic Relevance Judgment using Vision-Language Models for Image-Text Retrieval Evaluation
Jheng-Hong Yang
,
Jimmy Lin
.
llm4eval 2024
:
113-123
[doi]
Sign in
or
sign up
to see more results.