2nd Workshop: Perspectives on the Evaluation of Recommender Systems
Workshop at ACM Recommender Systems 2022
Evaluation is essential when conducting rigorous research in recommender systems (RS). It may span the evaluation of early ideas and approaches up to elaborate systems in operation; it may target a wide spectrum of different aspects being evaluated. Naturally, we do (and have to) take various perspectives on the evaluation of RS. Thereby, the term “perspective” may, for instance, refer to various purposes of a RS, the various stakeholders affected by a RS, or the potential risks that ought to be minimized. Further, we have to consider that various methodological approaches and experimental designs represent different perspectives on evaluation. The perspective on the evaluation of RS may also be substantially characterized by the available resources. The access to resources will likely be different for PhD students compared to established researchers in industry.
Acknowledging that there are various perspectives on the evaluation of RS, we want to put into discussion whether there is a “golden standard” for the evaluation of RS, and—if so—if it indeed is “golden” in any sense. We postulate that the various perspectives are valid and reasonable, and aim to reach out to the community to discuss and reason about.
The goal of the workshop is to capture the current state of evaluation, and gauge whether there is, or should be, a different target that RS evaluation should strive for. The workshop will address the question: where should we go from here as a community? and aims at coming up with concrete steps for action.
We have a particularly strong commitment to invite and integrate researchers at the beginning of their careers and want to equally integrate established researchers and practitioners, from industry and academia alike. It is our particular concern to give a voice to the various perspectives involved.
Topics of interest include, but are not limited to, the following:
We deliberately solicit papers reporting problems and (negative) experiences regarding RS evaluation, as we consider the reflection on unsuccessful, inadequate or insufficient evaluations as a fruitful source for yet another perspective on RS evaluation that can spark discussions at the workshop. This also includes papers reporting negative study results. Accordingly, submissions may also address the following themes:
(a) “lessons learned” from the successful application of RS evaluation or from “post mortem” analyses describing specific evaluation strategies that failed to uncover decisive elements,
(b) “overview papers” analyzing patterns of challenges or obstacles to evaluation,
(c) “solution papers” presenting solutions for specific evaluation scenarios, and
(d) “visionary papers” discussing novel and future evaluation aspects will be considered as well.
We solicit two forms of contributions. First, we solicit paper submissions that will undergo peer review. Accepted papers will be published and presented at the workshop. Second, we offer the opportunity to present ideas without a paper submission. In this case, we call for the submission of abstracts that will be reviewed by the workshop organizers. Accepted abstracts will be presented at the workshop, but not published.
We solicit papers with 4 up to 12 pages (excluding references). Along the lines of this year’s call for papers of the main conference, we do not distinguish between full and short (or position) papers. Papers should be formatted in CEURART’s single column template: https://ceurws.wordpress.com/2020/03/31/ceurws-publishes-ceurart-paper-style/.
Submitted papers must not be under review in any other conference, workshop, or journal at the time of submission. Papers should be submitted through the workshop’s EasyChair page at https://easychair.org/conferences/?conf=perspectives2022.
Submissions will undergo single-blind peer review by a minimum of three program committee members and will be selected based on quality, novelty, clarity, and relevance. Authors of accepted papers will be invited to present their work during the workshop and will be published as open access workshop proceedings via ceur-ws.org. At least one author of each accepted paper must attend the workshop and present the work.
We solicit abstracts with 200-350 words, to be submitted through the workshop’s EasyChair page at https://easychair.org/conferences/?conf=perspectives2022.
The workshop organizers will select abstracts based on quality, clarity, relevance, and their potential to spark interesting discussion during the workshop. Authors of accepted abstracts will be invited to present their work during the workshop.
Please watch the videos of the accepted papers before the workshop takes place. There will be no time at the workshop to view videos, instead we will focus on discussion and active participation.
09:00-09:20 Welcome & Introduction
09:20-10:10 Keynote & Q&A: Kim Falk
10:10-10:30 Topic pitch (5 min pitch, 5 min questions)
10:10-10:20 Topic 1 - What aspects of recommendation should be tested and evaluated Video
10:20-10:30 Topic 2 - Challenges, Approaches and Evaluation Metrics
10:30-11:00 Break
11:00-12:15 Group discussions (topic 1 and topic 2) on site (breakout rooms on Zoom)
12:10-12:30 Wrap up
12:30-14:00 Lunch Break
14:00-16:30 Open round-table discussion
16:30-17:00 Wrap up
Times are in Pacific Daylight Time (PDT) (Seattle, WA, local time).
Kim Falk
Bio
Kim Falk is a Staff Recommender Engineer at Shopify, where he is the technical lead of the Product Recommendations team. Kim has experience in machine learning but is specialized in Recommender systems. Has previously worked on recommenders for customers like BT and RTL+. Worked on personalizing at IKEA. He is author of Practical Recommender Systems.
Recording of the Keynote
All Teaser Videos on a single page.
Extended abstract about the workshop as part of the RecSys proceedings.
Proceedings (CEUR-WS.org).
Learning Choice Models for Simulating Users' Interactions with Recommender Systems
Naieme Hazrati, Francesco Ricci
Video
Towards Comparing Recommendation to Multiple-Query Search Sessions for Talent Search
Mesut Kaya, Toine Bogers
Video
Recommender Systems Are Not Everything: Towards a Broader Perspective in Recommender Evaluation
Benedikt Loepp
Video
Diversifying Sentiments in News Recommendation
Mete Sertkan, Sophia Althammer, Sebastian Hofstätter, Julia Neidhardt
Video
Are We Forgetting Something? Correctly Evaluate a Recommender System With an Optimal Training Window
Robin Verachtert, Lien Michiels, Bart Goethals
Video
CaMeLS: Cooperative Meta-Learning Service for Recommender Systems
Lukas Wegmeth, Joeran Beel
Video
Should Algorithm Evaluation Extend to Testing? We Think So
Lien Michiels, Robin Verachtert, Kim Falk, Bart Goethals
Video
Multi-domain news recommender systems at Globo: Challenges, Approaches and Evaluation Metrics
Joel Pinho Lucas, Letícia Freire de Figueiredo, Felipe Alves Ferreira
Video