Skip to content

Proceedings

UMAP 2024 online proceedings can now be viewed from the following links:

TOC Main Proceedings

The ACM OpenTOC service enables visitors to download the articles below at no charge.

SESSION: Personalization for Persuasive and Behavior Change Systems

Avoiding Decision Fatigue with AI-Assisted Decision-Making

  • Jessica Maria Echterhoff
  • Aditya Melkote
  • Sujen Kancherla
  • Julian McAuley

During online browsing, e.g. when looking to select a movie to watch, we are often confronted with multiple rejection-selection steps which can lead to tens or hundreds of decisions made in quick succession. It is unclear if showing the next “best” item, as often employed by standard recommenders, is the most efficient way to help users select an item. In this work, we show that we can reduce the number of decisions to selection with a reinforcement learning-based Decision Minimizer Network (DMN). By implementing a step-aware reward function we can penalize long sequences, leading to fewer decisions having to be made by humans. Using a task to select a movie to watch, we show that we can reduce the number of decisions to selection by 39% compared to heuristic strategies and by 20% compared to standard recommender while increasing user selection satisfaction. Minimizing the number of decision steps can finally help to reduce decision fatigue, which refers to the deteriorating quality of decisions made by an individual after a long session of decision steps, and help to prevent infinite scrolling.

Designing Effective Warnings for Manipulative Designs in Mobile Applications

  • Elaheh Jafari
  • Julita Vassileva

There is a notable rise in websites and mobile apps that use manipulative (also known as “deceptive”) designs or “dark patterns”. Leveraging visual perception effects and cognitive biases or object manipulations, these designs influence user behavior in ways that may not be beneficial or can even be harmful for users. It is important to both warn and educate users about manipulative designs. While numerous studies have investigated warning designs across various domains, little attention has been given to exploring how to warn users about the presence of manipulative designs in applications. We conducted a user study with a three-level warning about the presence of manipulative designs on a simulated app page on the Google Play Store and explored the impact of different warning levels on user attention and decision-making. We also explored possibilities for personalization of warning levels based on the user’s personality (Big 5) characteristics. While our findings did not discover opportunities for personalization, they underscore the benefit of a multi-level warning design, and the pivotal role of visual elements in capturing attention, complemented by the contribution of textual explanations and more details on demand. We discuss the factors influencing users to install an app despite being informed about the presence of manipulative designs and demonstrate how app distribution platforms can embed warnings in the app information to prevent or mitigate the harms of manipulative designs.

SESSION: Fairness, Transparency, Accountability, and Privacy

Bringing Equity to Coarse and Fine-Grained Provider Groups in Recommender Systems

  • Elizabeth Gómez
  • David Contreras
  • Maria Salamo
  • Ludovico Boratto

Provider fairness aims at regulating the recommendation lists, so that the items of different providers/provider groups are suggested by respecting notions of equity. When group fairness is among the goals of a system, a common way is to use coarse groups since the number of considered provider groups is usually small (e.g., two genders, or three/four age groups) and the number of items per group is large. From a practical point of view, having few groups makes it easier for a platform to manage the distribution of equity among them. Nevertheless, there are sensitive attributes, such as the age or the geographic provenance of the providers that can be characterized at a fine granularity (e.g., one might group providers at the country level, instead of the continent one), which increases the number of groups and decrements the number of items per group. In this study, we show that, in large demographic groups, when considering coarse-grained provider groups, the fine-grained provider groups are under-recommended by the state-of-the-art models. To overcome this issue, in this paper, we present an approach that brings equity to both coarse and fine-grained provider groups. Experiments on two real-world datasets show the effectiveness of our approach.

Disentangling Web Search on Debated Topics: A User-Centered Exploration

  • Alisa Rieger
  • Suleiman Kulane
  • Ujwal Gadiraju
  • Maria Soledad Pera

When using web search engines to conduct inquiries on debated topics, searchers’ interactions with search results are commonly affected by a combination of searcher and system biases. While prior work has mainly investigated these biases in isolation, there is a lack of a comprehensive understanding of web search on debated topics. Addressing this gap, we conducted an exploratory user study (N = 255), aimed at advancing the understanding of the intricate searcher-system interplay. Particularly, we investigated the relations between (i) search system exposure, searchers’ attitude strength, prior knowledge, and receptiveness to opposing views, (ii) search interactions, and (iii) post-search epistemic states. We observed that search interaction was shaped by search system exposure, attitude strength, and prior knowledge, and that attitude change was influenced by the level of confirmation bias and initial attitude strength, but not search system exposure. Insights from this work suggest the need to adapt interventions that mitigate the risks of searcher and system bias to searchers’ nuanced pre-search epistemic states. They further emphasize the threat of customizing the search ranking to enhance user satisfaction in the context of debated topics to responsible opinion formation.

Explaining the Unexplainable: The Impact of Misleading Explanations on Trust in Unreliable Predictions for Hardly Assessable Tasks

  • Mersedeh Sadeghi
  • Daniel Pöttgen
  • Patrick Ebel
  • Andreas Vogelsang

To increase trust in systems, engineers strive to create explanations that are as accurate as possible. However, if the system’s accuracy is compromised, providing explanations for its incorrect behavior may inadvertently lead to misleading explanations. This concern is particularly pertinent when the correctness of the system is difficult for users to judge. In an online survey experiment with 162 participants, we analyze the impact of misleading explanations on users’ perceived and demonstrated trust in a system that performs a hardly assessable task in an unreliable manner. Participants who used a system that provided potentially misleading explanations rated their trust significantly higher than participants who saw the system’s prediction alone. They also aligned their initial prediction with the system’s prediction significantly more often. Our findings underscore the importance of exercising caution when generating explanations, especially in tasks that are inherently difficult to evaluate. The paper and supplementary materials are available at https://doi.org/10.17605/osf.io/azu72

EXtrA-ShaRC: Explainable and Scrutable Reading Comprehension for Conversational Systems

  • Jerome Ramos
  • Aldo Lipani

Conversational Machine Reading (CMR) systems answer high-level user questions by interpreting contextual information, asking clarification questions, and generating human-like responses. While effective, such systems often use knowledge about the task and the user in a non-transparent and non-scrutable way. For example, if a user wants to ask questions like “Why are you asking this?” or “Why is this the correct answer?”, the system should be able to highlight and return the relevant information that led to the decision in an interpretable manner. Similarly, if a user scrutinizes and edits their user profile, the final output of the model should change accordingly. To test the transparency and scrutability of conversational machine reading systems, we formalize two new tasks by extending the ShARC dataset to create the EXtrA-ShARC dataset. For transparency, we propose a baseline model that can simultaneously extract explanations and answer the user’s question. We will also publicly release counterfactual user profiles to test scrutability for all CMR models. Our dataset opens up a range of research directions for using natural language explanations and counterfactual profiles in conversational systems, both for evaluating the model and increasing transparency for end users.

Optimizing Neighborhoods for Fair Top-N Recommendation

  • Stavroula Eleftherakis
  • Georgia Koutrika
  • Sihem Amer-Yahia

We address demographic bias in neighborhood-learning models for collaborative filtering recommendations. Despite their superior ranking performance, these methods can learn neighborhoods that inadvertently foster discriminatory patterns. Little work exists in this area, highlighting an important research gap. A notable yet solitary effort, Balanced Neighborhood Sparse LInear Method (BNSLIM) aims at balancing neighborhood influence across different demographic groups. Yet, BNSLIM is hampered by computational inefficiency, and its rigid balancing approach often impacts accuracy. In that vein, we introduce two novel algorithms. The first, an enhancement of BNSLIM, incorporates the Alternating Direction Method of Multipliers (ADMM) to optimize all similarities concurrently, greatly reducing training time. The second, Fairly Sparse Linear Regression (FSLR), induces controlled sparsity in neighborhoods to reveal correlations among different demographic groups, achieving comparable efficiency while being more accurate. Their performance is evaluated using standard exposure metrics alongside a new metric for user coverage disparities. Our experiments cover various applications, including a novel exploration of bias in course recommendations by teachers’ country development status. Our results show the effectiveness of our algorithms in imposing fairness compared to BNSLIM and other well-known fairness approaches.

Rewriting Bias: Mitigating Media Bias in News Recommender Systems through Automated Rewriting

  • Qin Ruan
  • Jin Xu
  • Susan Leavy
  • Brian Mac Namee
  • Ruihai Dong

Personalised news recommender systems are effective in disseminating news content based on users’ reading histories but can also amplify and proliferate biased media. This work examines the potential of automated sentence rewriting methods, utilising word replacement methods and large language models (LLMs), to mitigate this side effect of recommender systems. We present a two-step workflow: the application of automated sentence rewriting methods to rewrite biased sentences, and the integration of these rewritten sentences into the recommendation process. We evaluate the effectiveness of sentence rewriting approaches in a simulation framework, to assess how well they mitigate the spread of biased news. Our study demonstrates that applying sentence rewriting to users’ reading histories can result in a significant reduction in the propagation of biased media. Our contributions are threefold: we pioneer the use of LLMs for mitigating the spread of biased news by recommender systems; we demonstrate that algorithms trained on debiased content maintain or improve recommendation accuracy; and we provide a comprehensive exploration of the effectiveness of applying sentence rewriting methods to various components within a recommender system, as well as an investigation of the underlying reasons for their efficacy. This work advances our understanding of media bias mitigation in news content and recommendation algorithms, providing valuable insights into how news recommender systems can prevent the dissemination of biased information.

User Perception of Fairness-Calibrated Recommendations

  • Gabrielle Alves
  • Dietmar Jannach
  • Rodrigo Ferrari De Souza
  • Marcelo Garcia Manzato

The research community has become increasingly aware of possible undesired effects of algorithmic biases in recommender systems. One common bias in such systems is to over-proportionally expose certain items to users, which may ultimately result in a system that is considered unfair to individual stakeholders. From a technical perspective, calibration approaches are commonly adopted in such situations to ensure that the individual user’s preferences are better taken into account, thereby also leading to a more balanced exposure of items overall. Given the known limitations of today’s predominant offline evaluation approaches, our work aims to contribute to a better understanding of the users’ perception of the fairness and quality of recommendations when these are served in a calibrated way. Therefore, we conducted an online user study (N=500) in which we exposed the treatment groups with recommendations calibrated for fairness in terms of two different item characteristics. Our results show that calibration can indeed be effective in guiding the users’ choices towards the “fairness items” without negatively impacting the overall quality perception of the system. We however also found that calibration did not measurably impact the users’ fairness perceptions unless explanatory information is provided by the system. Finally, our study points to challenges when applying calibration approaches in practice in terms of finding appropriate parameters.

When in Doubt! Understanding the Role of Task Characteristics on Peer Decision-Making with AI Assistance

  • Sara Salimzadeh
  • Ujwal Gadiraju

With the integration of AI systems into our daily lives, human-AI collaboration has become increasingly prevalent. Prior work in this realm has primarily explored the effectiveness and performance of individual human and AI systems in collaborative tasks. While much of decision-making occurs within human peers and groups in the real world, there is a limited understanding of how they collaborate with AI systems. One of the key predictors of human-AI collaboration is the characteristics of the task at hand. Understanding the influence of task characteristics on human-AI collaboration is crucial for enhancing team performance and developing effective strategies for collaboration. Addressing a research and empirical gap, we seek to explore how the features of a task impact decision-making within human-AI group settings. In a 2 × 2 between-subjects study (N = 256) we examine the effects of task complexity and uncertainty on group performance and behaviour. The participants were grouped into pairs and assigned to one of four experimental conditions characterized by varying degrees of complexity and uncertainty. We found that high task complexity and high task uncertainty can negatively impact the performance of human-AI groups, leading to decreased group accuracy and increased disagreement with the AI system. We found that higher task complexity led to a higher efficiency in decision-making, while a higher task uncertainty had a negative impact on efficiency. Our findings highlight the importance of considering task characteristics when designing human-AI collaborative systems, as well as the future design of empirical studies exploring human-AI collaboration.

SESSION: Personalizing Learning Experiences through User Modeling

Challenge variance: Exploiting format differences for personalized learner models

  • Charles Lang
  • Korrin Ostrow

In this study, we present an approach to utilizing variance in students’ performance across different formats (multiple-choice, numeric input, word problems) as a target for personalization. We have developed a measure called challenge variance, that indicates the degree to which different formats pose varying levels of challenge for individual learners. We investigated whether challenge variance could be a useful source of information for developing learner models by analyzing data from an online math tutoring platform. Results demonstrated that challenge variance has a relationship with an external activity, indicating its utility as a means of predicting how well a learner will perform in a new setting. We discuss the affordances and issues with the measure and whether or not it could be a useful additional tool in developing personalized learner models as an intuitive and platform-agnostic measure of performance.

Discerning Individual Preferences for Identifying and Flagging Misinformation on Social Media

  • Dipto Barman
  • Kevin Koidl
  • Owen Conlan

As social media grapples with the proliferation of misinformation, flagging systems emerge as vital digital tools that alert users to potential falsehoods, balancing the preservation of free speech. The efficacy of these systems hinges on user interpretation and reaction to the flags provided. This study probes the influence of warning flags on user perceptions, assessing their effect on the perceived accuracy of information, the propensity to share content, and the trust users have in these warnings, especially when supplemented with fact-checking explanations. Through a within-subject experiment involving 348 American participants, we mimicked a social media feed with a series of COVID-19-related headlines, both true and false, in various conditions—with flags, with flags and explanatory text, and without any intervention. Explanatory content was derived from fact-checking sites linked to the news items. Our findings indicate that false news is perceived as less accurate when flagged or accompanied by explanatory text. The presence of explanatory text correlates with heightened trust in the flags. Notably, participants with high levels of neuroticism and a deliberative cognitive thinking style showed a higher trust for explanatory text alongside warning flags. Conversely, participants with conservative leanings exhibited distrust towards social media flagging systems. These results underscore the importance of clear explanations within flagging mechanisms and support a user-centric model in their design, emphasising transparency and engagement as essential in counteracting misinformation on social media.

Not a Team but Learning as One: The Impact of Consistent Attendance on Discourse Diversification in Math Group Modeling

  • Mark Abdelshiheed
  • Jennifer Jacobs
  • Sidney D’Mello

This work investigates relationships between consistent attendance —attendance rates in a group that maintains the same tutor and students across the school year— and learning in small group tutoring sessions. We analyzed data from two large urban districts consisting of 206 9th-grade student groups (3 − 6 students per group) for a total of 803 students and 75 tutors. The students attended small group tutorials approximately every other day during the school year and completed a pre and post-assessment of math skills at the start and end of the year, respectively. First, we found that the attendance rates of the group predicted individual assessment scores better than the individual attendance rates of students comprising that group. Second, we found that groups with high consistent attendance had more frequent and diverse tutor and student talk centering around rich mathematical discussions. Whereas we emphasize that changing tutors or groups might be necessary, our findings suggest that consistently attending tutorial sessions as a group with the same tutor might lead the group to implicitly learn as a team despite not being one.

Will a Skills Passport ever get me through the lifelong learning border?: Two critical challenges facing personalised user models for lifelong learning

  • Kirsty Kitto

Lifelong personalised learning is often described as the holy grail of the educational data sciences, but work on the topic is sporadic and we are yet to achieve this goal in a meaningful form. In the wake of the skills shortages arising from national responses to COVID-19 this problem has again become a topic of interest. A number of proposals have emerged that some sort of a skills passport would help individuals, educational institutions, and employers to identify training and recruitment needs according to identified skills gaps. And yet, we are a long way from achieving a skills passport that could support lifelong learning despite more than 25 years of work on the topic. This paper draws attention to two of the critical socio-technical challenges facing skills passports, and lifelong learner models in general. This leads to a proposal for how we might move towards a useful skills passport that can cross the “skills sector border”.

SESSION: Personalized Recommender Systems

Beyond Trade-offs: Unveiling Fairness-Constrained Diversity in News Recommender Systems

  • Célina Treuillier
  • Sylvain Castagnos
  • Özlem Özgöbek
  • Armelle Brun

Recommender Systems have played an important role in our daily lives for many years. However, it is only recently that their social impact has raised ethical issues and has thus been considered in the design of such systems. Particularly, News Recommender Systems (NRS) have a critical influence on individuals. NRS can provide overspecialized recommendations and enclose users into filter bubbles. Besides, NRS can influence users and make their original opinions diverge. Worse, they can orient users’ opinions towards more radical views. The literature has worked on these issues by leveraging diversity and fairness in the recommendation algorithms, but generally only one of these dimensions at a time. We propose to consider both diversity and fairness simultaneously to provide recommendations that are fair, diverse, and obviously accurate. To this end, we propose a novel recommendation framework, Accuracy-Diversity-Fairness (ADF), which considers that fairness is not at the expense of diversity. Concretely, fairness is approached as a constraint on diversity. Experiments highlight that constraining diversity by fairness remarkably contributes to providing recommendations 5 times more diverse than models of the literature, without any loss in accuracy.

Creating an Intelligent Social Media Campaign Decision-Support Method

  • Amir Gabay
  • Adir Solomon
  • Ido Guy
  • Bracha Shapira

Predicting the success of marketing campaigns on social media can help improve campaign managers’ decision-making (e.g., deciding to stop a marketing campaign) and thus increase their profits. Most research in the field of online marketing has focused on analyzing users’ behavior rather than improving campaign manager decision-making. Furthermore, determining the success of marketing campaigns is quite challenging due to the large number of possible metrics that must be analyzed daily. In this study, we suggest a method that incorporates machine learning models with traditional business rules to provide daily decision recommendations, based on the various metrics and considerations, and aimed at achieving the campaign’s goals. We evaluate our approach on a unique dataset collected from the most popular social networks, Facebook and Instagram. Our evaluation demonstrates the proposed method’s ability to outperform an expert-based method and the machine learning baselines examined, and dramatically increase the campaign managers’ profits.

Emotion-Based Music Recommendation from Quality Annotations and Large-Scale User-Generated Tags

  • Marta Moscati
  • Hannah Strauß
  • Peer-Ole Jacobsen
  • Andreas Peintner
  • Eva Zangerle
  • Marcel Zentner
  • Markus Schedl

Emotions constitute an important aspect when listening to music. While manual annotations from user studies grounded in psychological research on music and emotions provide a well-defined and fine-grained description of the emotions evoked when listening to a music track, user-generated tags provide an alternative view stemming from large-scale data. In this work, we examine the relationship between these two emotional characterizations of music and analyze their impact on the performance of emotion-based music recommender systems individually and jointly. Our analysis shows that (i) the agreement between the two characterizations, as measured with Cohen’s κ coefficient and Kendall rank correlation, is often low, (ii) Leveraging the emotion profile based on the intensity of evoked emotions from high-quality annotations leads to performances that are stable across different recommendation algorithms; (iii) Simultaneously leveraging the emotion profiles based on high-quality and large-scale annotations allows to provide recommendations that are less exposed to the low accuracy that algorithms might reach when leveraging one type of data, only.

Evaluating Content-based Pre-Training Strategies for a Knowledge-aware Recommender System based on Graph Neural Networks

  • Giuseppe Spillo
  • Francesco Bottalico
  • Cataldo Musto
  • Marco De Gemmis
  • Pasquale Lops
  • Giovanni Semeraro

In this paper, we introduce a Knowledge-aware Recommender System (KARS) based on Graph Neural Networks that exploit pre-trained content-based embeddings to improve the representation of users and items. Our approach relies on the intuition that textual features can describe the items in the catalog from a different point of view, so they are worth to be exploited to provide users with more accurate recommendations. Accordingly, we used encoding techniques to learn a pre-trained representation of the items in the catalogue based on textual content, and we used these embeddings to feed the input layer of a KARS based on GCNs. In this way, the GCN is able to encode both the knowledge coming from the unstructured content and the structured knowledge provided by the KG (ratings and item descriptive properties). As shown in our experiments, the exploitation of pre-trained embeddings improves the predictive accuracy of the KARS, which overcomes all the baselines we considered in several experimental settings.

Improving Transformer-based Sequential Conversational Recommendations through Knowledge Graph Embeddings

  • Alessandro Petruzzelli
  • Alessandro Francesco Maria Martina
  • Giuseppe Spillo
  • Cataldo Musto
  • Marco De Gemmis
  • Pasquale Lops
  • Giovanni Semeraro

Conversational Recommender Systems (CRS) have recently drawn attention due to their capacity of delivering personalized recommendations through multi-turn natural language interactions. In this paper, we fit into this research line and we introduce a Knowledge-Aware Sequential Conversational Recommender System (KASCRS) that exploits transformers and knowledge graph embeddings to provide users with recommendations in a conversational setting.

In particular, KASCRS is able to predict a suitable recommendation based on the elements that are mentioned in a conversation between a user and a CRS. To do this, we design a model that: (i) encodes each conversation as a sequence of entities that are mentioned in the dialogue (i.e., items and properties), and (ii) is trained on a cloze task, that is to say, it learns to predict the final element in the sequence – that corresponds to the item to be recommended – based on the information it has previously seen.

The model has two main hallmarks: first, we exploit Transformers and self-attention to capture the sequential dependencies that exist among the entities that are mentioned in the training dialogues, in a way similar to session-based recommender systems [25]. Next, we used knowledge graphs (KG) to improve the quality of the representation of the elements mentioned in each sequence. Indeed, we exploit knowledge graph embeddings techniques to pre-train the representation of items and properties, and we fed the input layer of our architecture with the resulting embeddings. In this way, KASCRS integrates both knowledge from the KGs as well as the dependencies and the co-occurrences emerging from conversational data, resulting in a more accurate representation of users and items. Our experiments confirmed this intuition, since KASCRS overcame several state-of-the-art baselines on two different datasets.

Integrating sentiment features in factorization machines: Experiments on music recommender systems

  • Javier Wang
  • Alejandro Bellogin
  • Iván Cantador

Music recommender systems play a pivotal role in catering to diverse user preferences and fostering personalized listening experiences. At the same time, sentiments can profoundly influence music by shaping its emotional expression and evoking specific moods onto listeners. Expressed in textual content, these sentiments may be analyzed through natural language processing techniques to gauge emotions or opinions, hopefully increasing their relevance when exploited for recommendation. This work aims to investigate how to better integrate such information and understand its potential impact on personalized music suggestions, attempting to enhance recommendation models by incorporating sentiment features into factorization machines. For this purpose, a dataset was collected from Last.fm and enhanced with sentiment information extracted from Wikipedia. Empirical results evidence that not all sentiment-related features are equally useful, showing that each tested factorization machine approach varies in sensitivity to these features. Source code and data are available at https://github.com/abellogin/SentiFMRecSys.

Modeling user personality traits from aesthetic preference on multiple images

  • Marta Micheli
  • Alberto Valese

In recent years, people have been spending more and more time on social media. Within the realm of multimedia contents used by platforms, the quantity of visuals is certainly growing in significance. Interaction data enables to know the users’ favourite images. This information could be exploited to gain a deeper insight into their psychological profile, since the literature on automatic personality recognition suggests that personality traits may correlate with aesthetics. In this paper we explore the use of personal preference on multiple images to predict personality traits of users. Unlike previous works, we propose a model that exploits ResNet50, a Convolutional Neural Network, to automatically extract features from the images in the PsychoFlickr dataset. We then fit five independent linear regressors on these features to detect personality. In order to determine whether using more than one image leads to better results, we train the model multiple times, using one to five images as input, and we compare the performances. Our method seems to outperform the related state-of-the-art works.

Negative Feedback for Music Personalization

  • M. Jeffrey Mei
  • Oliver Bembom
  • Andreas F. Ehmann

Next-item recommender systems are often trained using only positive feedback with randomly-sampled negative feedback. We show the benefits of using real negative feedback both as inputs into the user sequence and also as negative targets for training a next-song recommender system for internet radio. In particular, using explicit negative samples during training helps reduce training time by ∼ 60% while also improving test accuracy by 6%; adding user skips as additional inputs also can considerably increase user coverage alongside improving accuracy. We test the impact of using a large number of random negative samples to capture a ‘harder’ one and find that the test accuracy increases with more randomly-sampled negatives, but only to a point. Too many random negatives leads to false negatives that limits the lift, which is still lower than if using true negative feedback. We also find that the test accuracy is fairly robust with respect to the proportion of different feedback types, and compare the learned embeddings for different feedback types.

Shaping the Future of Content-based News Recommenders: Insights from Evaluating Feature-Specific Similarity Metrics

  • Daniel Rosnes
  • Alain D. Starke
  • Christoph Trattner

In news media, recommender system technology faces several domain-specific challenges. The continuous stream of new content and users deems content-based recommendation strategies, based on similar-item retrieval, to remain popular. However, a persistent challenge is to select relevant features and corresponding similarity functions, and whether this depends on the specific context. We evaluated feature-specific similarity metrics using human similarity judgments across national and local news domains. We performed an online experiment (N = 141) where we asked participants to judge the similarity between pairs of randomly sampled news articles. We had three contributions: (1) comparing novel metrics based on large language models to ones traditionally used in news recommendations, (2) exploring differences in similarity judgments across national and local news domains, and (3) examining which content-based strategies were perceived as appropriate in the news domain. Our results showed that one of the novel large language model based metrics (SBERT) was highly correlated with human judgments, while there were only small, most non-significant differences across national and local news domains. Finally, we found that while it may be possible to automatically recommend similar news using feature-specific metrics, their representativeness and appropriateness varied. We explain how our findings can guide the design of future content-based and hybrid recommender strategies in the news domain.

User Perceptions of Diversity in Recommender Systems

  • Patrik Dokoupil
  • Ludovico Boratto
  • Ladislav Peska

In the context of recommender systems (RS), the concept of diversity is probably the most studied perspective beyond mere accuracy. Despite the extensive development of diversity measures and enhancement methods, the understanding of how users perceive diversity in recommendations remains limited. This gap hinders progress in multi-objective RS, as it challenges the alignment of algorithmic advancements with genuine user needs. Addressing this, our study delves into two key aspects of diversity perception in RS. We investigate user responses to recommendation lists generated using varied diversity metrics but identical diversification thresholds, and lists created with the same metrics but differing thresholds. Our findings reveal a user preference for metadata and content-based diversity metrics over collaborative ones. Interestingly, while users typically recognize more diversified lists as being more diverse in scenarios with significant diversification differences, this perception is not consistently linear and quickly diminishes when the diversification variance between lists is less pronounced. This study sheds light on the nuanced user perceptions of diversity in RS, providing valuable insights for the development of more user-centric recommendation algorithms. Study data and analysis scripts are available from https://osf.io/9y8gx/.

SESSION: Intelligent User Interfaces

FitSight: Tracking and Feedback Engine for Personalized Fitness Training

  • Hitesh Kotte
  • Florian Daiber
  • Milos Kravcik
  • Nghia Duong-Trung

Physical fitness presents a significant challenge in ensuring proper exercise posture. Individuals who work out need help maintaining correct exercise posture during their workouts. Maintaining correct form is critical for ensuring the safety and effectiveness of fitness routines. Yet, it is often challenging for individuals to keep proper form without professional guidance, which usually comes at expensive costs. The paper presents a novel method that utilizes the capabilities of YOLOv7 and a primary web camera to offer immediate feedback and correction on body posture during gym activities. Such a method empowers individuals to correct themselves and promotes motivation even without the presence of a professional trainer. This system has been developed to provide immediate, personalized feedback for various fitness exercises. It efficiently counts repetitions and provides textual guidance for improvement, tailored to the specific requirements of fitness enthusiasts. To determine the efficacy of our technology, we carried out a user study in a controlled laboratory setting simulating a gym environment. The study compares our interactive system with the traditional training method, involving participants of varied fitness levels. It showed significant improvements in exercise technique with real-time feedback. These findings are crucial for AI-supported training systems in strength training, underscoring the need for adaptive technologies for different user experiences. The research contributes to human-computer interaction and fitness technology discussions, highlighting interactive models’ potential to augment and sometimes replicate personal training benefits in exercise form and posture improvement.

Good GUIs, Bad GUIs: Affective Evaluation of Graphical User Interfaces

  • Syrine Haddad
  • Kayhan Latifzadeh
  • Saravanakumar Duraisamy
  • Jean Vanderdonckt
  • Olfa Daassi
  • Safya Belghith
  • Luis A. Leiva

Affective computing has potential to enrich the development lifecycle of Graphical User Interfaces (GUIs) and of intelligent user interfaces by incorporating emotion-aware responses. Yet, affect is seldom considered to determine whether a GUI design would be perceived as good or bad. We study how physiological signals can be used as an early, effective, and rapid affective assessment method for GUI design, without having to ask for explicit user feedback. We conducted a controlled experiment where 32 participants were exposed to 20 good GUI and 20 bad GUI designs while recording their eye activity through eye tracking, facial expressions through video recordings, and brain activity through electroencephalography (EEG). We observed noticeable differences in the collected data, so we trained and compared different computational models to tell good and bad designs apart. Taken together, our results suggest that each modality has its own “performance sweet spot” both in terms of model architecture and signal length. Taken together, our findings suggest that is possible to distinguish between good and bad designs using physiological signals. Ultimately, this research paves the way toward implicit evaluation methods of GUI designs through user modeling.

Initial results on personalizing explanations of AI hints in an ITS

  • Vedant Bahel
  • Harshinee Sriram
  • Cristina Conati

Previous research on an Intelligent Tutoring System (referred to as ACSP), showed the need to personalize explanations of its AI-driven hints for users with low Need for Cognition (N4C) and low Conscientiousness (Cons.). Specifically, this work found that explanations should be provided to these users with the objective of increasing user interaction with them. In this paper, we present and evaluate design alterations to the original ACSP explanation interface aimed at achieving this objective. Our results provide initial evidence that the implemented personalization, in the form of the design alterations, had a positive impact on users with low N4C and Cons., by increasing attention to explanations and contributing to learning gains.

Preference Elicitation in Interactive and User-centered Algorithmic Recourse: an Initial Exploration

  • Seyedehdelaram Esfahani
  • Giovanni De Toni
  • Bruno Lepri
  • Andrea Passerini
  • Katya Tentori
  • Massimo Zancanaro

Algorithmic Recourse aims to provide actionable explanations, or recourse plans, to overturn potentially unfavourable decisions taken by automated machine learning models. In this paper, we propose an interaction paradigm based on a guided interaction pattern aimed at both eliciting the users’ preferences and heading them toward effective recourse interventions. In a fictional task of money lending, we compare this approach with an exploratory interaction pattern based on a combination of alternative plans and the possibility of freely changing the configurations by the users themselves. Our results suggest that users may recognize that the guided interaction paradigm improves efficiency. However, they also feel less freedom to experiment with “what-if” scenarios. Nevertheless, the time spent on the purely exploratory interface tends to be perceived as a lack of efficiency, which reduces attractiveness, perspicuity, and dependability. Conversely, for the guided interface, more time on the interface seems to increase its attractiveness, perspicuity, and dependability while not impacting the perceived efficiency. That might suggest that this type of interfaces should combine these two approaches by trying to support exploratory behavior while gently pushing toward a guided effective solution.

Proponents as the Means to Increase the Uptake of Recommendations

  • Rikako Matsushima
  • Yoshinori Hijikata
  • Shlomo Berkovsky

While much research in recommender systems focused on improving the accuracy of recommendations, issues pertaining to their presentation have been under-explored. Considering the uptake of recommendations as one of their success indicators, we investigate the role of proponents in affecting user’s decision to accept a recommendation. We refer to proponent as a person or avatar, advocating in favor of the recommended item. This paper reports on a user study that evaluated the impact of including several types of proponents in the recommender interface and their impact on the uptake of recommendations. We observe that out of the studied proponents, real-world contacts have the strongest impact on the uptake of recommendations, which can inform the design recommender system interfaces.

Toward Tone-Aware Explanations in Recommender Systems

  • Ayano Okoso
  • Keisuke Otaki
  • Satoshi Koide
  • Yukino Baba

In recommender systems, the presentation of explanations plays a crucial role in supporting users’ decision-making processes. Although numerous existing studies have focused on the effects (e.g., transparency) of explanation content, explanation expression is largely overlooked. Tone, such as formal and humorous, is directly linked to expressiveness and is an important element in human communication. However, studies on the impact of tone on explanations within the context of recommender systems are insufficient. Therefore, this study investigates the tonal effects of explanations through an online user study. We focus on a hotel domain and six types of tones. The collected data analysis reveals that the tone of explanations influences the perceived effects, such as trust and effectiveness, of recommender systems. Our findings suggest that the tone of explanations can enhance user experience in recommender systems.

SESSION: Virtual Assistants, Conversational Interactions, and Personalized Human-Robot Interaction

ChatGPT as a Conversational Recommender System: A User-Centric Analysis

  • Ahtsham Manzoor
  • Samuel C. Ziegler
  • Klaus Maria. Pirker Garcia
  • Dietmar Jannach

With the rapid advances in deep learning, we have witnessed a strongly increased interest in conversational recommender systems (CRS). Until recently, however, even the latest generative models exhibited major limitations and they frequently return non-meaningful responses according to previous studies. However, with the latest Generative AI-based dialog systems implemented with Generative Pre-Trained Transformer (GPT) models, a new era has arrived for CRS research. In this work, we study the use of ChatGPT as a movie recommender system. To this purpose, we conducted an online user study involving N=190 participants, who were tasked to evaluate ChatGPT’s responses in a multitude of dialog situations. As a reference point for the analysis, we included a retrieval-based conversational method in the experiment, which was found to be a robust approach in previous research.

Our study results indicate that the responses by ChatGPT were perceived to be significantly better than those by the previous system in terms of their meaningfulness. A detailed inspection of the results showed that ChatGPT excelled when providing recommendations, but sometimes missed the context when asked questions about a movie within a longer dialog. A statistical analysis revealed that information adequacy and recommendation accuracy of the responses had the strongest influence on the perceived meaningfulness of the responses. Finally, an additional analysis showed that the human perceptions of meaningfulness correlated only very weakly with computational metrics such as BLEU or ROUGE, emphasizing the importance of involving humans in the evaluation of a CRS.

Does the Long Tail of Context Exist and Matter? The Case of Dialogue-based Recommender Systems

  • Konstantin Bauman
  • Alexey Vasilev
  • Alexander Tuzhilin

Context has been an important topic in recommender systems over the past two decades. Most of the prior CARS papers manually selected and considered only a few crucial contextual variables in an application, such as time, location, and company of a person. This prior work demonstrated significant recommendation performance improvements when various CARS-based methods have been deployed in numerous applications. In this paper, we study “context-rich” applications dealing with a large variety of different types of contexts. We demonstrate that supporting only a few of the most important contextual variables that could be manually identified, although useful, is not sufficient. In particular, we develop an approach to extract a large number of contextual variables for the dialogue-based recommender systems. In our study, we processed dialogues of bank managers with their clients and managed to identify over two hundred types of contextual variables forming the Long Tail of Context (LTC). We empirically demonstrate that LTC matters, and using all these contextual variables from the Long Tail leads to better recommendation performance.

GEARS: Generalizable Multi-Purpose Embeddings for Gaze and Hand Data in VR Interactions

  • Philipp Hallgarten
  • Naveen Sendhilnathan
  • Ting Zhang
  • Ekta Sood
  • Tanya R. Jonker

Machine learning models using users’ gaze and hand data to encode user interaction behavior in VR are often tailored to a single task and sensor set, limiting their applicability in settings with constrained compute resources. We propose GEARS, a new paradigm that learns a shared feature extraction mechanism across multiple tasks and sensor sets to encode gaze and hand tracking data of users VR behavior into multi-purpose embeddings. GEARS leverages a contrastive learning framework to learn these embeddings, which we then use to train linear models to predict task labels. We evaluated our paradigm across four VR datasets with eye tracking that comprise different sensor sets and task goals. The performance of GEARS was comparable to results from models trained for a single task with data of a single sensor set. Our research advocates a shift from using sensor set and task specific models towards using one shared feature extraction mechanism to encode users’ interaction behavior in VR.

Pervasive Chatbots: Investigating Chatbot Interventions for Multi-Device Applications

  • Mayowa Olapade
  • Tarlan Hasanli
  • Abdul-Rasheed Ottun
  • Adeyinka Akintola
  • Mohan Liyanage
  • Huber Flores

The inherent social characteristics of humans make them prone to adopting distributed and collaborative applications easily. Although fundamental methods and technologies have been defined and developed over the years to construct these applications, their adoption in practice is uncommon because end-users may be puzzled about how to use them without much hassle. Indeed, commonly, these applications require a certain level of technical expertise and awareness to use them correctly. Fortunately, AI-chatbot interventions are envisioned to assist and support various human tasks. In this paper, we contribute pervasive chatbots as a solution that fosters a more transparent and user-friendly interconnection of devices in distributed and collaborative environments. Through two rigorous user studies, firstly, we quantify the perception of users toward distributed and collaborative applications (N = 56 participants). Secondly, we analyze the benefits of adopting pervasive chatbots when compared with the chatbot reference model designed for assistance and recommendations (N = 24 participants). Our results suggest that pervasive chatbots can significantly enhance the practicability of distributed and collaborative applications, reducing the time and effort needed for collaboration with surrounding devices by 57%. With this information, we then provide design and development implications to integrate pervasive chatbot interventions in distributed and collaborative environments. Moreover, challenges and opportunities are also provided to highlight the remaining issues that need to be addressed to realize the full vision of pervasive chatbots for any multi-device application. Our work paves the way towards the proliferation of sophisticated and highly decentralized computing environments that are easily interconnected.

Supporting Group Decision-Making: Insights from a Focus Group Study

  • Amra Delić
  • Hanif Emamgholizadeh
  • Francesco Ricci
  • Judith Masthoff

In everyday life, we make decisions in groups about a variety of issues. In group decision-making, group members discuss options, exchange preferences and opinions, and make a common decision. Decision support systems and group recommender systems facilitate this process by enabling preference elicitation, generating recommendations, and supporting the process. We are here interested in building a conversational system, namely, a chat app, enhanced with an AI agent supporting the group decision-making process. To design the system, rather than solely relying on our assumptions, we took one step back and conducted a comprehensive focus group study. This approach has allowed us to gain original insights into the specific needs and preferences of the future end-users, i.e., group members, ensuring that our system design aligns more closely with their requirements. The focus group study involved fourteen participants in three group compositions: friends, families, and couples. Our findings reveal that most of the group members define a good choice as one that maximizes overall satisfaction without leaving any member dissatisfied. Dealing with competing group members emerged as a primary concern, with study participants requesting specific help from the AI agent to address this challenge. Participants identified personality and group structure as crucial characteristics for the AI agent to properly operate, though some expressed privacy concerns. Lastly, participants expected an AI agent to provide private interactions with individual members, proactively guide discussions when necessary, continually analyze group interactions, and tailor support to those interactions.

To Adapt or Not to Adapt ? Older Adults Enacting Agency in Dialogues with an Unknowledgeable Agent

  • Helena Lindgren
  • Vera C Kaelin
  • Ann-Margreth Ljusbäck
  • Maitreyee Tewari
  • Michele Persiani
  • Ingeborg Nilsson

Health-promoting digital agents, taking on the role of an assistant, coach or companion, are expected to have knowledge about a person’s medical and health aspects, yet they typically lack knowledge about the person’s activities. These activities may vary daily or weekly and are contextually situated, posing challenges for the human-agent interaction. This pilot study aimed to explore the experiences and behaviors of older adults when interacting with an initially unknowledgeable digital agent that queries them about an activity that they are simultaneously engaged in. Five older adults participated in a scenario involving preparing coffee followed by having coffee with a guest. While performing these activities, participants educated the smartwatch-embedded agent, named Virtual Occupational Therapist (VOT), about their activity performance by answering a set of activity-ontology based questions posed by the VOT. Participants’ interactions with the VOT were observed, followed by a semi-structured interview focusing on their experience with the VOT. Collected data were analyzed using an activity-theoretical framework. Results revealed participants exhibited agency and autonomy, deciding whether to adapt to the VOT’s actions in three phases: adjustment to the VOT, partial adjustment, and the exercise of agency by putting the VOT to sleep after the social conditions and activity changed. Results imply that the VOT should incorporate the ability to distinguish when humans collaborate as expected by the VOT and when they choose not to comply and instead act according to their own agenda. Future research focuses on how collaboration evolves and how the VOT needs to adapt in the process.

TOC Adjunct Proceedings

The ACM OpenTOC service enables visitors to download the articles below at no charge.

SESSION: Tutorials

Collaborative Team Recommendation for Skilled Users: Objectives, Techniques, and New Perspectives

  • Mahdis Saeedi
  • Christine Wong
  • Hossein Fani

Collaborative team recommendation involves selecting users with certain skills to form a team who will, more likely than not, accomplish a complex task successfully. To automate the traditionally tedious and error-prone manual process of team formation, researchers from several scientific spheres have proposed methods to tackle the problem. In this tutorial, while providing a taxonomy of team recommendation works based on their algorithmic approaches to model skilled users in collaborative teams, we perform a comprehensive and hands-on study of the graph-based approaches that comprise the mainstream in this field, then cover the neural team recommenders as the cutting-edge class of approaches. Further, we provide unifying definitions, formulations, and evaluation schema. Last, we introduce details of training strategies, benchmarking datasets, and open-source tools, along with directions for future works.

DECI: The 2nd Tutorial on Designing Effective Conversational Interfaces

  • Ujwal Gadiraju
  • Kuldeep Yadav

Conversational User Interfaces (CUIs) have been argued to have advantages over traditional GUIs due to having a more human-like interaction. The growing popularity of conversational agents has enabled humans to interact with machines more naturally. People are increasingly familiar with conversational interactions mediated by technology due to the widespread use of mobile devices and messaging services and a hungry market for conversational agents. Based on the recent advances in conversational AI, due to the proliferation of large language models, there are clear signs that the future of human-computer interaction will have a significant conversational component. Today, over two-thirds of the population on our planet has access to the Internet, with ever-lowering barriers to accessibility. This tutorial will showcase the benefits of employing novel conversational interfaces for crowd computing, human-AI decision making, health and well-being, and information retrieval. Given the widespread adoption of AI systems across several domains, we will discuss the potential of conversational interfaces in facilitating and mediating people’s interactions with AI systems and the opportunities and challenges that lie at this intersection from the user modeling and personalization standpoint. The tutorial will include interactive elements and discussions and provide participants with practical insights to inform the design of effective conversational interfaces.

Mastering Mind and Movement. ACM UMAP 2024 Tutorial on Modeling Intelligent Psychomotor Systems (M3@ACM UMAP 2024)

  • Miguel Portaz
  • Pablo Garcia
  • Rwitajit Majumdar
  • Olga C. Santos

The objective of this tutorial is to provide the researchers of the UMAP community with methodologies, tools and techniques to model complex psychomotor behaviours that can later personalize learning support in realms like sports, physical education or for rehabilitation purposes, providing insights into data gathering from activities that involve human movement. Research in the psychomotor field to provide personalization support to users poses several research challenges. In this tutorial we will focus on how to model psychomotor learning by exploring the fusion of computational analysis of the learning process and wearable / video analysis. Participants will engage in hands-on activities, recording specific movements and learning how to capture human body keypoints. Throughout the training, various tools are introduced, allowing participants to explore options that align with their specific needs and expertise. This tutorial aims to contribute to the thriving initiatives within the UMAP community, fostering the modeling of physical activities.

Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Emerging Trends

  • Erasmo Purificato
  • Ludovico Boratto
  • Ernesto William De Luca

The presented tutorial aims to serve as a comprehensive roadmap for the UMAP community into the current user modeling research, focusing on the paradigm shifts that have transformed the research landscape in recent times. We will provide a complete overview of the large, long-standing, and ever-growing research fields of user modeling and user profiling, both from a historical and a technical point of view. We will then examine the definitions associated with each key term in this research domain, aiming to eliminate ambiguity and confusion in their usage. As the core of our tutorial, we present in-depth the paradigm shifts that have occurred in recent years, especially due to technological evolution, as well as the current research directions and novel trends in the field. In particular, we illustrate and discuss the advances in the following topics: implicit and explicit user profiling, user behavior modeling, user representation, and beyond-accuracy perspectives. The audience will be engaged in discussions during the whole presentation to foster the development of an interactive event.

Detailed information and resources about the tutorial are available on the website: https://link.erasmopurif.com/tutorial-umap24.

Trustworthy User Modeling and Recommendation From Technical and Regulatory Perspectives

  • Markus Schedl
  • Vito Walter Anelli
  • Elisabeth Lex

This tutorial provides an interdisciplinary overview of fairness, non-discrimination, transparency, privacy, and security in the context of recommender systems. According to European policies, these are essential dimensions of trustworthy AI systems but also extend to the global debate on regulating AI technology. Since the aspects mentioned earlier require more than technical considerations, we discuss these topics from ethical, legal, and regulatory perspectives. While the tutorial’s primary focus is on presenting technical solutions that address the mentioned topics of trustworthiness, it also equips the primarily technical audience of UMAP with the necessary understanding of the social and ethical implications of their research and development and recent ethical guidelines and regulatory frameworks.

SESSION: Doctoral Consortium

Adaptive Search Support for Teachers in Lesson Planning

  • Ratan Sebastian

Teachers report difficulties engaging in search in the context of lesson planning. Despite the availability of customised search solutions like Learning Object Repositories, most teachers report using general-purpose search engines like Google. Unlike students – to whom most research in educational search is directed – teachers are experts capable of using many types of resources for their tasks. In addition, since they teach multiple classes, they switch educational contexts frequently. These circumstances present unique challenges for user modelling. This thesis investigates how an adaptive browser-based search engine augmentation could support teachers in both exploratory and lookup search tasks. A preliminary user study was conducted to understand the actual search behaviour of teachers in their everyday environment and to investigate the inference of search tasks and context from search behaviour. Using this inferred information, personalized search aids could be delivered through the extension such as search engine result page (SERP) summarization for exploratory search and context-based search snippets. The relevance of existing search snippets was investigated and potential for improvement using teachers’ search context was identified. With the rest of the PhD, I plan to expand the user study to get more concrete results about teacher search behaviour and implement and test this adaptive browser extension to see how it affects the search experience for teachers.

Balanced Explanations in Recommender Systems

  • Elaheh Jafari

Recommender systems have become essential in aiding users’ decision-making processes, yet ensuring users understand the rationale behind recommendations remains a challenge. Balancing the presentation of both positive and negative aspects of recommendations is important to foster user trust and satisfaction. In this PhD thesis, I want to investigate the balance between supporting recommendations and alerting users to potential risks. While prior research has broadly examined persuasive and informative explanations, my focus is on exploring methodologies to effectively caution users about associated risks. By prioritizing the warning aspect, the aim is to heighten user awareness of biases and potential harms, ultimately enhancing transparency and trust in the recommendation process.

Development of Personalised Educational Tools for AI Literacy Using Participatory Design

  • Maria Kasinidou

The flourishing presence of Artificial Intelligence (AI) in daily life emphasizes the necessity for education about AI. It is crucial to create tailored educational materials and tools to meet the unique educational needs of diverse groups. This ongoing project aims to develop personalised educational tools for AI literacy for different groups of society (i.e., children, teachers, and adults). The perception and knowledge of AI by these groups will be explored. Participatory design workshops with teachers and adults will assist in the development of personalised educational tools for these groups tailored to the specific needs of each group and their perceptions of AI. Finally, this project aims to identify the best practices for developing personalised educational tools for AI literacy for each group.

Improving Recommendations for Non-Mainstream Users by Addressing Subjective Item Views

  • Arsen Matej Golubovikj

Item representations and item-to-item similarity measures are powerful tools used by many modern recommender systems (RS). Typically, recommender systems consider a singular representation of items and similarity measure between items. Studies have shown, however, that users can disagree on item descriptors and item similarities. If we consider that each user has their own subjective view of items, a singular item representation cannot cater to all users. We link this observation to recent studies showing that modern recommender systems approaches produce better recommendations for mainstream users, users whose preferences align with the majority/popular view, and poor recommendations for non-mainstream users with uncommon or niche tastes. It is our assumption that this bias towards mainstream users is due to the fact that (i) common RS datasets are dominated by mainstream users and (ii) if we again consider that each user has their own subjective view of items, an item representation built from a mainstream user dominated dataset will better represent the views of mainstream users and misalign with the subjective views of non-mainstream users. In this PhD work, we aim to explore methods which improve recommendations for non-mainstream users, by aligning the system’s representation of items with the target user’s subjective view of items during the recommendation step. Specifically, we focus on subjective perceptions of item distances and item characteristics.

Improving Recommender Systems with Large Language Models

  • Sebastian Lubos

Recommender systems offer an important technology aiding users in discovering relevant items among a multitude of available options. The emergence of Large Language Models (LLMs) enables a powerful opportunity to improve the performance and flexibility of recommender systems in different aspects. Those involve possibilities to enhance the item retrieval and ranking, explanation of recommendations, and generation of content to be recommended. The objective of my Ph.D. is to investigate those possibilities to identify and develop methods for improving different aspects of recommender systems using LLMs. The findings are expected to further develop the field of recommender systems and reveal novel opportunities for future research.

Perspective Diversification News Recommender System

  • Uroš Sergaš

Promoting Green Fashion Consumption in Recommender Systems

  • Angelo Geninatti Cossatin

The fashion industry is a significant contributor to global carbon emissions, water consumption, and waste generation. My Ph.D. project explores two approaches to promote sustainability in fashion recommender systems. First, I aim to develop algorithms and user interfaces that nudge users towards more sustainable and ethical fashion choices by incorporating relevant data into the recommendation process and item presentation. This involves balancing user preferences with sustainability considerations and exploring effective ways to present this complex information to users. Second, I seek to make the recommendation algorithms themselves more environmentally sustainable by reducing their computational cost without compromising recommendation quality. To achieve these goals, I plan to leverage multimodal item representations, multi-criteria recommendation techniques, and novel algorithmic approaches. I expect to complete my Ph.D. in October, 2025.

Promoting Mental Health of Adolescents with a Mild Intellectual Disability

  • Simone Ooms

Promoting mental health in adolescents is crucial to minimize the development of mental health disorders, especially for those with a mild intellectual disability who face more mental health challenges. In general, they have more difficulty controlling their feelings and show an overrepresentation in feelings of loneliness, anxiety, and other emotional problems. To improve their mental health, traditional teaching methods are unsuitable. Digital mental health intervention is a promising approach to do this in a fun, rewarding, and personally relevant manner. This PhD research uses a participatory approach to (1) identify the needs of these adolescents and (2) how to design, develop, and evaluate a digital intervention for mental health promotion. First school observations and interviews with staff suggest to improve self-reliance skills, finding help, and reflecting on wellbeing. The design process will focus on developing a personalized intervention that suits each adolescent’s needs, as home context and support network vary.

Toward privacy-focused personalization: Designing a learning experience to facilitate privacy-personalization trade-off

  • Adrienn Toth

In recent years, as a result of the emergence of innovative technologies and applications, several solutions aiming to improve the quality of education have been appearing. However, mainly due to the human factor of lack of understanding and trust in these applications, adoption by higher education institutions remained low. In my doctoral thesis, I aim to contribute to overcoming this barrier in two main steps. First, I try to understand more deeply the nature of the existing contrast between perceived benefits and concerns of higher education students regarding one specific application area, artificial intelligence-based personalized learning. Building on these results, I will then design and evaluate a new personalized learning tool, developed specifically with user-centered data privacy and ethical considerations in mind that specifically respond to the identified concerns. The uniqueness of this tool is the transparency and controllability of different levels of personalization, which ensures that each student can freely choose the extent to which they are willing to make their data available in order to receive a personalized learning experience. The effects of this design principle on privacy concerns and learning outcomes will then be tested in multiple lab- and field studies. I believe this project fits well into both the “Personalizing Learning Experiences through User Modeling” and the “Fairness, Transparency, Accountability, and Privacy” tracks of the UMAP conference, which is why I have decided to apply for the Doctoral Consortium.

SESSION: Late-Breaking Results and Demos

“DecisionTime”: A Configurable Framework for Reproducible Human-AI Decision-Making Studies

  • Sara Salimzadeh
  • Ujwal Gadiraju

Empirical studies have extensively investigated human decision-making processes in various domains where AI systems are incorporated. However, comparing and replicating these studies can be challenging due to different experimental configurations. Moreover, the existing contexts often have limited scope and may not fully capture the complexity of real-world decision-making scenarios that are riddled with varying levels of uncertainty. Our framework addresses these practical gaps by providing a configurable and reproducible environment for conducting human-AI decision-making studies in the route planning domain that captures many complexities of real-world scenarios. Researchers can customize parameters, conditions, and factors involved in decision-making tasks to help address research and empirical gaps through rigorous experiments. With various modules such as map generation, chat components, and different AI systems available within the “DecisionTime” framework, researchers can effortlessly design experiments exploring multiple aspects of human-AI interaction and decision-making.

A Preliminary Analysis on Self and Peer Evaluation of Personality Models for Recommender Systems

  • Francesco Barile
  • Federico Maria Cau
  • Nava Tintarev

Personality has been introduced in recommender systems to address the cold-start problem, or to improve the recommendations by personalizing the degree of diversity for the specific user. For group recommender systems, personality has also been considered to select the most appropriate strategy for aggregating individual preferences or to model the dynamics of the group decision-making process. The most widely used models in this context are the Five Factor Model (FFM) and the Conflict Resolution Styles model (TKI or ROCI II). However, the challenge of eliciting this information remains an open problem, along with establishing correlations between self- and peer-evaluations of personality. In this work, we present the results of a preliminary study investigating the correlations between self and peer evaluations of personality. Furthermore, we analysed the correlations between the two personality models. Our findings show good consistency between self and peer personality evaluations for the FFM, but not for conflict resolution styles. This suggests that the FFM peer evaluations could be considered an alternative to self-evaluations. In contrast, for the conflict resolution styles, it may be necessary to measure both self and peer evaluations. This is relevant in group recommendation scenarios, where how group members perceive each other could impact the group decision-making process. Finally, our findings reveal correlations between the two personality models but do not support a consistent way to derive one from the other.

An Explanatory Model Steering System for Collaboration between Domain Experts and AI

  • Aditya Bhattacharya
  • Simone Stumpf
  • Katrien Verbert

With the increasing adoption of Artificial Intelligence (AI) systems in high-stake domains, such as healthcare, effective collaboration between domain experts and AI is imperative. To facilitate effective collaboration between domain experts and AI systems, we introduce an Explanatory Model Steering system that allows domain experts to steer prediction models using their domain knowledge. The system includes an explanation dashboard that combines different types of data-centric and model-centric explanations and allows prediction models to be steered through manual and automated data configuration approaches. It allows domain experts to apply their prior knowledge for configuring the underlying training data and refining prediction models. Additionally, our model steering system has been evaluated for a healthcare-focused scenario with 174 healthcare experts through three extensive user studies. Our findings highlight the importance of involving domain experts during model steering, ultimately leading to improved human-AI collaboration.

Analyzing the Interplay between Diversity of News Recommendations and Misinformation Spread in Social Media

  • Royal Pathak
  • Francesca Spezzano

Recommender systems play a crucial role in social media platforms, especially in the context of news, by assisting users in discovering relevant news. However, these systems can inadvertently contribute to increased personalization, and the formation of filter bubbles and echo chambers, thereby aiding in the propagation of fake news or misinformation. This study specifically focuses on examining the tradeoffs between the diversity of news recommendations and the dissemination of misinformation on social media. We evaluated classical recommender algorithms on two Twitter (now X) datasets to assess the diversity of top-10 recommendation lists and simulated the propagation of recommended misinformation within the user network to analyze the impact of diversity on misinformation spread. The research findings indicate that an increase in news recommendation diversity indeed contributes to mitigating the propagation of misinformation. Additionally, collaborative and content-based recommender systems provide more diversity in comparison to popularity and network-based systems, resulting in less misinformation propagation. Our study underscores the crucial role of diversity recommendations in mitigating misinformation propagation, offering valuable insights for designing misinformation-aware recommender systems and diversity-based misinformation intervention.

Beyond Static Calibration: The Impact of User Preference Dynamics on Calibrated Recommendation

  • Kun Lin
  • Masoud Mansoury
  • Farzad Eskandanian
  • Milad Sabouri
  • Bamshad Mobasher

Calibration in recommender systems is an important performance criterion that ensures consistency between the distribution of user preference categories and that of recommendations generated by the system. Standard methods for mitigating miscalibration typically assume that user preference profiles are static, and they measure calibration relative to the full history of user’s interactions, including possibly outdated and stale preference categories. We conjecture that this approach can lead to recommendations that, while appearing calibrated, in fact, distort users’ true preferences. In this paper, we conduct a preliminary investigation of recommendation calibration at a more granular level, taking into account evolving user preferences. By analyzing differently sized training time windows from the most recent interactions to the oldest, we identify the most relevant segment of user’s preferences that optimizes the calibration metric. We perform an exploratory analysis with datasets from different domains with distinctive user-interaction characteristics. We demonstrate how the evolving nature of user preferences affects recommendation calibration, and how this effect is manifested differently depending on the characteristics of the data in a given domain. Datasets, codes, and more detailed experimental results are available at: https://github.com/nicolelin13/DynamicCalibrationUMAP.

Combining Virtual Reality with a Biomechanical Model to Improve Parkinson’s Movement: Solution Proposal and Reference Learning Data

  • Ana Henriques
  • Cristiana Pinheiro
  • Cristina P. Santos

Parkinson’s disease imposes a substantial global health burden and causes severe motor impairments that compromise the quality of life. Virtual reality-based mirror training (VRMT) has the potential to improve motor function through immersive action observation, motor imagery, and real-time feedback. This study aims to propose a novel VRMT personalized for Parkinson rehabilitation and present reference learning data from a biomechanical model obtained from twelve healthy individuals performing the activities of lifting the leg, arising from a chair, and touching the nose included in the Movement Disorder Society Unified Parkinson’s Disease Rating Scale. The results reveal insights on the maintenance of balance when arising from a chair through linear variation of center of mass, minimal hand position shaking and thus minimal tremors during activity of touching the nose, and high maximum foot velocity of approximately 0.79±0.26 1/s during activity of lifting the leg. The data collected accurately represents the expected healthy execution and related variability for each activity, serving as a learning reference for the future development of VRMT.

Comp-HuSim: Persistent Digital Personality Simulation Platform

  • Chengyu Fan
  • Zaynab Tariq
  • Nafis Saadiq Bhuiyan
  • Michael G Yankoski
  • Trenton W Ford

Comp-HuSim (Complex Human Simulations) is a multi-agent persona simulation platform powered by generative AI models (GPT-3.5 turbo, GPT 4 turbo, and Mixtral(8×7B)). Each Comp-HuSim generative agent is created with random variables such as age, gender identity, economic status, Big-5 personality characteristics, geographic location, etc. Agents are then imbued with backstory memories, textured histories, diverse hobbies and interests, and a series of distinct attributes. In early experiments our agents engage in conversation with one another, reflect on their past experiences, and play games together in surprisingly sophisticated ways. Complex forms of human-like behaviors become visible in the Comp-HuSim project demonstration, demonstrating our approach’s promise for applications in Virtual Assistants, Conversational Interactions, and Personalized Human-Robot Interaction.

CulturAI: Exploring Mixed Reality Art Exhibitions with Large Language Models for Personalized Immersive Experiences

  • Nicolas Constantinides
  • Argyris Constantinides
  • Dimitrios Koukopoulos
  • Christos Fidas
  • Marios Belk

Mixed Reality (MR) technologies have transformed the way in which we interact and engage with digital content, offering immersive experiences that blend the physical and virtual worlds. Over the past years, there has been increasing interest in employing Artificial Intelligence (AI) technologies to improve user experience and trustworthiness in cultural contexts. However, the integration of Large Language Models (LLMs) into MR applications within the Cultural Heritage (CH) domain is relatively underexplored. In this work, we present an investigation into the integration of LLMs within MR environments, focusing on the context of virtual art exhibitions. We implemented a HoloLens MR application, which enables users to explore artworks while interacting with an LLM through voice. To evaluate the user experience and perceived trustworthiness of individuals engaging with an LLM-based virtual art guide, we adopted a between-subject study design, in which participants were randomly assigned to either the LLM-based version or a control group using conventional interaction methods. The LLM-based version allows users to pose inquiries about the artwork displayed, ranging from details about the creator to information about the artwork’s origin and historical significance. This paper presents the technical aspects of integrating LLMs within MR applications and evaluates the user experience and perceived trustworthiness of this approach in enhancing the exploration of virtual art exhibitions. Results of an initial evaluation provide evidence about the positive aspect of integrating LLMs in MR applications. Findings of this work contribute to the advancement of MR technologies for the development of future interactive personalized art experiences.

Dynamic Ridge Plot Sliders: Supporting Users’ Understanding of the Item Space Structure and Feature Dependencies in Interactive Recommender Systems

  • Lovis Bero Suchmann
  • Nicole Krämer
  • Jürgen Ziegler

Understanding the characteristics of the space of available items is instrumental for successfully exploring the options in interactive recommender system approaches, including conversational recommender systems. Forming an adequate mental model of the distribution of items with respect to their various features can prevent mismatches with the user’s preferences, either represented in an existing preference model or, even more importantly, in a process of preference development during the interaction. However, such a mental model is limited due to complex and opaque feature dependencies manifested in the item space that make it hard to predict whether a combination of feature preferences is realistically satisfiable or should be partially relaxed. Illuminating these dependencies in the recommender interface while the user scrutinizes their preference model has the potential to convey richer structural information, facilitate mental model formation, and thus allow better-informed decisions for both preferred feature ranges and recommended items. To this end, we introduce novel dynamic ridge plot sliders that employ guidelines from research on information visualization for mental model formation. The proposed sliders relate feature-wise item distributions to items’ overall utility values and in doing so provide predictive power about the implications of future user actions and preference adjustments. Furthermore, paradigms for interactions between multiple such sliders and other GUI elements are motivated via an interactive recommender prototype. We also present a preliminary user study with yet inconclusive results, and discuss the future potentials and conceivable application areas of our approach, as well as promising next research steps.

Exploring the Potential of Generative AI for Augmenting Choice-Based Preference Elicitation in Recommender Systems

  • Benedikt Loepp
  • Jürgen Ziegler

The recent boost in generative artificial intelligence has also reached the field of recommender systems. However, as is often the case, much of the work focuses on the algorithms, overlooking the crucial aspect of improving the systems from a user perspective. In this initial research, we explore the potential of large language models to achieve improvements in preference elicitation. The interactive choice-based method we are augmenting has previously demonstrated significant improvements in a number of aspects related to the user experience. Through an exploratory user study, we show that the item set comparisons presented by this method can be successfully accompanied by independently generated textual summaries, thereby improving the user experience even further.

Exploring the Role of Empathy in Designing Social Robots for Elderly People

  • Berardina Nadja De Carolis
  • Giuseppe Palestra
  • Giovanna Castellano

Social robots are emerging as a valuable tool in promoting holistic person-centered care and well-being among seniors. One aspect that often affects seniors’ lives is the feeling of loneliness and social isolation. In the context of the SISTER project, we aim to investigate the acceptance of a personal social robot as a tool to mitigate these aspects. Empathy plays an important role in establishing a human-robot emotional and social connection. As a first phase of our project, we investigate empathy’s impact on human-robot interaction when a social robot serves as a senior companion. For this purpose, two robot dialog versions were developed: a non-empathic version, acting as a conversational interface without considering users’ emotional states, and an empathic version in which emotion recognition is used to understand and respond empathically to the users’ emotional state. Thirty older adults were involved in the study, revealing that, as far as senior-robot interaction is concerned, the empathic condition resulted in a more usable and positive experience showing that this robot’s capability should be considered in future developments for personalizing the interaction.

Following Topics Across All Apps and Media Formats: Mobile Keyword Tracking as a Privacy-Friendly Data Source in Mobile Media Research

  • Philipp Krieter
  • Patrick Zerrer
  • Cornelius Puschmann
  • Stephanie Geise

Mining detailed content from mobile human-computer interactions often relies on broad log files, which limits the specificity of analyses. Our study presents a unique approach capitalizing on Optical Character Recognition to continuously detect keywords across all applications and media formats, correlating this with system logs for context and duration tracking. Such a strategy even allows tracing of topics within pictorial content like memes on social media. A privacy concept based on a whitelist for keywords and topics and anonymized log files addresses typical concerns of potential study participants regarding their personal data. In our four-month study involving 25 participants, we generated an expansive nine-million-point dataset. This detailed dataset not only validates the efficacy of our approach but also exemplifies its capacity for rich, cross-app, long-term interaction analysis. In order to clarify data protection issues, we conducted qualitative interviews with eight of the participants on a voluntary basis.

FormTwin: A Framework for Pen-based Data Collection

  • Konstantin Kuznetsov
  • Sara-Jane Bittner
  • Abdulrahman Mohamed Selim
  • Michael Barz
  • Daniel Sonntag

Paper and digital forms are widely used to collect user information across multiple domains, such as research, healthcare, and education. However, both types still lack application and follow-up interpretation: Paper forms need to be digitised meticulously to be analyzed or shared with team members efficiently; in comparison, digital forms cannot convey handwriting and might require technical literacy. We present the FormTwin data collection tool—an alternative to online forms, which allows for the efficient reuse of existent paper forms while providing the convenience of digital forms. FormTwin can digitise a wide range of forms with the integrated form annotation application. Then, it combines two input channels: A stylus on a tablet and a digital smart pen on plain paper, which duplicates the input on a mobile application in real-time. We aim to improve the efficiency and accessibility of data collection for practitioners with a modular system that combines both the digital accessibility of digital forms with keeping the needed technical literacy low and retaining the quality of hand-drawn sketches.

How the personality and memory of a robot can influence user modeling in Human-Robot Interaction

  • Benedetta Matcovich
  • Cristina Gena
  • Fabiana Vernero

In recent years, robotics has evolved, placing robots in social contexts, and giving rise to Human-Robot Interaction (HRI). HRI aims to improve user satisfaction by designing autonomous social robots with user modeling functionalities storing data on people to personalise interactions. Personality, a vital factor in human interactions, influences temperament, social preferences, and cognitive abilities. Despite much research on personality traits influencing HRI, little attention has been paid to the influence of the robot’s personality on the user modeling. Personality can influence not only temperament and how people interact with each other but also what they remember about an interaction. A robot’s personality traits could therefore influence what it remembers about the user and thus modify the user model and the consequent interactions. However, no studies investigating such conditioning have been found. This paper addresses this gap by proposing distinct user models that reflect unique robotic personalities, exploring the interplay between individual traits, memory, and social interactions to replicate human-like processes, and trying to provide users with more engaging and natural experiences.

IMPECT-POSE: A Complete Front-end and Back-end Architecture for Pose Tracking and Feedback

  • Abhishek Samanta
  • Hitesh Kotte
  • Patrick Handwerk
  • Khaleel Asyraaf Mat Sanusi
  • Mai Geisen
  • Milos Kravcik
  • Nghia Duong-Trung

This paper introduces IMPECT-POSE, an innovative front-end and back-end architecture designed to enhance fitness and sports training through precise body posture tracking. This system integrates advanced computer vision and artificial intelligence in pose estimation to provide real-time feedback on exercise execution, which is crucial for maintaining proper technique, reducing injury risks, and optimizing training outcomes. Our evaluations, conducted at two distinct locations with multiple participants, demonstrate the system’s capability to improve exercise performance significantly. The system’s flexibility allows sports professionals to monitor and guide clients remotely, enhancing the accessibility and effectiveness of training regimens. This research highlights the potential of augmented intelligence in transforming sports training, offering a scalable and effective alternative to conventional methods, and paving the way for future advancements in AI-driven personalized training programs. The continued development of this technology aims to refine its accuracy, broaden its applicability to diverse user preferences, and extend its use in practical, real-world settings.

Incorporating Editorial Feedback in the Evaluation of News Recommender Systems

  • Bilal Mahmood
  • Mehdi Elahi
  • Samia Touileb
  • Lubos Steskal
  • Christoph Trattner

Research in the recommender systems field typically applies a rather traditional evaluation methodology when assessing the quality of recommendations. This methodology heavily relies on incorporating different forms of user feedback (e.g., clicks) representing the specific needs and interests of the users. While this methodology may offer various benefits, it may fail to comprehensively project the complexities of certain application domains, such as the news domain. This domain is distinct from other domains primarily due to the strong influence of editorial control in the news delivery process. Incorporation of this role can profoundly impact how the relevance of news articles is measured when recommended to the users. Despite its critical importance, there appears to be a research gap in investigating the dynamics between the roles of editorial control and personalization in the community of recommender systems. In this paper, we address this gap by conducting experiments where the relevance of recommendations is assessed from an editorial perspective. We received a real-world dataset from TV 2, one of the largest editor-managed commercial media houses in Norway, which includes editors’ feedback on how news articles are being related. In our experiment, we considered a scenario where algorithm-generated recommendations, using the K-Nearest Neighbor (KNN) model, employing various text embedding models to encode different sections of the news articles (e.g., title, lead title, body text, and full text), are compared against the editorial feedback. The results are promising, demonstrating the effectiveness of the recommendation in fulfilling the editorial prospects.

Insights from the Review of Apps that Influence Environmental Sustainability

  • Ifeoma Adaji
  • Peter Idoko
  • Mikhail Ola Adisa

Environmental sustainability is the avoidance of the depletion of natural resources in order to maintain an ecological balance. To influence people to be environmentally sustainable, several mobile apps exist to teach sustainable behaviours. These apps hold great promise as interventions for influencing people to live sustainable lives since they are often designed using behaviour change strategies. However, the effectiveness of these apps as behaviour change tools is unclear. In addition, despite people downloading these apps, their engagement level is still low. Research suggests that user experience and usability determine the adoption and usage of apps. Research also indicates that user experience and usability of apps can be gleaned from the reviews written by users on the app store. Thus, reviews are a good source of determining the user experience and usability of users. To determine the effectiveness of sustainability apps as behaviour change tools and the users’ experience with the apps, we reviewed 70 sustainability apps that are available on the Google Play Store. First, using a popular behaviour change framework, the App Behaviour Change Scale (ABACUS), we investigated the persuasive strategies implemented by the apps and how these strategies were designed and implemented to achieve the targeted design objectives. Second, using natural language processing, we identified the common themes in the user reviews of the apps. The preliminary results presented here can influence the design of apps for influencing environmentally sustainable behaviour.

Interaction Visualization for Analysing and Improving User Models

  • Stefan Lengauer
  • Lin Shao
  • Mariia Tytarenko
  • Manfred Klaffenböck
  • Tobias Schreck

Many web-based systems such as online retail, information systems or search engines track the interactions users have with them. Tracked data can comprise high-level information like dwelling time, reviewed items, and clicked elements, but also fine-grained information in the form of mouse trajectories and keystrokes. While these data are often fed into user- or behavior models in recommender systems, there are few approaches for interactive visual exploration of multi-modal and complex interaction patterns. Yet, the thorough analysis could reveal important insights for the design and evaluation of said models. We propose a suitable visual analysis approach that allows to validate and correct models in an intuitive and interactive manner. Our tool provides insights into concrete user (inter)actions and also estimates more complex behavioral patterns. Level of detail views in our system outlines the certainty of detected behaviors and serve the explainability. Our approach can help engineers to understand user interactions and improve behavioral models.

Modeling the New Modalities of Personas: How Do Users’ Attributes Influence Their Perceptions and Use of Interactive Personas?

  • Ilkka Kaate
  • Joni Salminen
  • Soon-Gyo Jung
  • João M. Santos
  • Essi Häyhänen
  • Trang Xuan
  • Jinan Azem
  • Bernard J. Jansen

We investigate the impact of user demographics (age, gender) and experience (with personas and chatbots) on users’ perceptions of interactive personas. A within-subjects study was conducted with 54 participants, mostly engineers and computer scientists. Each participant used interactive personas with two interfaces: a web-based profile persona and a chat persona. The findings from regression analysis indicate that users’ age and gender (as well as persona’s gender) affect multiple perceptions of personas. In addition, the interface modality (profile vs. chat) has a significant impact. Findings highlight the need for designing interactive personas that appeal to diverse user bases to increase the general accessibility of interactive personas. They also support the notion that the persona interface itself regulates user perceptions even when the persona’s information remains the same.

Modelling Visual Attention for Future Intelligent Flight Deck – A Case Study of Pilot Eye Tracking in Simulated Flight Takeoff

  • Bo Fu
  • Angelo Ryan Soriano
  • Kayla Chu
  • Peter Gatsby
  • Nicolas Guardado

Piloting is a cognitively demanding task that not only requires conceptual knowledge and domain expertise, but also skills to intercept relevant visual cues in a changing environment. Pilots’ abilities to divide their visual attention effectively during critical flight maneuvers play a central role in their situation awareness and decision-making. Given that a large number of visual cues in the flight deck need to be processed by the human eyes, there is an opportunity to utilize eye tracking in the analytics of pilot performance. To this end, we present a case study involving 10 licensed pilots in a simulated flight takeoff scenario and demonstrate that specific gaze behaviors employed by a pilot may lead to varying levels of success. In particular, pilots who perform better during the climb phase may often exhibit faster visual searches, and those who perform better during the takeoff phase may often scan a larger area of the visual scene. Also, more successful pilots tend to report lower cognitive workload that is further supported by their pupil diameters in the experiment. This knowledge may be utilized in the development of user models to infer pilot success and failure, whereby an intelligent flight deck can adapt system behaviors to the pilot, whereby alerts, warnings, and automation may be initiated in the event of detecting irregularity in pilots’ gaze.

Repeating my Workouts or Exploring new Activities? A Longitudinal Micro-Randomized User Study for Physical Activity Recommender Systems

  • Ine Coppens
  • Toon De Pessemier
  • Luc Martens

While repeating activities can create healthy habits, exploring new physical activities is also important to increase health benefits and prevent boredom. Following habit formation and variety-seeking behavior theories, this study investigates the difference between repetition and exploration of physical activities in health recommender systems. An eight-week Micro-Randomized Trial is conducted in which 11 physically inactive adults receive personalized activity recommendations that are either (a) a repetition or exploration, (b) connected to a specific location, duration, or neither, and (c) accompanied by videos or Points-of-Interest, or not. Analyses of the 187 submitted activity recommendations suggest that the inactive participants prefer exploration, as exploration recommendations were submitted the most, had significantly larger star ratings and durations, and are dependent on moderators (b) and (c). More specifically, exploration for workouts received the highest star ratings and motivation. To our knowledge, this study is the first to investigate repetition and exploration for physical activities, contributing to effective recommender algorithms for healthy behavior change.

Towards Multi-Objective Behavior and Knowledge Modeling in Students

  • Siqian Zhao
  • Shaghayegh Sahebi

Traditional knowledge modeling methods have primarily focused on student knowledge modeling using assessed learning activities, often overlooking the critical interplay between students’ knowledge and behavioral preferences. However, students typically interact with multiple types of learning materials, such as questions (assessed), video lectures (non-assessed), and textbooks (non-assessed). We argue that student knowledge can affect their behavioral preferences, and the choice of learning material type can influence their knowledge. In this paper, we address this gap by proposing a novel framework that models student knowledge and behavior as a multi-task learning problem with two objectives. Our dual objectives are to predict student performance and their preferences for selecting different types of learning materials. We utilize the Pareto Multi-Task Learning (MTL) algorithm to effectively handle the complexities of this multi-objective optimization, applying it to two advanced multi-activity knowledge modeling methods, TAMKOT and GMKT, which we refer to as Pareto-TAMKOT and Pareto-GMKT, respectively. We evaluate the framework on one real-world dataset. Our experimental results demonstrate that both Pareto-TAMKOT and Pareto-GMKT improve upon their original models and outperform all baseline models. This underscores the benefits of treating the modeling of student knowledge and behavior as a multi-task learning problem and addresses this multi-objective challenge through the application of Pareto MTL.

Trust in a Human-Computer Collaborative Task With or Without Lexical Alignment

  • Sumit Srivastava
  • Mariët Theune
  • Alejandro Catala
  • Chris Reed

Lexical alignment is a form of personalization frequently found in human-human conversations. Recently, attempts have been made to incorporate it in human-computer conversations. We describe an experiment to investigate the trust of users in the performance of a conversational agent that lexically aligns or misaligns, in a collaborative task. The participants performed a travel planning task with the help of the agent, involving rescuing residents and minimizing the travel path on a fictional map. We found that trust in the conversational agent was not significantly affected by the alignment capability of the agent.

Understanding University Students’ Concerns Regarding Automated Learning Assessment Tools

  • Adrienn Toth
  • Zsolt Abraham
  • Verena Zimmermann

With increased technology use in education that leverages the benefits of user modeling, adaptation, and personalization, privacy of educational data gains relevance, yet privacy research in this area is still in its infancy. Current research on educational technologies focuses on the technological implementation and learning outcomes. Yet, potential privacy concerns or user perceptions that arise from the use of personal and sensitive data these systems require may impact user engagement, motivation, and learning. To explore potential predictors of students’ privacy-related risk perceptions, we conducted a study with N=66 university students who used an automated assessment software. The results show that awareness about the types of data the software collects and how it is used is associated with lower levels of risk perception and a higher preference for using the software. The findings present a first step of a research project aiming to provide a personalized learning experience while making related privacy implications graspable and controllable.

SESSION: cAESAR 2024: 5thWorkshop on Adapted intEraction with SociAl Robots

5th Workshop on Adapted intEraction with SociAl Robots (cAESAR)

  • Berardina Nadja De Carolis
  • Cristina Gena
  • Antonio Lieto
  • Silvia Rossi
  • Alessandra Sciutti

Human Robot Interaction (HRI) is a field of study dedicated to understanding, designing, and evaluating robotic systems for use by, or with, humans. In HRI there is a consensus about the design and implementation of robotic systems that should be able to adapt their behavior based on user actions and behavior. The robot should adapt to emotions, personalities, and it should also have a memory of past interactions with the user to become believable. This is of particular importance in the field of social robotics and social HRI. The aim of this Workshop is to bring together researchers and practitioners who are working on various aspects of social robotics and adaptive interaction. The expected result of the workshop is a multidisciplinary research agenda that will inform future research directions and hopefully, forge some research collaborations.

Achieving Kinetic Anthropomorphism in Robotic Precision Tasks

  • Rosanna Coccaro
  • Enrico Ferrentino
  • Antonio Parziale
  • Angelo Marcelli
  • Pasquale Chiacchio

In this position paper we address the issue of acceptance in human-robot interaction, a topic often associated with anthropomorphism. The literature around this concept lacks of a discussion around the kinetic aspect of anthropomorphism, especially in applications where machine accuracy is required. We briefly summarize the results we achieved and outline some research directions promised to impact human-robot collaboration in sectors like manufacturing and medicine.

Children’s Interpretation of Emotional Body Language Displayed by a Humanoid Robot: A Case Study

  • Ilaria Consoli
  • Claudio Mattutino
  • Cristina Gena

This paper presents an empirical study that examined how children interpret emotional body language displayed by the humanoid robot NAO. The purpose of the study is to provide insights into how children perceive and respond to emotional cues from robotic agents presenting an empirical evaluation that explores the effectiveness of using a humanoid robot to convey emotions to children. Through the examined results, the study aims to highlight the potential of using humanoid robots in educational and therapeutic contexts.

Dynamic Personalization of Multimedia Content Based on User Model

  • Massimo Donini
  • Cristina Gena
  • Alessandro Mazzei
  • Irene Borgini
  • Matteo Nazzario

This project investigates the potential of personalized interactions with social robots, like Pepper, by customizing images and videos to align with individual user profiles. The primary goal is to make conversations with social robots more engaging and relatable. By tailoring content to factors such as age, gender, interests, and cultural background, we aim to enhance user satisfaction and foster broader acceptance of social robots in everyday settings. This study opens up new possibilities for designing robots that can adapt to diverse user needs and preferences. Ultimately, this approach could lead to social robots that are more widely accepted and valued in a range of contexts, from customer service to education to healthcare. This project provides a path for future research in context-aware robot communication.

On the Way to a Transparent HRI

  • Alessandra Rossi
  • Silvia Rossi

Autonomous robots are expected to have safe and social interactions while sharing the environment and collaborating on several tasks with people. In such scenarios, robots’ behaviours must be transparent to people so that they can understand and predict the robots’ intentions and actions. In this work, we address the problem of transparency of a robot’s behaviours with particular attention given to the use of legibility of the robot’s actions, predictability of the final goal and the possibility of providing verbal explanations for a transparent human-robot interaction. Here, we also highlight different ways for adaptive generated explanations to be appropriately communicated to people and how to evaluate the overall transparency of the robots’ behaviours.

Robots to Make You Feel Good: Supporting Autistic Youths in Managing Medical Visit Challenges with Robot-Assisted Therapy

  • Linda Pigureddu
  • Cristina Gena
  • Berardina Nadja De Carolis
  • Margherita Attanasio
  • Monica Mazza

The following paper provides an overview of the “Feel Good” project, including its objectives, methodologies and plans. The premise of the project is to explore the integration of social robots within therapeutic frameworks aimed at easing the anxiety and distress associated with medical appointments for autistic youth. Social robots have shown effectiveness in supporting autism therapies, enhancing engagement and therapeutic outcomes. The aim of this initiative is to improve communication between healthcare providers and autistic youth. Through the implementation of various strategies, the initiative seeks to enhance the children’s capacity to identify and express their symptoms while also providing a comfortable healthcare experience. Furthermore, the initiative offers a secure environment where the children can work on their weaknesses and practice managing medical events. The activities are tailored according to their individual support needs and levels to ensure the best possible outcome.

SESSION: CRUM 2024: 2ndWorkshop on Context Representation in User Modelling

Second Workshop on Context Representation in User Modelling

  • Dipto Barman
  • Jovan Jeromela
  • Alok Debnath
  • Marloes Vredenborg
  • Anouk Van Kasteren
  • Judy Kay
  • Owen Conlan

The second edition of the Context Representation in User Modelling (CRUM) workshop, themed “Human-Centred Context”, aimed to provide a venue for research on contextual information within the evolving landscape of UMAP. This workshop explored context through a human-centred lens, emphasising the nuanced interplay between subjective and objective contexts in user experience rather than traditional domain-centred approaches. This theme encouraged accounting for the interaction between the user model and the contextual environment. The workshop also fostered discussion on the landscape of contextual representation – ranging from physical settings and social dynamics to technological interfaces – as well as on interaction design, user experiences, and influencing user behaviour. Prospective authors were invited to submit two types of papers: full papers of up to seven pages and two-page provocation papers. Accepted submissions are summarised in this workshop report.

Are We Losing Interest in Context-Aware Recommender Systems?

  • Laurens Rook
  • Markus Zanker
  • Dietmar Jannach

Contextual information is a prerequisite for timely offering of personalized decision support and recommendation. Yet, research on context-aware recommender systems (CARS) does not appear to be thriving, and finding public datasets containing context factors is a challenging task. We can make various assumptions about why this drop in research interest happened – be it ethical considerations or the popularity of opaque deep learning models that merely consider context in an implicit way. This is an unwelcome development. We argue that continued effort must be put on the creation of suitable datasets. Furthermore, we see significant opportunities in the development of next-generation CARS in the space of interactive AI assistants powered by Large Language Models.

Emotional Reframing of Economic News using a Large Language Model

  • Jia Hua Jeng
  • Gloria Kasangu
  • Alain Starke
  • Christoph Trattner

News media framing can shape public perception and potentially polarize views. Emotional language can exacerbate these framing effects, as a user’s emotional state can be an important contextual factor to use in news recommendation. Our research explores the relation between emotional framing techniques and the emotional states of readers, as well as readers’ perceived trust in specific news articles. Users (N = 200) had to read three economic news articles from the Washington Post. We used ChatGPT-4 to reframe news articles with specific emotional languages (Anger, Fear, Hope), compared to a neutral baseline reframed by a human journalist. Our results revealed that negative framing (Anger, Fear) elicited stronger negative emotional states among users than the neutral baseline, while Hope led to little changes overall. In contrast, perceived trust levels varied little across the different conditions. We discuss the implications of our findings and how emotional framing could affect societal polarization issues.

SESSION: ExUM 2024: 6thWorkshop on Explainable User Models and Personalised Systems

ExUM 2024 – 6th Workshop on Explainable User Modeling and Personalised Systems

  • Marco Polignano
  • Cataldo Musto
  • Amra Delić
  • Oana Inel
  • Amon Rapp
  • Giovanni Semeraro
  • Jürgen Ziegler

Adaptive and personalized systems have become pervasive technologies, gradually playing an increasingly important role in our daily lives. Indeed, we are now accustomed to interacting with algorithms that leverage the power of Language Models (LLMs) to assist us in various scenarios, from services suggesting music or movies to personal assistants proactively supporting us in complex decision-making tasks. As these technologies continue to shape our everyday experiences, ensuring that the internal mechanisms guiding these algorithms are transparent and comprehensible becomes imperative. The EU General Data Protection Regulation (GDPR) recognizes the users’ right to explanation when confronted with intelligent systems, highlighting the significance of this aspect. Regrettably, current research often prioritizes the maximization of personalization strategy effectiveness, such as recommendation accuracy, at the expense of model explainability. To address this concern, the workshop aims to provide a platform for in-depth discussions on challenges, problems, and innovative research approaches in the field. The workshop specifically focuses on investigating the role of transparency and explainability in recent methodologies for constructing user models and developing personalized and adaptive systems.

Classifying User Roles in Online News Forums: A Model for User Interaction and Behavior Analysis

  • Felix Scholz
  • Thomas Elmar Kolb
  • Julia Neidhardt

The growing exchange of opinions in online news forums brings together a diverse cross-section of users with varying opinions and motivations. Understanding these behaviors is crucial for unraveling the composition of these large user bases. This study proposes an explainable model aimed at classifying users based on their activity and interaction patterns in online news forums. The model leverages exploratory and statistical data analysis to reveal recurring behaviors and provides a tool to analyze the evolution of large user communities, offering an overview of their composition. The model identifies six active roles: Taciturn, Silent Voter, Regular, Conversationalist, Power User, and Celebrity, and one inactive role, Lurker. The model was evaluated for its predictive power, achieving a macro F1 score of 0.8632, demonstrating its robustness. By applying the model to a long-term dataset from the online news forum derStandard.at, an analysis of role distribution over time was conducted. The results indicated a gradual increase in user activity within the forum. Moreover, the study assessed the co-occurrence of roles in users’ long-term behavior and measured the frequency of role changes. This analysis aimed to determine whether users have consistent roles or exhibit various roles, which may depend on time or context.

Devising Scrutable User Models for Time Management Assistants

  • Jovan Jeromela
  • Owen Conlan

Intelligent Personal Assistants (IPAs) have become ubiquitous through integration into smartphones, smart speakers, and standalone devices. However, prior studies raised noteworthy usability concerns and determined that IPAs remain primarily used for simple tasks. Such findings contrast with the reported user aspirations for more proactive and truly personalised IPAs. By focusing on the use case of time management, in this paper, we contemplate how scrutability – i.e. the ability of the user to study their assistant and its underlying user model – fits within the vision of more complex IPAs. Furthermore, we describe our ongoing project investigating user interest in and expectations of the scrutability of a proactive calendaring assistant. Lastly, by deliberating on the challenges and benefits of making IPAs scrutable, this paper outlines potential avenues for further research.

Explanations in Open User Models for Personalized Information Exploration

  • Rully Agus Hendrawan
  • Peter Brusilovsky
  • Arun Balajiee Lekshmi Narayanan
  • Jordan Barria-Pineda

Open user models provide affordance for a transparent user control over recommendations based on shared symbolic representation within the system. Users must build their user profile by adding these symbols and tuning their importance to get meaningful recommendations. Since the link between these symbols and the reference explanation is often unavailable, it can be difficult for users to understand them. These symbols are often referred to as concepts, tags, areas, topics, labels, features, or keyphrases. This study showcases an information exploration system that helps students identify potential faculty members to collaborate with. The system works by matching user and faculty profiles that contain keywords or phrases representing topics/areas of interest. Students must develop their understanding of research topics while building their profiles, which can become challenging as they add more keywords. To support students in controlling the recommendation, we introduce post hoc explanations with three levels of detail: no explanations, individual explanation for topics, and explanation of the relationships between topics. This study explores how explanation is associated with the user context / tasks and the exploration process. Our observation suggests that expertise in the field is linked to exploring fewer novel topics and seeking fewer explanations but engaging more with explanations of relationships. In addition, we found that the engagement with faculty information is moderately correlated with the use of more advanced explanations.

Human Pose Estimation for Explainable Corrective Feedbacks in Office Spaces

  • Gaetano Dibenedetto
  • Marco Polignano
  • Pasquale Lops
  • Giovanni Semeraro

In working environments where prolonged sitting is ubiquitous, maintaining correct posture is crucial to alleviating musculoskeletal problems and improving general well-being. This research presents an innovative approach for assessing sitting postures using any camera conveniently accessible to the subject, be it integrated into their personal computer or an external device. The main contribution of our system is to detect key posture points and incorrect postures and provide the final user with personalized feedback and explanations to help them correct their postural alignment. Using a simple architecture i.e. the multilayer perceptron, we succeeded in identifying human postures. Moreover, by evaluating the results obtained from our classification model, we are able to obtain an explanation based on a post-hoc analysis. We observed that the most impactful joints for correct posture are, in order of importance, the wrists, elbows, and shoulders. Inspired by counterfactual explanations, we consequently provide personalized feedback for users with incorrect postures. These outcomes, although preliminary, show that the proposed pipeline is encouraging and can be pursued in future work where increasing the variety of data and improving detection approaches will be predominant. The reproducibility code is available here: https://github.com/GaetanoDibenedetto/Explainable-Corrective-Feedbacks

LLM-generated Explanations for Recommender Systems

  • Sebastian Lubos
  • Thi Ngoc Trang Tran
  • Alexander Felfernig
  • Seda Polat Erdeniz
  • Viet-Man Le

Users are often confronted with situations where they have to decide in favor or against an offered item, like a book, movie, or recipe. Those suggested items are commonly determined by a recommender system, which considers personal preferences to identify relevant items. However, those systems often lack transparency and comprehensibility in revealing why a specific item is recommended. For this purpose, explanations have been added as a powerful tool to help users with their final decisions. In this paper, we present and evaluate the capabilities of a Large Language Model (LLM) to come up with high-quality explanations to further improve the support of users for three different recommendation approaches, including feature-based recommendation, collaborative filtering, and knowledge-based recommendation. We explain how an LLM can be applied to generate personalized explanations and evaluate the explanation goals in an online user study. Our findings highlight that LLM-generated explanations are highly appreciated by users as they help in the evaluation of recommended items. Furthermore, we discuss which characteristics of the LLM-based explanations were perceived positively and how those findings can be used for future research.

Perception versus Reality: Evaluating User Awareness of Political Selective Exposure in News Recommender Systems

  • Alain Starke
  • Anders Sandvik Bremnes
  • Erik Knudsen
  • Damian Trilling
  • Christoph Trattner

News Recommender Systems (NRSs) have become increasingly pivotal in shaping the news landscape, particularly in how news is disseminated. This has also led to concerns about information diversity, especially regarding selective exposure in the realm of political news. Users may not recognize that news content presented to them is subject to selective exposure, through users that incorporate political beliefs. Within the U.S. two-party system, our research explores the interactions between NRSs and users’ ability to discern news articles that align with their political biases. We performed an online experiment (N = 160) to address the issue of user awareness and self-recognition of selective exposure within NRSs. Users were asked to select any number of news articles that matched their political orientation (i.e., Democrat or Republican) from a list of 50 news articles (5 Democrat, 5 Republican, 40 filler articles), which were either ranked saliently towards their political orientation or randomly. Contrary to expectations, our findings reveal no significant difference in article selection between participants exposed to a baseline random order and those who where presented with the more salient and easy to select version. We did observe that Republicans performed worse than Democrats in identifying aligning articles, based on precision and recall metrics.

ScrollyPOI: A Narrative-Driven Interactive Recommender System for Points-of-Interest Exploration and Explainability

  • Ibrahim Al-Hazwani
  • Tiantian Luo
  • Oana Inel
  • Francesco Ricci
  • Mennatallah El-Assady
  • Jürgen Bernard

Recommender systems can help web users find more relevant content, improve their online experience, and support them in the discovery of new Points-of-Interest (POI). Yet, challenges persist in dealing with the cold-start problem and in recommendation explainability. To address these, we have created ScrollyPOI, an interactive POI recommender system based on Data Humanism principles. Utilizing scrollytelling, we address the cold-start problem by engaging users in reflecting on previous positive experiences. Additionally, ScrollyPOI enhances explainability through input and output explanations. The system uses stacked bar charts and word clouds to explain how user preferences inform recommendations (input). Finally, ScrollyPOI employs a multi-layered approach to explain why specific POIs are recommended (output). We have evaluated ScrollyPOI’s interface and experience through a preliminary study, highlighting its potential for transparent explanations in the POI recommendation domain. Our findings underscore ScrollyPOI’s efficacy in collecting preferences and enhancing recommendation transparency, positioning it as a platform for studying explainability goals in the POI domain.

The Effect of Proactive Cues on the Use of Decision Aids in Conversational Recommender Systems

  • Yuan Ma
  • Jürgen Ziegler

Conversational recommender systems (CRS) are increasingly able to conduct complex conversations with the user due to the advances in natural language processing techniques. As a consequence, dialogs in CRS can go beyond the mere recognition of product-related intents, offering decision aids that let users, for example, compare or critique recommendations. Users may, however, not be aware of such functionality and fail to actively query the system for such functions. It may thus be helpful, if the system proactively provides cues for using the advanced functions. Whether and how such proactive cues are perceived and used by the user has not been investigated yet. We report an online study investigating user interaction behavior under two interaction schemes: a proactive scheme prompting the user to use the functions and a passive scheme without prompts where users can freely enter their requests. To compare the two schemes, we implemented a chatbot in the domain of smartphones. Two groups of participants on Prolific (total n=270) used the system, which operates either with a proactive or passive scheme. In addition to interaction data, we measured several psychological factors and users’ subjective assessment of the system through questionnaires. We mainly found: 1. There is no significant difference in user experience under the two schemes. 2. Users tend to accept prompts when the system provides them. 3. Overall, more intuitive users (as measured by an established decision-making style instrument) tend to accept the system’s prompts more than more rational persons. 4. A tendency that acceptance of interaction prompts aligns with self-reported dialog-related preferences. The results provide heuristic suggestions for dialog strategy design and the potential personalization of CRS.

SESSION: GMAP 2024: 3rdWorkshop on Group Modeling, Adaptation and Personalization

GMAP 2024: 3rd Workshop on Group Modeling, Adaptation and Personalization

  • Francesco Barile
  • Amra Delić
  • Ladislav Peska
  • Isabella Saccardi
  • Federica Vinella

While most recommender systems cater to individual users’ needs, there are numerous situations where these systems are needed to meet groups’ demands. These systems are broadly labelled as Group Recommender Systems (GRSys). Traits like interpersonal relationships, group mood, and emotional contagion are essential to fulfilling the group’s needs. However, the group’s characteristics are frequently ill-defined and dynamic and are typically absent from systems modeling. Moreover, GRSys must maneuver between the needs of the group and the individuals when opinions differ and can contradict each other. The third edition of GMAP proposes consolidating a community of scholars interested in group modeling, adaptation, and personalization. Through the workshop, researchers continue their examination of the difficulties and possibilities of creating efficient procedures and instruments to facilitate collective decision-making. GMAP 2024 offered this unique opportunity to gather scholars from different fields to enrich discussions over GRSys’ research. The workshop also allowed attendees to strengthen their networks and establish new connections conducive to cutting-edge collaborative research.

A Preliminary Study of the Impact of Personality on Satisfaction in Group Contexts

  • Francesco Barile
  • Federico Maria Cau
  • Nava Tintarev

When groups of people conduct recommended activities together, several factors affect their individual satisfaction. This includes the satisfaction of the other group members, due to phenomena such as interpersonal social influence and emotional contagion. To model the change in individual satisfaction in a group context, it is important to consider factors like the individual personality and the type of relationships between group members. This research aims to investigate how individual satisfaction related to a group activity is influenced by the preferences of other people present. We focus on groups with two members, in the context of listening to music. We performed an exploratory user study (n=26; 13 pairs), finding that people adjust their preferences to their companion, but we did not see an effect of the impact of tie strength (friend vs. stranger). We also found indications that personality traits can predict whether participants are more (Dominating and Avoiding), or less likely (Agreeableness) to adjust their preferences to their companion. Although influenced by the limited sample size, our results support the hypothesis that individual personalities impact the individual satisfaction shift. However, measuring different phenomena that can affect users’ behaviors in the considered experimental setting seems crucial, and deeper investigations with a larger sample size are necessary to produce more robust results.

Anticipating Eating Preferences in Group Decision Making

  • Hanif Emamgholizadeh
  • Amra Delić
  • Francesco Ricci

In a decision-making scenario, one group member may need, independently from the other members, to choose an item, e.g., a restaurant, that will be experienced by the group. In a prior research on restaurant recommendation, we have identified two primary tasks of such an organizer of a lunch/dinner event, who is in charge of selecting a proper restaurant for the group: anticipating the other group members’ preferences, to properly enter these preferences in the recommender system, and reconciling incompatible preferences, if they arise in the elicitation phase. To support the first task, we augmented a group recommender system with a machine learning model that predicts group members’ food preferences, about dishes that can be consumed in the restaurant, based on other available information about the members (demographics and preferred cuisine). However, in a user study, we found that supporting functionality to be not effective in improving the quality of the choices made by the organizer.

In this paper, we investigate the causes of this poor performance. We analyze the possibility that the poor performance may relate to: the deployed ML models used for food preference prediction, the amount of data about food preferences used for training the ML model, and the complexity of the preference anticipation task. The results of our experiments suggest that neither the ML model nor the scarcity of the preference data is responsible for the observed poor value of the implemented preference recalling function. While a user study seems to confirm the impossibility of accurately predicting food preferences from the considered user’s characteristics (demographics and preferred cuisine).

Evaluating Group Fairness in News Recommendations: A Comparative Study of Algorithms and Metrics

  • Blake Huebner
  • Thomas Elmar Kolb
  • Julia Neidhardt

Beyond accuracy metrics, such as fairness and diversity, have become widely studied topics in recommender systems. Improving these metrics is important not only from an ethical and legal perspective, but can also improve overall user satisfaction. Although these metrics are widely discussed, very little empirical research has been done, especially comparing multiple algorithms across different metrics. This work explores the role of fairness and diversity in news recommender systems, specifically in the context of the Austrian media landscape. This study aims to identify the most effective approaches for generating fair and diverse news recommendations, while addressing the potential negative consequences of biased recommendations and filter bubbles, such as societal polarization and the suppression of information. This includes an extensive literature review of relevant group unfairness metrics and state-of-the-art fairness-aware algorithms. A dataset of articles from an Austrian newspaper was used for empirical research, with analysis performed on fairness, and diversity of recommendations. The key message of the study is that accuracy and fairness can be achieved simultaneously with the right modeling approach, while diversity can be held constant using these modeling techniques. The study recommends the use of Personalized Fairness based on Causal Notion models for accuracy and reducing certain unfairness metrics, and finds Fairness Objectives for Collaborative Filtering models more effective at reducing other types of unfairness. The findings contribute to the field by demonstrating the importance of incorporating these metrics into the design and evaluation of recommender systems.

Towards automated feedback to a team member on their performance

  • Judith Masthoff
  • Isabella Saccardi

Teamwork is prevalent in many settings including in higher education. Unfortunately, teamwork does not always go well. We previously studied how a peer assessment survey can be used by individuals to give their opinions on their teammates. This allows somebody (such as a teacher in an educational setting) to monitor how well the teams are doing. We also studied how to provide support to the person who fills out the survey and adapt such support to their personality and the ratings they have given. This paper focuses on another aspect, namely what feedback to provide to a person who has been rated by their teammates. We discuss the challenges of providing such feedback automatically and provide a research agenda for solving this issue.

User Experience of Recommender System: A User Study of Social-aware Fashion Recommendations System

  • Aayesha Aayesha
  • Muhammad Afzaal
  • Julia Neidhardt

User experience, which encompasses users’ feelings and perceptions, is regarded as a key element in the evaluation of recommender systems. The existing literature extensively works on recommendation generation strategies with focus on the accuracy by considering objective aspects of the system. Although some of the current works considered subjective aspects of the recommendation systems from a user-centric perspective to evaluate the recommender system, however, a comprehensive analysis that could investigate factors to improve user experience was of limited focus. In this paper, we propose a methodology that provides a comprehensive multi-perspective analysis of a social-aware fashion recommender system and analyses the impact of user’s personal attributes and profiles on their experiences in various aspects of system use. A user study was conducted to realize the proposed methodology. The obtained insights highlighted that user experiences vary not only from the perspective of using a recommender system but also by varying their personal attributes (age, gender, hobby) and profiles.

SESSION: HAAPIE 2024: 9th International Workshop on Human Aspects in Adaptive and Personalized Interactive Environments

HAAPIE 2024: 9th International Workshop on Human Aspects in Adaptive and Personalized Interactive Environments

  • Panagiotis Germanakos
  • Vania Dimitrova
  • Ben Steichen
  • Bruce Ferwerda
  • Marko Tkalcic

Nowadays, the profound digital transformation has upgraded the role of the computational system into an intelligent multidimensional communication medium that creates new opportunities, competencies, models and processes. The need for human-centered adaptation and personalization is even more recognizable since it can offer hybrid solutions that could adequately support the rising multi-purpose goals, needs, requirements, activities and interactions of users. The HAAPIE workshop1 embraces the essence of the “human-machine co-existence” and brings together researchers and practitioners from different disciplines to present and discuss a wide spectrum of related challenges, approaches and solutions. In this respect, the ninth edition of HAAPIE includes 4 long papers and 2 short papers.

Challenges and Strategies in Personalised Planning Support for University Students with Autism

  • Robin Cromjongh
  • Maria Młocka
  • Almila Akdag
  • Judith Masthoff
  • Hanna Hauptmann

Students in higher education with Autism Spectrum Disorder (ASD) face many challenges that might differ from their neurotypical peers. One area where students with autism face deficits is planning. This paper looks into the challenges faced and the strategies applied for planning by students with autism compared to neurotypical students. We aim to identify where personalisation and adaptivity may help them become independent and effective in their planning behaviour. This research indicates which personalisation needs designers of assistive technologies should consider for planning and task management. We present an online survey with 30 neurotypical students (NTS) and 34 students with (self-)diagnosed autism (ASDS) and interviews with six students with autism for more in-depth insights. Results indicate that ASDS experience problems with gaining a clear overview of what they need to do, knowing when they might do it, and following that plan to execution. In contrast to NTS, they also struggle with fitting routine household and self-care tasks into their schedule. We identified time-independent planning, identifying external pressure and defining sub-tasks as promising planning strategies for students with autism when combined with adaptation to their personal needs.

Exploring Adaptive Social Comparison for Online Practice

  • Kamil Akhuseyinoglu
  • Emma Mcdonald
  • Aleksandra Klasnja Milicevic
  • Carrie Demmans Epp
  • Peter Brusilovsky

Students experience motivational issues during online learning which has led to explorations of how to better support their self-regulated learning. One way to support students uses social reference frames or social comparison in student-facing learning analytics dashboards (LADs) and open learner models (OLMs). Usually, the social reference frame communicates class averages. Despite the positive effects of class-average-based social comparison on students’ activity levels and learning behaviors, comparison to class average can be misleading for some students and offer an irrelevant reference frame, motivating only low or high performers. Such conflicting findings highlight a need for an investigation of social reference frames that are not based on the “average” student. We extend the research on social comparison in education by conducting two complementary classroom studies. The first explores the effects of different fixed social reference frames in a non-mandatory practice system, while the second introduces an adaptive social reference frame that dynamically selects the peers who serve as a comparison group when students are engaged in online programming practice. We reported our analyses from both studies and shared students’ subjective evaluations of the system and its adaptive comparison functionality.

Harmonizing Ethical Principles: Feedback Generation Approaches in Modeling Human Factors for Assisted Psychomotor Systems

  • Miguel Portaz
  • Angeles Manjarres
  • Olga C. Santos

As the demand for personalized and adaptive learning experiences increase, there is a urgent need for providing effective feedback mechanisms within critical systems, such as in psychomotor learning systems. This proposal introduces an approach for the integration of retrieval-augmented generation tools to provide comprehensive and insightful feedback to users. By combining the strengths of retrieval-based techniques and generative models, these tools offer the potential to enhance learning outcomes by delivering tailored feedback that is both informative and engaging. The proposal also emphasises the importance of incorporating explainability and transparency concepts. Following the hybrid intelligence paradigm it is possible to ensure that the feedback provided by these tools is not only accurate but also understandable to humans. This approach fosters trust and promotes a deeper understanding of the psychomotor learning process, empowering users and facilitators to make informed decisions about the psychomotor learning path. The hybrid intelligence paradigm, which combines the strengths of both human and artificial intelligence, plays a crucial role in the deployment of these solutions. By taking advantage of the cognitive capabilities of human experts alongside the computational power of artificial intelligence algorithms, it is possible to offer personalised feedback that takes into account both technical accuracy and pedagogical effectiveness. Through these collaborative efforts it is also possible to create learning environments that are inclusive, adaptable, and beneficial to lifelong learning. In conclusion, this proposal introduces retrieval-augmented generation tools for providing feedback in psychomotor learning systems, which represents a significant step towards in its personalization, and whose ethical implications align with the new regulations on the implementation of intelligent technologies in critical systems.

Navigating Serendipity – An Experimental User Study On The Interplay of Trust and Serendipity In Recommender Systems

  • Irina Nalis
  • Tobias Sippl
  • Thomas Elmar Kolb
  • Julia Neidhardt

Recommender systems play a crucial role in our daily lives, constantly evolving to meet the diverse needs of users. As the pursuit of improved user experiences continues, metrics such as serendipity have emerged within the realm of beyond-accuracy paradigms. However, integrating serendipitous recommendations presents complex challenges, necessitating a delicate balance between novelty, relevance, and user engagement. In this interdisciplinary experimental user study, we address these challenges within the context of a book recommender system. By investigating the impact of interface design changes on user trust, a key determinant of satisfaction with serendipitous recommendations, we measured trust levels for both individual recommended items and the recommender system as a whole. Our findings indicate that while interface enhancements did not yield significant increases in trust, they did notably elevate serendipity ratings for previously unknown books. These results highlight the intricate interplay between technical and psychological factors in the design of recommender systems, emphasizing the importance of human-centered approaches in the creation of more responsible AI applications. This research contributes to ongoing discussions surrounding user-centric recommendation systems and aligns with broader themes of digital humanism and responsible AI.

Towards Integrating Human-in-the-loop Control in Proactive Intelligent Personalised Agents

  • Awais Akbar
  • Owen Conlan

This research explores the integration of Human-in-the-loop (HITL) control within Proactive Intelligent Personalised Agents (PIPAs) that possess the capability to proactively anticipate users’ needs and perform tasks on their behalf. The proactive assistance offered by PIPAs is tailored to individual users’ preferences and behaviours. However, it is crucial to personalise the level of proactivity exhibited by PIPAs to align with users’ preferences regarding the potential delegation of autonomy. This necessitates HITL control to regulate PIPAs’ autonomy levels, ensuring appropriate user involvement in decision-making. Through a simulation-based approach, this research explores the integration of HITL control in PIPAs. It investigates the conditions that trigger HITL control, its mechanisms, and the challenges associated with these triggers.

Using Large Language Models for Adaptive Dialogue Management in Digital Telephone Assistants

  • Hassan Soliman
  • Milos Kravcik
  • Nagasandeepa Basvoju
  • Patrick Jähnichen

The advent of modern information technology such as Large Language Models (LLMs) allows for massively simplifying and streamlining the communication processes in human-machine interfaces. In the specific domain of healthcare, and for patient practice interaction in particular, user acceptance of automated voice assistants remains a challenge to be solved. We explore approaches to increase user satisfaction by language model based adaptation of user-directed utterances. The presented study considers parameters such as gender, age group, and sentiment for adaptation purposes. Different LLMs and open-source models are evaluated for their effectiveness in this task. The models are compared, and their performance is assessed based on speed, cost, and the quality of the generated text, with the goal of selecting an ideal model for utterance adaptation. We find that carefully designed prompts and a well-chosen set of evaluation metrics, which balance the relevancy and adequacy of adapted utterances, are crucial for optimizing user satisfaction in conversational artificial intelligence systems successfully. Importantly, our research demonstrates that the GPT-3.5-turbo model currently provides the most balanced performance in terms of adaptation relevancy and adequacy, underscoring its suitability for scenarios that demand high adherence to the information in the original utterances, as required in our case.

SESSION: HUMAD 2024: International Workshop on Human-Centered Modeling and Adaptation for Digital Transformation

HUMAD 2024: International Workshop on Human-Centered Modeling and Adaptation for Digital Transformation

  • Simone Balloccu
  • Alessandro Sebastian Podda
  • Livio Pompianu
  • Roberto Saia
  • Angelo Antonio Salatino

While digital transformation brings broad positive impacts, its rapid propagation varies across territories, accentuating disparities not only in the social and organisational contexts but even in industrial sectors. Acknowledging this digital divide, the HUMAD workshop aimed to reshape industrial and digital landscapes for equity and accessibility, specifically emphasising the pivotal role of user modelling and artificial intelligence. Discussions centred on personalised user modelling and holistic, human-centred approaches tailored for an industrial context. The workshop explored applications in critical sectors, including smart cities, tourism, industrial production, healthcare, education, and well-being, focusing on delivering quality digital interactions and services tuned to industrial and human-centric needs. The objective was to elevate the quality of digital interactions and services, ensuring a more uniform and inclusive distribution of the benefits of industrial digital transformation. Beyond immediate concerns, HUMAD delved into user modelling’s potential to drive growth in disadvantaged territories. By comprehending the unique challenges these regions face, the aim was to cultivate sustainable solutions within an industrial framework. In the light of the above, we invited contributions that navigate the intricacies of user modelling, proposing innovative strategies to reduce disparities and develop a more equitable, human-centric digital future.

An Initial Investigation of Mental Well-being Monitoring through Personal Healthcare Devices

  • Silvia Maria Massa
  • Eleonora Porcu
  • Daniele Riboni

Smart sensory devices, such as smart watches, scales, and blood pressure gauges, are increasingly adopted by individuals aiming to improve their health and fitness. Those devices gather extensive data about cardiovascular parameters, physical activities, sleep quality, and behavior. Thanks to data analytics and artificial intelligence algorithms, they provide insights into the health status of individuals. Derived data is used to support self-care interventions and to provide practitioners with additional health information acquired on a continuous basis. However, most of the current solutions focus on the physical dimension of health, while the mental dimension is often neglected. In this paper, we present the initial investigation of a system to recognize a wide range of psychological parameters, including behavioral inhibition/activation, anxiety, and stress, leveraging data acquired from personal healthcare devices. We experimented with the application of different supervised learning algorithms on features extracted from heart, sleep, and inertial sensor data acquired from a cohort of 21 individuals over 24 hours each. Our preliminary findings suggest that our method may yield promising outcomes in recognizing different aspects of mental well-being. However, due to the limited size of the used dataset, a more comprehensive experimental evaluation, with a broader number of participants and carried out over an extended monitoring period, is imperative to substantiate the results.

Blockchain Technology for Certifying Waste Management within the digital transformation for industry and SME

  • Lodovica Marchesi
  • Maria Ilaria Lunesu
  • Roberto Tonelli

The need for recycling and a circular economy has grown significantly in recent years, accompanied by challenges such as greenwashing and fraudulent recycling practices. Blockchain technology is a new emerging technology that aims to facilitate information sharing, enable comprehensive government oversight, and establish effective incentive mechanisms. This paper explores digital transformation within blockchain frameworks to revolutionize waste management certification processes. Through the description of a real-world scenario, involving the recycling of food containers, in our case study pizza boxes within a pizzeria, this study highlights the tangible benefits, and challenges encountered from integrating blockchain technology into waste management processes. This case study demonstrates the potential for technology-driven solutions to address environmental challenges while fostering community engagement and incentivizing sustainable practices with the digital transformation of SMEs.

Charting the Landscape of Digital Health: Towards A Knowledge Graph Approach to News Media Analysis

  • Vanni Zavarella
  • Diego Reforgiato
  • Sergio Consoli
  • Gianni Fenu

In this paper, we present our currently on-going work on a method for analyzing digital health transformation in our society by constructing a Knowledge Graph from a large corpus of 7.8 million English news articles, dating from 1987 through 2023. We firstly sampled around 95k articles relevant to the Digital Health topic by training and deploying a Deep Learning binary classifier via fine-tuning BERT. Successively, by deploying NLP techniques, we extracted triples from the identified articles to form a Digital Health News Knowledge Graph, which consists of 431k distinct triples connecting 186k entities through 1866 relations. The constructed Knowledge Graph provides insights into the evolution of Digital Health in news media and serves as a resource for further research in the field. The analysis that we have carried out reveals significant trends in Digital Health as reflected in the news, with notable peaks coinciding with key events like the COVID-19 pandemic.

Design of an AI-driven Architecture with Cobots for Digital Transformation to Enhance Quality Control in the Food Industry

  • Paola Busia
  • Claudio Marche
  • Paolo Meloni
  • Diego Reforgiato Recupero

In recent years, the rapid evolution of smart technologies has spurred enterprises to undergo digital transformations, revolutionizing their business processes and operations. This shift, known as Digital Transformation, has permeated diverse sectors, particularly impacting production systems. Notably, Artificial Intelligence (AI) and robotic automation have emerged as pivotal drivers in this transformation, promising enhanced efficiency and innovation in industrial digitization. This paper presents a novel architecture designed to facilitate digital transformation within enterprises, harnessing the capabilities of advanced collaborative robots (cobots) and cutting-edge image segmentation techniques. Focused on a practical scenario within a food production environment, our proposed architecture aims to seamlessly integrate a cobot and a camera in an automatic system for efficient cardboard disposal. Specifically, our attention is drawn to the challenge of differentiating sections of food packaging suitable for disposal from those contaminated with stains or organic residues, a task with significant implications for waste management efficiency. By leveraging a cloud-based architecture and deploying AI algorithms for image segmentation, localization, and robot guidance, our study showcases the tangible benefits and practical applicability of these methodologies in real-world settings. This research not only highlights the potential of AI-driven solutions in addressing specific industrial challenges but also underscores the broader impact of digital transformation on optimizing operational processes and driving innovation across sectors.

Exploring Architectural Choices and Emerging Challenges in Data Management for IoT: A Focus on Digital Innovation and Smart Cities

  • Silvio Barra
  • Ferdinando D’Alessandro
  • Oleksandr Sosovskyy

Digital innovation is an important and evolving process that refers to the transformation and integration of digital technologies into various aspects of everyday life and work. In this context, innovative digital solutions are often adopted and developed, including the Internet of Things (IoT). In order to fully understand the potential, trends, and challenges related to the implementation of these technologies in data management, we seek to identify the most promising architectural choices for an IoT infrastructure from the perspective of key aspects in the context of Data Management. In this way, we aim to provide a comprehensive overview of IoT systems to guide organizations towards more informed decisions, offering cutting-edge solutions and contributing to digital transformation. This work aims to examine aspects such as Data Collection, Data Aggregation, Data Integration, Data Security, Data Retention, and Data Analysis, with particular attention to emerging challenges in these contexts. Within the context of digital innovation, our study focuses on several areas of interest. For example, in the field of smart cities, we explore how the use of Cloud, Fog, and Edge Computing technologies can improve video surveillance and promote the use of Artificial Intelligence (AI) for better management of smart cities. Cloud computing, characterized by centralized data processing and storage, represents a well established paradigm of IoT infrastructure. Fog computing extends these capabilities to the network edge, bringing computation and data storage closer to the data source, thereby reducing latency and enhancing real-time processing. Edge computing takes this concept further by processing data directly at or near the data source, minimizing the need for data to traverse long distances to centralized hubs, thus enabling even faster response times and improved efficiency. Additionally, we examine how human-centric methods and user modeling tools can facilitate the transition to intelligent environments, such as workplaces, healthcare, and cities. Considering the emerging challenges in IoT data management, our work provides an overview of the solutions offered by Cloud, Fog, and Edge Computing, enabling industries and small and medium-sized enterprises to adopt targeted and sustainable digital transformation strategies.

Modelling Patient-Therapist Collaboration for Brain Injury Rehabilitation in Virtual Reality

  • Stephanie Elena Crowe
  • Bahareh Shahri
  • Thammathip Piumsomboon
  • Simon Hoermann
  • Annalu Waller

In the era of digital transformation, the utilisation of Virtual Reality (VR) applications in brain injury rehabilitation presents a novel frontier with vast potential. However, concerns persist about how traditional communication pathways between therapists and patients can be transformed into digital equivalents. Immersive virtual reality submerses patients in a virtual world, with therapists taking the role of external spectators. This has implications for the therapeutic alliance (TA), which is based on interpersonal bonds and effective communication between therapist and patient. This study investigated the use of immersive VR in clinical settings for the rehabilitation of acquired brain injury (ABI). The aim was to understand how to design a smart virtual therapeutic system that can be adapted to various needs of patients and therapists during the intervention. Through observation and semi-structured interviews, patient-therapist interactions were analysed during VR-assisted and traditional rehabilitation sessions, revealing challenges related to effective collaboration. Findings indicated changes in interaction and communication with the introduction of VR: informal dialogue decreased, while shared laughter and therapist feedback increased. Despite the visual disconnect imposed by VR headsets, therapists continued to use guiding gestures and engaged with patients. Interviews highlighted the importance therapists place on rapport-building, and both relied on each other for expertise and reassurance. This research offers valuable insights into the potential to transit, transform and enhance traditional rehabilitation practices with the use of virtual reality and suggests avenues for human-centred design of VR applications tailored to ABI rehabilitation and TA, ultimately enhancing therapeutic outcomes in this domain.

Modelling Users for User Modelling: Dynamic Personas for Improved Personalisation in Digital Behaviour Change

  • Igor Matias
  • Farhat-ul-Ain
  • Darina Akhmetzyanova
  • Vladimir Tomberg

Behaviour change, crucial in health interventions, demands an understanding of the end user’s psychological, social, and environmental contexts, often overlooked in user modelling. This paper advocates integrating behaviour change theories into user models to improve personalisation based on individual characteristics. We introduce a novel, enriched with psychological theories, framework for using dynamic user Personas within Digital Behaviour Change Interventions, allowing adaptation to evolving user behaviours and preferences. The dynamic user Personas offer a detailed representation that accounts for the complex stages of behavioural change and COM-B patterns. We also explore the transition in user modelling from expert-driven to data-driven methods, highlighting the importance of acknowledging both interpersonal and intrapersonal variations in user behaviours. Our proposed semi-automated system model merges these modelling methods, enhancing personalisation in health interventions by addressing human behavioural complexities. Additionally, we discuss the ethical and privacy issues involved, ensuring responsible and secure data management. In conclusion, our approach conceptually links behaviour change, interaction design, and user modelling, setting a foundation for further empirical research and digital health applications.

RoadSense3D: A Framework for Roadside Monocular 3D Object Detection

  • Salvatore Carta
  • Modesto Castrillón-Santana
  • Mirko Marras
  • Sondos Mohamed
  • Alessandro Sebastian Podda
  • Roberto Saia
  • Marco Sau
  • Walter Zimmer

Utilizing monocular cameras for 3D object understanding is widely recognized as a cost-effective approach, spanning applications such as autonomous driving, augmented/virtual reality or roadside monitoring. Despite recent progress, persistent challenges arise in creating generalized models adaptable to unforeseen scenarios and diverse camera configurations. In this work, we focus on the task of monocular 3D object detection within roadside environments. To begin, we introduce a versatile methodology for generating and labeling datasets tailored to roadside scenarios, addressing limitations encountered in real-world settings. Subsequently, we develop an array of deep learning models tailored to this task, refining them to address practical challenges that emerge during real-world application. Lastly, leveraging our framework, we curated a synthetic benchmark dataset comprising 1,415,680 frames and 8,902,636 labeled 3D objects, ultimately assessing the performance of existing models across all datasets.

Towards Knowledge Graph Refinement: Misdirected Triple Identification

  • Salvatore Carta
  • Alessandro Giuliani
  • Marco Manolo Manca
  • Leonardo Piano
  • Livio Pompianu
  • Sandro Gabriele Tiddia

In the current digital transformation scenario, Knowledge Graphs (KGs) represent an across-the-board instrument for representing knowledge in a structured form. Such tools allow to effectively enhance the performance of Artificial Intelligence models in manifold contexts, such as reasoning or information retrieval. Nevertheless, the effectiveness of KGs is often affected by the incorrect directionality of some of their edges, due in most cases to human error or the inefficiency of automatic and semi-automatic graph creation methods. This paper proposes a classification-based approach to identify misdirected triples within a KG, aiming to support and assist humans in creating graph refinement. Triples are the main component of KGs, and they model the connection between nodes with a <subject, predicate, object> form. Our proposal allows us to refine a KG by devising a classification-based approach for recognizing whether the subjects and objects are not compliant with the logic directionality of the corresponding predicate, meaning that they should be switched (e.g., the triple <U.S.A., is capital, Washington> should be inverted as <Washington, is capital, U.S.A.>). We compare traditional machine learning techniques with cutting-edge advanced methods, including pre-trained language models and large language models. Extensive experiments have been performed across several datasets, confirming the effectiveness of our proposal.

Towards Zero-shot Knowledge Graph building: Automated Schema Inference

  • Salvatore Carta
  • Alessandro Giuliani
  • Marco Manolo Manca
  • Leonardo Piano
  • Sandro Gabriele Tiddia

In the current Digital Transformation scenario, Knowledge Graphs are essential for comprehending, representing, and exploiting complex information in a structured form. The main paradigm in automatically generating proper Knowledge Graphs relies on predefined schemas or ontologies. Such schemas are typically manually constructed, requiring an intensive human effort, and are often sensitive to information loss due to negligence, incomplete analysis, or human subjectivity or inclination. Limiting human bias and the resulting information loss in creating proper Knowledge Graphs is paramount, particularly for user modeling in various sectors, such as education or healthcare. To this end, we propose a novel approach to automatically generating a proper entity schema. The devised methodology combines the language understanding capabilities of LLM with classical machine learning methods such as clustering to properly build an entity schema from a set of documents. This solution eliminates the need for human intervention and fosters a more efficient and comprehensive knowledge representation. The assessment of our proposal concerns adopting a state-of-the-art entity extraction model (UniNER) to estimate the relevance of the extracted entities based on the generated schema. Results confirm the potential of our approach, as we observed a negligible difference between the topic similarity score obtained with the ground truth and with the automatically generated schema (less than 1% on average on three different datasets). Such an outcome confirms that the proposed approach may be valuable in automatically creating an entity schema from a set of documents.

SESSION: PATCH 2024: 15th International Workshop on Personalized Access to Cultural Heritage

15th International Workshop on Personalized Access to Cultural Heritage (PATCH 2024)

  • Liliana Ardissono
  • Tsvi Kuflik
  • Noemi Mauro
  • George Raptis
  • Alan Wecker

Following the successful series of PATCH workshops, PATCH 2024 will be again the meeting point between state-of-the-art cultural heritage research and personalization – using any technology, while focusing on ubiquitous and adaptive scenarios, to enhance the personal experience in Natural and Cultural Heritage sites. The workshop aims to bring together researchers and practitioners interested in exploring the potential of ICT technology (onsite and online) to enhance the visit experience. The expected result of the workshop is sharing and discussing novel ideas and creating a multidisciplinary research agenda that will inform future research directions and hopefully, forge some research collaborations. This summary provides an overview of the papers accepted for presentation at the workshop and publication in its proceedings.

Artwork Segmentation in Eye-Tracking Experiments: Challenges and Future Directions

  • Alessio Ferrato
  • Carla Limongelli
  • Mauro Mezzini
  • Giuseppe Sansonetti
  • Alessandro Micarelli

Eye-tracking technology has gained prominence in cultural heritage studies, facilitating behavioral analysis and visitor engagement assessments. This paper explores the challenges and future directions of artwork segmentation in eye-tracking experiments, aiming to automate the identification of areas of interest. Although existing segmentation approaches, such as semantic segmentation models, show promise, they face limitations in accurately segmenting diverse artwork styles. We propose hybrid segmentation as a viable strategy, combining multiple techniques for improved accuracy. Through qualitative analysis, we evaluate segmentation models on public domain artworks, highlighting the strengths and weaknesses of each approach.

Exploring the Cultural Heritage of a Territory through the Cinematography Production

  • Liliana Ardissono
  • Vincenzo Lombardo
  • Diego Magro
  • Noemi Mauro
  • Andrea Nasi
  • Livio Scarpinati
  • Silvio Alovisio
  • Luca Mazzei

The cultural heritage of a place is mainly presented in websites and guides that describe Points of Interest focusing on specific topics or temporal windows. As such, they offer a detailed but fragmented viewpoint of the territory they describe and weakly support the provision of a longitudinal perspective on places and people’s lifestyles.

In the Revisualizing Italian Silentscapes (RevIS) project, we aim to achieve this perspective by exploiting cinematographic production through the multifaceted exploration of films and their scenes in a movie catalog. Providing advanced search support enhances the preservation and fruition of the cultural heritage related to early and recent movies. However, in this paper, we focus on exploring Points of Interest through the description of landscapes and related places or locations in films, a novel aspect we introduce. Specifically, we present a module of the RevIS app supporting the location-based exploration of movies.

Green Destination Recommender: A Web Application to Encourage Responsible City Trip Recommendations

  • Ashmi Banerjee
  • Tunar Mahmudov
  • Wolfgang Wörndl

Tourism Recommender Systems (TRS) have evolved from only providing user recommendations to becoming convergence points for multiple stakeholders. This necessitates recognizing the interests of all stakeholders, particularly in the tourism sector, which faces challenges like seasonality and resource constraints. Our stakeholder classification identifies consumers, item providers, platform, and society as key stakeholders, highlighting the complexity of real-world relationships. Fairness in TRS demands a multistakeholder approach, integrating sustainability to address the broader societal impact.

While research has focused on fair recommendation systems in tourism, the focus on generating sustainable recommendations remains limited. This demo paper aims to enhance fairness in TRS, mainly focusing on society as a stakeholder. We introduce the Green Destination Recommender (GDR), an application that prioritizes Societal Fairness (S-Fairness) by encouraging environmentally conscious decisions. GDR recommends sustainable tourism destinations based on the user’s starting location, travel month, and specific interests. The application promotes ecologically friendly options by recommending less popular yet appealing destinations, considering the emissions from transport to reach the destination and the seasonal demand to balance visitor numbers year-round.

Point-of-Interest Recommender Systems: Nudging towards Sustainable Tourism

  • Noemi Mauro
  • Livio Scarpinati
  • Fabio Ferrero
  • Angelo Geninatti Cossatin
  • Claudio Mattutino

With the increasing attention to environmental sustainability, matching user preferences and green behavior has become a must in several domains, including tourism. However, changing people’s traveling habits is hard and requires a relevant motivation effort. This short paper investigates the exploitation of digital nudges to promote sustainable tourism in personalized mobile guides for natural and cultural heritage exploration. The ultimate goal is to enhance point-of-interest recommender systems with the capability to drive users toward the selection of itineraries that they like and that can be managed by exploiting green means of transportation. For this purpose, we propose to integrate the recommendation of Points of Interest that satisfy the user’s interests with an explicit presentation of the environmental impact of traveling to such places, using digital nudges to drive the user toward the selection of sustainable tour management solutions.

The Curator’s Helper

  • Rotem Dror
  • Daniel Hutchinson
  • Mason Jones
  • Victoria Van Hyning
  • Tsvi Kuflik

Galleries, libraries, archives, and museums (GLAM) are distinct but interconnected institutions that play crucial roles in preserving, studying, and sharing knowledge, culture, and heritage (GLAM). Many hold and preserve unique artifacts that they make available to the public in various forms, on-site through exhibitions and displays, and remotely, through digital surrogates such as images, audio/video files, in digital exhibits and through various forms of description such as catalog, archival, and museum records management systems. Oftentimes these artifacts and resources need be delivered to the public in varying modes and for varying reasons: no one description suits all occasions and purpose. Short and accurate descriptions of. GLAM resources and artifacts are vital to the work of public engagement. The appearance of large language models and interfaces to interact with them such as Chatgpt4 opens new opportunities for automatic content creation, while posing also new challenges. In this position paper, we propose to use these new tools as an aid for the curator to create suggestions for content that may be used as descriptions of artifacts while enabling also the adaptation of the content for varied audiences and even to personal preferences.

Towards Identifying Visitor Types in Virtual Museums Using Spatial and Interaction Data

  • Filippos Panos
  • George E. Raptis
  • Christina Katsini
  • Christos Katsanos

Understanding the visitors’ behaviors can shape more immersive museum visits by providing personalized experiences. The first step to providing personalized experiences is to identify visitor types with common visit behavior characteristics. Several attempts have been made with promising results focusing on physical museums. Given the rise of virtual museum visits, in this paper, we take the first step toward identifying visitor types in virtual museums. To this end, we leverage data captured while visitors move within a virtual museum and interact with its exhibits. Using a machine learning approach, we identified four well-defined and distinct clusters that describe different types of virtual museum visitors.

SESSION: UCAI 2024: Workshop on User-Centered Artificial Intelligence

UCAI 2024: Workshop on User-Centered Artificial Intelligence

  • Daniel Buschek
  • Julian Frommel
  • Hanna Hauptmann
  • Hendrik Heuer
  • Aletta Smits

The proliferation of AI-based techniques poses a range of new challenges for the design and engineering of intelligent and adaptive systems since they tend to either act as black boxes or, more generally, not offer the user sufficient transparency, control, and interaction opportunities, which are considered major goals of user-centred design in the HCI field. This workshop aims to share and discuss recent developments at the intersection of HCI and AI and to explore novel methodological, technical, and interaction approaches. Researchers and practitioners with diverse disciplinary backgrounds can and should contribute to addressing the challenges in this emerging field of human-centred artificial intelligence.

“How Good Is Your Explanation?”: Towards a Standardised Evaluation Approach for Diverse XAI Methods on Multiple Dimensions of Explainability

  • Aditya Bhattacharya
  • Katrien Verbert

Artificial Intelligence (AI) systems involve diverse components, such as data, models, users and predicted outcomes. To elucidate these different aspects of AI systems, multifaceted explanations that combine diverse explainable AI (XAI) methods are beneficial. However, popularly adopted user-centric XAI evaluation methods do not measure these explanations across the different components of the system. In this position paper, we advocate for an approach tailored to evaluate XAI methods considering the diverse dimensions of explainability within AI systems using a normalised scale. We argue that the prevalent user-centric evaluation methods fall short of facilitating meaningful comparisons across different types of XAI methodologies. Moreover, we discuss the potential advantages of adopting a standardised approach, which would enable comprehensive evaluations of explainability across systems. By considering various dimensions of explainability, such as data, model, predictions, and target users, a standardised evaluation approach promises to facilitate both inter-system and intra-system comparisons for user-centric AI systems.

Representation Debiasing of Generated Data Involving Domain Experts

  • Aditya Bhattacharya
  • Simone Stumpf
  • Katrien Verbert

Biases in Artificial Intelligence (AI) or Machine Learning (ML) systems due to skewed datasets problematise the application of prediction models in practice. Representation bias is a prevalent form of bias found in the majority of datasets. This bias arises when training data inadequately represents certain segments of the data space, resulting in poor generalisation of prediction models. Despite AI practitioners employing various methods to mitigate representation bias, their effectiveness is often limited due to a lack of thorough domain knowledge. To address this limitation, this paper introduces human-in-the-loop interaction approaches for representation debiasing of generated data involving domain experts. Our work advocates for a controlled data generation process involving domain experts to effectively mitigate the effects of representation bias. We argue that domain experts can leverage their expertise to assess how representation bias affects prediction models. Moreover, our interaction approaches can facilitate domain experts in steering data augmentation algorithms to produce debiased augmented data and validate or refine the generated samples to reduce representation bias. We also discuss how these approaches can be leveraged for designing and developing user-centred AI systems to mitigate the impact of representation bias through effective collaboration between domain experts and AI.

The Need for User-centred Assessment of AI Fairness and Correctness

  • Simone Stumpf
  • Evdoxia Taka
  • Yuri Nakao
  • Lin Luo
  • Ryosuke Sonoda
  • Takuya Yokota

AI needs to be fair and robust, especially to meet demands of new regulation. Regular assessments are key but it is unclear how we can involve stakeholders without a background in AI in these efforts. This position paper provides an overview of the problems in this area, discusses the current work and looks ahead to future research needed to make headway in user-centric assessment of AI.

Towards Exploring Personalized Hyperlink Recommendations Through Machine Learning

  • Agorakis Bompotas
  • Panagiotis Triantafyllopoulos
  • George E. Raptis
  • Christina Katsini
  • Christos Makris

The Internet offers a wealth of content, making it increasingly difficult for users to navigate website information. The volume of hyperlinks on a website often leaves users struggling with content overload, hindering their ability to find relevant information of high interest. This problem highlights the critical need for tools to improve the user experience by providing personalized hyperlink recommendations on a specific website. This paper introduces HypeRec, a browser extension that attempts to address this problem by leveraging and comparing different machine learning and recommendation algorithms to guide users to content consistent with their interests and preferences. Our approach involves extracting hyperlinks from a webpage and subjecting the corresponding textual content to natural language processing techniques. In this way, it simplifies the users’ navigation within a website and promotes a more intuitive and satisfying web browsing experience.

SESSION: UKDE 2024: 1st InternationalWorkshop on User-Centered Practices of Knowledge Discovery in Educational Data

1st International Workshop on User-Centered Practices of Knowledge Discovery in Educational Data

  • Francesca Maridina Malloci
  • Paola Mejia
  • Agathe Merceron
  • Anna Monreale
  • Daniela Rotelli

By offering a large number of highly diverse resources, learning platforms have been attracting lots of participants, and the interactions with these systems have generated a vast amount of learning-related data. Their collection, processing and analysis have promoted a significant growth of machine learning and knowledge discovery approaches and have opened up new opportunities for supporting and assessing educational experiences in a data-driven fashion. Being able to understand students’ behavior and devise models able to provide data-driven decisions pertaining to the learning domain is a primary property of learning platforms, aiming at maximizing learning outcomes. However, the use of knowledge discovery in education also raises a range of ethical challenges including transparency, reliability fairness, and inclusiveness. In this workshop event, we focus on providing a common ground for researchers and practitioners working in this vibrant area, with the ultimate ambitious goal of bridging the UMAP community with the domain-oriented educational sister communities.

A Learning Analytics Dashboard for K-12 English Teachers – Bridging the Gap Between Student Process Data and Teacher Needs

  • Leona Colling
  • Mareike Kholin
  • Detmar Meurers

Educational technologies are being used more and more in secondary school settings. This increases the amount of students’ learning related data produced and stored. To keep up with this rise and to get most out of the collected data, teachers need digital tools that support and facilitate their pedagogical decision-making process. Learning analytics dashboards can be a good source to provide teachers with necessary insights into their students’ learning processes. However, for such tools to be effective and actionable, they have to be aligned with teachers’ needs and thus, provide and visualize data in a concise and structured way. We therefore conducted a survey study with 11 English teachers from K-12 secondary schools in Germany who evaluated the assumed usefulness of possible dashboard features. Based on these findings, we developed a teacher dashboard incorporating the most desired functionalities, such as a quickly accessible summary of strengths, weaknesses and support needs, or an overview of current misconceptions and competencies alongside additional metrics in order to support multiple teaching practices. The implementation and the underlying calculations are described, focusing on the importance of learners’ process data to provide teachers with a detailed and revealing view on their students’ and class learning states. In an evaluation study of the dashboard’s prototype with mock data, teachers (n=6) gave high ratings for the dashboard’s usability.

Collecting and Implementing Ethical Guidelines for Emotion Recognition in an Educational Metaverse

  • Dario Di Dario
  • Viviana Pentangelo
  • Maria Immacolata Colella
  • Fabio Palomba
  • Carmine Gravino

The metaverse represents a persistent, online 3D universe where people can interact, socialize, and work toward common goals. Education represents a key application domain, as it has the potential to enhance experiential learning and collaboration between learners and between learners and educators. However, challenges to the widespread adoption of educational metaverses persist. This paper focuses on emotional isolation, i.e., the feeling of emotional disconnection or loneliness, which can hinder learners’ motivation and participation. Machine learning-enabled emotional recognition systems have the potential to address this challenge, offering educators with feedback on the emotional states of learners within the metaverse. Yet, the integration of emotion recognition systems raises ethical concerns regarding consent, privacy, and algorithmic bias. In this short paper, we conduct a first step toward extracting ethical considerations from the literature on the use of emotion recognition in the educational metaverse. Then, we report these guidelines and finally implement one of the most critical —i.e., protection of privacy— within SENEM, an educational metaverse platform available in the literature. Through this research, we aim to raise awareness within the research community and promote responsible deployment of emotion recognition technology in educational metaverses, aiming to create a supportive and inclusive learning environment for all students.

Exploring Student Interactions with AI in Programming Training

  • Gianni Fenu
  • Roberta Galici
  • Mirko Marras
  • Diego Reforgiato

In recent years, the integration of artificial intelligence (AI) in education has collected significant attention due to its potential to revolutionize learning experiences and support student skill development. This study delves into the dynamics of student interactions with AI support within the domain of C programming education, with a specific focus on the utilization of ChatGPT, a conversational AI model, during training sessions. Through manual clustering analysis, this research unveils distinct patterns of student engagement, elucidating diverse problem-solving approaches and varying levels of interaction with ChatGPT. Our findings underscore the importance of acknowledging individual differences in learning strategies and preferences, highlighting the necessity for personalized educational interventions tailored to meet the diverse needs of learners. However, despite the strides made in AI-supported learning, gaps persist in the existing literature, particularly concerning our understanding of how students approach prompts and exercises when utilizing AI-driven educational tools. This research aims to address this gap by shedding light on the nuanced dynamics of student-AI interactions during training of C programming, offering insights into effective pedagogical strategies and instructional design principles for integrating AI technologies into educational settings. This study makes a significant contribution to the continuous endeavors of educators and AI developers by furthering the discussion on AI-facilitated learning. It aims to enhance student engagement, learning outcomes, and overall educational experiences through the integration of technology into learning environments.

Knowledge Graph-Based Recommendation System for Personalized E-Learning

  • Duaa Baig
  • Diana Nurbakova
  • Baba Mbaye
  • Sylvie Calabretto

Due to the large amount of available e-learning data, identifying relevant information from e-learning data presents significant challenges. A recommendation system is a popular solution to provide relevant data to any user but it also faces challenges such as scalability, processing large volumes of data, addressing the cold start problem, predicting personalized recommendations, and providing an adaptive recommendation, etc.. In this paper, we present an efficient knowledge graph-based recommendation framework, which can provide personalized e-learning recommendations to existing or new target learners without having enough historical data of that target learner. The proposed framework includes five modules i.e. Data Module, Knowledge Graph Representation Module, Community Building Module, Graph Embedding Module, and Recommendation Module. It is based on knowledge graphs to deal with huge amounts of data and to identify the hidden relationships between data. The proposed framework aims to provide personalized recommendations to learners, it utilizes a clustering-based community generation model for better identifying the interests or preferences of learners.

Learner-centered Ontology for Explainable Educational Recommendation

  • Neda Afreen
  • Giacomo Balloccu
  • Ludovico Boratto
  • Gianni Fenu
  • Francesca Maridina Malloci
  • Mirko Marras
  • Andrea Giovanni Martis

Ontologies form the core of knowledge graphs, which act as faithful, semantic-rich sources for training models in delivering explainable recommendations. These models learn to extract logical paths between learners and resources to be recommended within the knowledge graph, according to behavior- and content-based patterns. Extracted paths are then used not only to provide recommendations, but also to generate accompanying textual explanations. Despite the potential of this approach, current ontologies derived from the traditional learner-resource interaction data fall short in terms of richness from an educational perspective. Conversely, general-purpose ontologies, while comprehensive in educational aspects, are overly complex for recommendation tasks. Unfortunately, a suboptimal ontology might prevent to articulate reasoning paths, and thus explanations, relevant for learners within the knowledge graph. To counter this limitation, in this paper, we propose LOXER, a novel ontology designed to unlock learner-centered logical paths for explainable educational recommendation. Our design integrates insights from diverse sources, including feedback from a local co-design group of learners, observations from specialized traditional large-scale educational recommendation datasets, and connections with well-known vocabularies of other existing ontologies. To validate our ontology, we conducted an evaluation of the explanation types it enables, involving university and lifelong learners and assessing explanation properties like effectiveness, decision-making speed, motivation, satisfaction, and confidence. Results show our ontology’s ability to foster diverse considerations during the learners’ decision-making process and to establish a semantic structure for knowledge graphs for explainable recommendation.

LLMs for Knowledge Modeling: NLP Approach to Constructing User Knowledge Models for Personalized Education

  • Diana Domenichini
  • Filippo Chiarello
  • Vito Giordano
  • Gualtiero Fantoni

This study proposes a method for developing a user knowledge model based on their past learning experiences. The focus is on analyzing academic data, particularly lesson records, to extract information about educational concepts. The ultimate goal is to construct a comprehensive profile that reflects the user’s accumulated knowledge throughout their learning journey. Two distinct methods are introduced for concept extraction: a gazetteer-based Named Entity Recognition approach and prompt engineering using ChatGPT. The effectiveness of these methods is assessed through a case study involving a graduate student at the University of Pisa. These knowledge profiles hold significant relevance in today’s educational landscape. With the prevalence of lifelong learning, individuals from diverse academic backgrounds participate in professional development courses. This diversity in past learning experiences can pose a challenge for instructors and course designers who must adapt lessons to be understandable and engaging for an audience with heterogeneous knowledge bases. The analysis of academic data offers a systematic approach to modeling each individual’s acquired knowledge. This, in turn, facilitates the personalization of learning content and pathways based on students’ unique learning experiences. The outcome is an inclusive learning environment that caters to the specific needs of each participant, thereby promoting compelling and stimulating learning experiences.

Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights

  • Giorgio Biancini
  • Alessio Ferrato
  • Carla Limongelli

Integrating Artificial Intelligence (AI) in educational settings has brought new learning approaches, transforming the practices of both students and educators. Among the various technologies driving this transformation, Large Language Models (LLMs) have emerged as powerful tools for creating educational materials and question answering, but there are still space for new applications. Educators commonly use Multiple-Choice Questions (MCQs) to assess student knowledge, but manually generating these questions is resource-intensive and requires significant time and cognitive effort. In our opinion, LLMs offer a promising solution to these challenges. This paper presents a novel comparative analysis of three widely known LLMs – Llama 2, Mistral, and GPT-3.5 – to explore their potential for creating informative and challenging MCQs. In our approach, we do not rely on the knowledge of the LLM, but we inject the knowledge into the prompt to contrast the hallucinations, giving the educators control over the test’s source text, too. Our experiment involving 21 educators shows that GPT-3.5 generates the most effective MCQs across several known metrics. Additionally, it shows that there is still some reluctance to adopt AI in the educational field. This study sheds light on the potential of LLMs to generate MCQs and improve the educational experience, providing valuable insights for the future.

Transparent Learner Knowledge State Modeling using Personal Knowledge Graphs and Graph Neural Networks

  • Rawaa Alatrash
  • Mohamed Amine Chatti
  • Qurat Ul Ain
  • Shoeb Joarder

Learner modeling is pivotal in different applications of adaptive and personalized systems in the educational domain, such as recommender systems and intelligent tutoring systems. However, how learner models are inferred and used in the these systems are often not transparent to learners. In many cases, learner models are presented as a black-box, where learners have no means to control or modify their models. To address these issues, in this paper, we present an innovative approach to learner modeling, particularly focusing on modeling learners’ knowledge states. To this end, we combine Personal Knowledge Graphs (PKGs), Graph Convolutional Networks (GCNs), and transformer sentence encoders (SBERT) to construct a transparent learner model. Specifically, we explicitly involve learners in modeling their knowledge state by enabling them to mark concepts as ’Did Not Understand’ (DNU) in the MOOC platform CourseMapper. This results in the construction of a user-controllable and scrutable PKG for the learner, thus increasing the transparency of the learner modeling process. Furthermore, we leverage GCNs and SBERT to model the learner knowledge state based on an enhanced representation of their DNUs. In this way, we provide a simple yet effective method for learner modeling which can be used to improve performance in downstream tasks, such as adaptive systems, recommendation, and personalized search.

SESSION: WeBIUM 2024: 1stWorkshop onWearable Devices and Brain-Computer Interfaces for User Modelling

Wearable Devices and Brain-Computer Interfaces for User Modelling (WeBIUM)

  • Tommaso Colafiglio
  • Tommaso Di Noia
  • Domenico Lofù
  • Angela Lombardi
  • Fedelucio Narducci
  • Paolo Sorino

Wearable Devices (WDs), encompassing a spectrum from smartwatches to fitness trackers, continuously furnish a wealth of physiological and activity-related data. This trove of information facilitates the creation of robust user models, offering a dynamic lens into users’ daily lives, health patterns, and interaction behaviours. Furthermore, the integration of Brain-Computer Interfaces (BCIs), directly interfacing with neural signals, presents a distinctive vantage point into cognitive processes and emotional states. This integration enriches user models, providing a profound understanding of mental states and engagement levels. While BCIs cannot be strictly categorized as WDs, the latest hardware developments are reaching a size comparable to earphones, anticipating their wearable integration in the near future. However, the exploitation of data from these devices for user modelling and profiling, enhancing personalized activities such as music listening and movie watching, remains an area ripe for exploration.

This workshop proposes an in-depth exploration of the transformative impact that data from WDs and BCIs can exert on user modelling. This initiative seeks to pave the way for a nuanced comprehension of individual preferences, cognitive states, and overall user experiences. The workshop invites researchers, practitioners, and enthusiasts to the convergence of wearable technology, BCIs, and user modelling. Through interactive sessions and discussions, participants will delve into the methodologies, challenges, and opportunities associated with harnessing data from these innovative sources. By fostering collaboration and facilitating knowledge exchange, the workshop aims to propel the current understanding of user modelling by exploiting WDs and BCIs. Attendees will acquire insights into cutting-edge research findings, practical applications, and potential future developments within this rapidly evolving field. Ultimately, the workshop aspires to inspire new research directions, catalyze interdisciplinary collaborations, and cultivate innovative solutions that leverage the synergy between wearable technologies and BCIs to elevate the field of user modelling to unprecedented heights.

ARIEL: Brain-Computer Interfaces meet Large Language Models for Emotional Support Conversation

  • Paolo Sorino
  • Giovanni Maria Biancofiore
  • Domenico Lofù
  • Tommaso Colafiglio
  • Angela Lombardi
  • Fedelucio Narducci
  • Tommaso Di Noia

In an era characterized by unprecedented virtual connectivity, paradoxically, individuals often find themselves disconnected from genuine human interactions. The advent of remote working arrangements, compounded by the influence of digital communication platforms, has fostered a sense of isolation among people. Consequently, the prevailing socio-technological landscape has underscored the critical need for innovative solutions to address the emotional void. Conversational systems help people improve their everyday tasks with informative dialogues, and recent applications employ them to target emotional support conversation tasks. Nevertheless, their understanding of human feelings is limited, as they depend solely on information discernible from the text or the users’ emotional declarations. Recently, Brain-Computer Interfaces (BCIs), devices that analyze electroencephalographic (EEG) signals, have increasingly become popular given their minimally invasive nature and low cost, besides enabling the detection of users’ emotional states reliably. Hence, we propose ARIEL, an emotionAl suppoRt bcI dEvices and Llm-based conversational agent that aims at supporting users’ emotional states through conversations and monitoring them via BCI. In this way, it is possible to comprehend the users’ feelings reliably, thus making the conversational agent aware of users’ emotional evolution during conversations. Our framework makes the LlaMA 2 chat model communicate with an emotion recognition BCI-based system to achieve the emotional support conversation goal. Also, we present a controlled running example that shows the potential of our model and its effective functioning, made possible by a wisely designed hard-prompt strategy. In the future, we will conduct an in-vivo experiment to evaluate the system and its components.

Assessing Human Visual Attention in Retail Human-Robot Interaction: A YOLOv8-Nano and Eye-Tracking Approach

  • Kamlesh Kumar
  • Yuhao Chen
  • Boyi Hu
  • Yue Luo

Objectives: This research delves into the dynamics of human-robot interaction (HRI) in retail environments, with a focus on robot detection from videos captured via an eye-tracking system. Methods: The study employs YOLOv8-nano model for real-time robot detection during grocery shopping tasks. All videos were processed using the YOLOv8 model to test inference speed while performing eye-tracking data analysis as a case study. Results: The YOLOv8 model demonstrated high precision in robot detection, with a mean average precision (mAP) of approximately 97.3% for Intersection over Union (IoU), 100% precision, and 99.87% recall for box detection. The model’s ability to process an average of 160.36 frames per second (FPS) confirmed its suitability for real-time applications. In the case study on the impact of a robot’s presence on human eye movements, the presence of a robot contributes to greater consistency in gaze fixation behavior, potentially leading to more predictable patterns of visual attention. Conclusion: The study’s findings contribute significantly to the design of safer and more efficient cobot systems. They provide a deeper understanding of human responses in real-world scenarios, which is crucial for the development of effective HRI systems.

EmoSynth Real Time Emotion-Driven Sound Texture Synthesis via Brain-Computer Interface

  • Tommaso Colafiglio
  • Domenico Lofù
  • Paolo Sorino
  • Angela Lombardi
  • Fedelucio Narducci
  • Fabrizio Festa
  • Tommaso Di Noia

In electroacoustic music composition, particularly in sound synthesis techniques, Deep Learning (DL) provides very effective solutions. However, these architectures generally have a high level of automation and use textual language for human interaction. To improve the relationship between composers and artificial intelligence systems, brain-computer interfaces (BCIs) are an effective and direct systems, which have led to considerable improvements in this area. The proposed system employs emotion recognition through electroencephalogram (EEG) signals to control four Variational Autoencoders (VAE) that generate new sound textures. A dataset was acquired using the MUSE2 headset to train four Machine Learning (ML) models capable of classifying human emotions based on Russell’s circumplex model. VAEs were trained to produce different sound variations from an audio dataset that allows composers to integrate their sounds. In addition, a graphical user interface (GUI) was developed to facilitate the real-time generation of sound textures, with the support of an external MIDI controller. This GUI continuously provides visual information about the detected emotions and the activity of the left and right brain hemispheres.

Exploring Federated Learning for Emotion Recognition on Brain-Computer Interfaces

  • Sofia Mongardi
  • Pietro Pinoli

In recent years, data-driven methodologies based on artificial intelligence (AI) technologies are taking the lead in many biomedical fields, including the interpretation of electroencephalogram (EEG) experiments collected by wereable Brain Computer Interfaces (BCI). Yet, the effectiveness of data in medical research hinges significantly on its volume and diversity. Traditional biomedical research faces constraints due to limited access to datasets often confined within individual medical centers, hindering potential breakthroughs. Reluctance to share patient data across institutions is driven by ethical, legal, and privacy concerns, with laws like GDPR rigorously protecting against privacy breaches. Overcoming these obstacles requires a paradigm shift towards decentralized AI training procedures, enabling secure and efficient usage of sensible data. In this work, we examine and evaluate the potential of federated learning for the task of emotion recognition from BCI data, focusing on performance with respect to centralized approaches. Our focus lies on comparing its performance against centralized approaches, delving into key metrics such as accuracy, efficiency, and privacy preservation.

Exploring the Usability and Trustworthiness of AI-Driven User Interfaces for Neurological Diagnosis

  • Angela Lombardi
  • Sofia Marzo
  • Tommaso Di Noia
  • Eugenio Di Sciascio
  • Carmelo Ardito

This study explores the application of Artificial Intelligence (AI) in the diagnosis of Mild Cognitive Impairment (MCI) and Alzheimer’s Disease (AD), through Human-Computer Interaction (HCI), Human-Centered AI (HCAI), and Explainable AI (XAI). It evaluates three user interfaces designed to integrate AI insights with the clinical understanding of neurologists, aiming to refine diagnostic processes. Neurology professionals were involved to gauge their knowledge and confidence in the AI-supported diagnoses. Utilizing a remotely administered questionnaire, this research investigates clinicians’ views on XAI outputs, focusing on how results are visualized and their ability to engender trust in AI’s clinical utility. This method emphasizes the importance of clear, trustworthy AI systems in healthcare and underscores the essential role of effective human-AI collaboration in enhancing patient care and diagnostic precision.