Publications

This is a list of my publications from the last five years. For a full list, please see Google Scholar.

2025

Self-supervised Random Mask Attention GAN in Tackling Pose-invariant Face Recognition

J Liao, T Guha and V Sanchez

Pattern Recognition, vol. 159, March 2025

PDF

Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics

S Madan, M Gahalawat, T Guha, R Goecke and R Subramanian

PLOS One, vol. 20(1), e0313883, January 2025

PDF

Robust understanding of human-robot social interactions through multimodal distillation

T Bian, M Chollet and T Guha

ACM MM 2025

PDF

Analyzing character representation in media content using multimodal foundation model: Effectiveness and trust

E Taka, D Bhattacharya, J Garde-Hansen, S Sharma and T Guha

ACM ICMI 2025

PDF

Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions

T Bian, Y Ma, M Chollet, V Sanchez and T Guha

IEEE ICME 2025

PDF

CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification

Y Ma, V Sanchez and T Guha

IEEE ICME 2025

PDF

Boosting Tiny Face Detection in Video with an Integral Score Framework

R Leyva, S Guodong, O Bahadir, V Sanchez and T Guha

IEEE FG 2025

Active Listener: Continuous Generation of Listener's Head Motion Response in Dyadic Interactions

B Ghosh, E Li and T Guha

IEEE ICASSP 2025

PDF

2024

NAPE: Numbering As a Position Encoding in Graphs

O Ajayi, H Wen and T Guha

IEEE Access, vol. 12, pp. 166200-166210, November 2024

PDF

On the Effects of Obfuscating Speaker Attributes in Privacy-Aware Depression Detection

N Aloshban, A Esposito, A Vinciarelli and T Guha

Pattern Recognition Letters, vol. 186, pp. 300-305, October 2024

PDF

Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System

B Alsenani, A Esposito, A Vinciarelli and T Guha

ECAI 2024

PDF Code

Detecting In-car VR Motion Sickness from Lower Face Action Units

G Li, T Guha, O Onuoha, Z Qiu, A Grant, Z Feng, Z Zhang, K Pohlmann, M McGill, S Brewster and F Pollick

IEEE ISMAR 2024

PDF

WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting

O Styles, S Miller, P Cerda-Mardini, T Guha, V Sanchez and B Vidgen

COLM 2024

PDF Code

Is Distance a Modality? Multilabel Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environment

E Fringi, N Alareef, L Picinali, S Brewster, T Guha and A Vinciarelli

ACM ICMI 2024

PDF

2023

Explainable Depression Detection via Head Motion Patterns

M Gahalawat, R F Rojas, T Guha, R Subramanian and R Goecke

ACM ICMI 2023

PDF

Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack

B Alsenani, T Guha and A Vinciarelli

INTERSPEECH 2023

PDF

Heterogeneous Graph Learning for Acoustic Event Classification

A Shirian, M Ahmadian, K Somandepalli and T Guha

IEEE ICASSP 2023

PDF Code

Robust Multiview Multimodal Driver Monitoring System using Masked Multihead Self Attention

Y Ma, V Sanchez, S Nikan, D Upadhyay, B Atote and T Guha

IEEE/CVF CVPR Workshop 2023

PDF Code

2022

Multi-camera Trajectory Forecasting with Trajectory Tensors

O Styles, T Guha and V Sanchez

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44(11), pp. 8482-8491, November 2022

PDF Code

Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

A Shirian, K Somandepalli and T Guha

IEEE Journal of Selected Topics in Signal Processing, vol. 16(6), pp. 1391-1401, October 2022

PDF Code Video presentation

Dynamic Emotion Modeling with Learnable Graphs and Graph Inception Network

A Shirian, S Tripathi and T Guha

IEEE Transactions on Multimedia (TMM), vol. 24, pp. 780-790, February 2022

PDF Code

Learning Long-Term Spatio-Temporal Graphs for Active Speaker Detection

K Min, S Roy, S Tripathi, T Guha and S Majumdar

ECCV 2022

PDF Code

FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion

Y Ma, T Guha and V Sanchez

IEEE ICIP 2022

PDF

Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition

J Liao, T Guha and V Sanchez

IEEE ICIP 2022

PDF

Visually-aware Acoustic Event Detection using Heterogeneous Graphs

A Shirian, K Somandepalli, V Sanchez and T Guha

INTERSPEECH 2022

PDF Code

Graph-based Transform based on 3D Convolutional Neural Network for Intraprediction of Imaging Data

D Roy, T Guha and V Sanchez

IEEE DCC 2022

2021

Computational Media Intelligence: Human-centered Machine Analysis of Media

K Somandepalli, T Guha, V Martinez, N Kumar, H Adam and S S Narayanan

Proceedings of the IEEE (PIEEE), vol. 109(5), pp. 891-910, May 2021

PDF

Emotion Sensing from Head Motion Capture

A Samanta and T Guha

IEEE Sensors Journal, vol. 21(4), pp. 5035-5043, February 2021

PDF

In Defense of Scene Graphs for Image Captioning

K Nguyen, S Tripathi, B Du, T Guha and T Nguyen

IEEE/CVF ICCV 2021

PDF Code

Head Matters: Explainable Human-centered Trait Prediction from Head Motion Dynamics

S Madan, M Gahalawat, T Guha and R Subramanian

ACM ICMI 2021

PDF

Towards Autism Screening through Emotion-guided Eye Gaze Response

S Ghosh and T Guha

IEEE EMBC 2021

PDF

Compact Graph Architecture for Speech Emotion Recognition

A Shirian and T Guha

IEEE ICASSP 2021

PDF Code

GBT based on Graph Neural Network for Predictive Transform Coding

D Roy, T Guha and V Sanchez

IEEE DCC 2021

2020 and Older

Please see Google Scholar.

Notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. In most cases, these works may not be reposted or reproduced in any way, in whole or in part, without explicit permission of the copyright holder.