Publications 

This is a list of my publications from the last 5 years. For a full list, please see Google Scholar.


2024

Is Distance a Modality? Multilabel Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environment

E Fringi, N Alareef, L Picinali, S Brewster, T Guha and A Vinciarelli

ACM  ICMI  2024


WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting

O Styles, S Miller, P Cerda-Mardini, T Guha, V Sanchez and B Vidgen

COLM  2024

PDF           Code


Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System

B Alsenani, A Esposito, A Vinciarelli and T Guha

ECAI  2024


2023

Explainable Depression Detection via Head Motion Patterns

M Gahalawat, R F Rojas, T Guha, R Subramanian and R Goecke

ACM  ICMI  2023

PDF 


Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack

B Alsenani, T Guha and A Vinciarelli

INTERSPEECH  2023

PDF 


Heterogeneous Graph Learning for Acoustic Event Classification

A Shirian, M Ahmadian, K Somandepalli and T Guha

IEEE  ICASSP  2023

PDF           Code


Robust Multiview Multimodal Driver Monitoring System using Masked Multihead Self Attention

Y Ma, V Sanchez, S Nikan, D Upadhyay, B Atote and T Guha

IEEE/CVF  CVPR  Workshops  2023

PDF           Code



2022

Multi-camera Trajectory Forecasting with Trajectory Tensors

O Styles, T Guha and V Sanchez

IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44(11), pp. 8482-8491, November 2022

PDF          Code


Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

A Shirian, K Somandepalli and T Guha

IEEE Journal of Selected Topics in Signal Processing, vol. 16(6), pp. 1391-1401, October 2022

PDF         Code


Dynamic Emotion Modeling with Learnable Graphs and Graph Inception Network

A Shirian, S Tripathi and T Guha

IEEE Transactions on Multimedia, vol. 24, pp. 780-790, February 2022

PDF          Code


Learning Long-Term Spatio-Temporal Graphs for Active Speaker Detection

K Min, S Roy, S Tripathi, T Guha and S Majumdar

ECCV  2022

PDF          Code


FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion

Y Ma, T Guha and V Sanchez

IEEE  ICIP  2022

PDF 


Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition

J Liao, T Guha and V Sanchez

IEEE  ICIP  2022

PDF


Visually-aware Acoustic Event Detection using Heterogeneous Graphs

A Shirian, K Somandepalli, V Sanchez and T Guha

INTERSPEECH  2022

PDF        Code


Graph-based Transform based on 3D Convolutional Neural Network for Intraprediction of Imaging Data

D Roy, T Guha and V Sanchez

IEEE  DCC  2022

2021

Computational Media Intelligence: Human-centered Machine Analysis of Media

K Somandepalli, T Guha, V Martinez, N Kumar, H Adam and S S Narayanan

Proceedings of the IEEE, vol. 109(5), pp. 891-910, May 2021

PDF

Emotion Sensing from Head Motion Capture

A Samanta and T Guha

IEEE Sensors Journal, vol. 21(4), pp. 5035-5043, February 2021

PDF 

In Defense of Scene Graphs for Image Captioning

K Nguyen, S Tripathi, B Du, T Guha and T Nguyen

IEEE/CVF  ICCV  2021

PDF         Code


Head Matters: Explainable Human-centered Trait Prediction from Head Motion Dynamics

S Madan, M Gahalawat, T Guha and R Subramanian

ACM  ICMI  2021

PDF


Towards Autism Screening through Emotion-guided Eye Gaze Response

S Ghosh and T Guha

IEEE  EMBC  2021

PDF


Compact Graph Architecture for Speech Emotion Recognition

A Shirian and T Guha

IEEE  ICASSP  2021

PDF         Code

GBT based on Graph Neural Network for Predictive Transform Coding

D Roy, T Guha and V Sanchez

IEEE  DCC  2021  (Extended abstract)

2020

Dynamic Character Graph via Online Face Clustering for Movie Analysis

P Kulshreshtha and T Guha

Multimedia Tools and Applications, vol. 79(43-44), pp. 33103-33116, November 2020

PDF  

Attention-selective Network for Face Synthesis and Pose-invariant Face Recognition

J Liao, A Kot, T Guha and V Sanchez

IEEE  ICIP  2020

Multi-camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras

O Styles, T Guha, V Sanchez and A Kot

IEEE/CVF  CVPR  Workshops  2020

PDF         Database         **Best student paper award**

Ensemble Network for Ranking Images based on Visual Appeal

S Singh, V Sanchez and T Guha

IEEE  ICASSP  2020

PDF         Code

Variational Recurrent Sequence-to-Sequence Retrieval for Stepwise Illustration

V Batra, A Halder, Y He, G Vogiatzis, H Ferhatosmanoglu and T Guha

ECIR  2020

PDF

Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments

O Styles, T Guha and V Sanchez

IEEE  WACV  2020

PDF         Data & Code

Coordinated Joint Multimodal Embeddings for Generalized Audio-visual Zero-shot Classification

K K Parida, N Matiyali, T Guha and G Sharma

IEEE  WACV  2020

PDF         Data & Code



2019 and Older

Please see Google Scholar.

Notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. In most cases, these works may not be reposted or reproduced in any way, in whole or in part, without explicit permission of the copyright holder.