Publications 

This is a list of  my publications in the last 6 years. For a full list please see  Google Scholar.  


2023

Explainable Depression Detection via Head Motion Patterns

M Gahalawat, R F Rojas, T Guha, R Subramanian and R Goecke

ACM Int. Conf. on Multimodal Interaction (ICMI), Paris, France, October 2023.


Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack

B Alsenani, T Guha and A Vinciarelli

INTERSPEECH, Dublin, Ireland, August 2023.

PDF 


Heterogeneous Graph Learning for Acoustic Event Classification

A Shirian, M Ahmadian, K Somandepalli and T Guha

Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Rhodes, Greece, June 2023.

PDF           Code


Robust Multiview Multimodal Driver Monitoring System using Masked Multihead Self Attention

Y Ma, V Sanchez, S Nikan, D Upadhyay, B Atote and T Guha

Int. Conf. on Computer Vision and Pattern Recognition Workshops (CVPR-W), Vancouver, Canada, June 2023.

PDF           Code



2022

Multi-camera Trajectory Forecasting with Trajectory Tensors

O Styles, T Guha and V Sanchez

IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44(11), pp. 8482-8491, November 2022. 

PDF          Code


Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

A Shirian, K Somandepalli and T Guha

IEEE Journal of Selected Topics in Signal Processing, vol. 26(6), pp. 1391-1401, October 2022.

PDF         Code


Dynamic Emotion Modeling with Learnable Graphs and Graph Inception Network

A Shirian, S Tripathi and T Guha

IEEE Transactions on Multimedia, vol. 24, pp. 780-790, February 2022

PDF          Code


Learning Long-Term Spatio-Temporal Graphs for Active Speaker Detection

K Min, S Roy, S Tripathi, T Guha and S Majumdar

European Conf. on Computer VIsion (ECCV), Tel Aviv, Israel, October 2022. 

PDF          Code


FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion

Y Ma, T Guha and V Sanchez

Int. Conf. on Image Processing (ICIP), Bordeaux, France, October 2022. 

PDF 


Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition

J Liao, T Guha and V Sanchez

Int. Conf. on Image Processing (ICIP), Bordeaux, France, October 2022. 

PDF


Visually-aware Acoustic Event Detection using Heterogeneous Graphs

A Shirian, K Somandepalli, V Sanchez and T Guha

INTERSPEECH, Incheon, Korea, September 2022. 

PDF        Code


Graph-based Transform based on 3D Convolutional Neural Network for Intraprediction of Imaging Data

D Roy, T Guha and V Sanchez

Data Compression Conference (DCC), Snowbird, US, March 2022

2021

Computational Media Intelligence: Human-centered Machine Analysis of Media

K Somandepalli, T Guha, V Martinez, N Kumar, H Adam and S S Narayanan

Proceedings of the IEEE, vol. 109(5), pp. 891 - 910, May 2021. 

PDF

Emotion Sensing from Head Motion Capture

A Samanta and T Guha

IEEE Sensors Journal, vol. 21(4), pp. 5035 - 5043, February 2021.

PDF 

In Defense of Scene Graphs for Image Captioning

K Nguyen, S Tripathi, B Du, T Guha and T Nguyen

Int. Conf. on Computer Vision (ICCV), Virtual, October 2021.

PDF         Code


Head Matters: Explainable Human-centered Trait Prediction from Head Motion Dynamics

S Madan, M Gahalawat, T Guha and R Subramanian

ACM Int. Conf. on Multimodal Interaction (ICMI), Virtual, October 2021.

PDF


Towards Autism Screening through Emotion-guided Eye Gaze Response

S Ghosh and T Guha

Int. Conf. on IEEE Engineering in Medicine and Biology Society (EMBC), Virtual, October 2021.

PDF


Compact Graph Architecture for Speech Emotion Recognition

A Shirian and T Guha

Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Virtual, June 2021.

PDF         Code

GBT based on Graph Neural Network for Predictive Transform Coding

D Roy, T Guha and V Sanchez

Data Compression Conference (DCC), Virtual, March 2021. (Extended abstract)

2020

Dynamic Character Graph via Online Face Clustering for Movie Analysis                                                           

P Kulshreshtha and T Guha

Multimedia Tools and Applications, vol. 79(43-44), pp. 33103 - 33116, November 2020.

PDF  

Attention-selective Network for Face Synthesis and Pose-invariant Face Recognition

J Liao, A Kot, T Guha and V Sanchez

Int. Conf. on Image Processing (ICIP), Virtual, October 2020.

Multi-camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras

O Styles, T Guha, V Sanchez and A Kot

Int. Conf. on Computer Vision and Pattern Recognition Workshops (CVPR-W), Virtual, June 2020.

PDF         Database         **Best student paper award**

Ensemble Network for Ranking Images based on Visual Appeal

S Singh, V Sanchez and T Guha

Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Virtual, May 2020.

PDF         Code

Variational Recurrent Sequence-to-Sequence Retrieval for Stepwise Illustration

V Batra, A Halder, Y He, G Vogiatzis, H Ferhatosmanoglu and T Guha

European Conference on Information Retrieval (ECIR), Lisbon, Portugal, April 2020.

PDF

Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments

O Styles, T Guha and V Sanchez

Winter Conf. on Applications of Computer Vision (WACV), Aspen, US, March 2020. 

PDF         Data & Code

Coordinated Joint Multimodal Embeddings for Generalized Audio-visual Zero-shot Classification

K K Parida, N Matiyali, T Guha and G Sharma

Winter Conf. on Applications of Computer Vision (WACV), Aspen, US, March 2020. 

PDF         Data & Code


2019


Dirichlet Latent Variable Model: A Dynamic Model based on Dirichlet Prior for Audio Processing

A Kumar, T Guha and P K Ghosh

IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27(5), pp. 919-931, March 2019.

PDF


Motion-capture Patterns of Voluntarily Mimicked Dynamic Facial Expressions in Children and Adolescents with and without ASD

E Zane, Z Yang, L Pozzan, T Guha, S S Narayanan and R Grossman 

Journal of Autism and Developmental Disorder, vol. 49(3), pp. 1062 - 1079, March 2019.

PDF


Learning Affective Correspondence between Music and Image

G Verma, E G Dhekane and T Guha

Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, May 2019.

PDF        Database


Computational Analysis of Gaze Behavior in Autism during Interaction with Virtual Agents

Z Akhtar and T Guha

Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, May 2019.

PDF


GBT with Weighted Self-loops for Predictive Transform Coding based on Template Matching

D Roy, T Guha and V Sanchez

Data Compression Conference (DCC), Snowbird, US, March 2019.

PDF


2018


A Computational Study of Expressive Facial Dynamics in Children with Autism

T Guha, Z Yang, R Grossman and S S Narayanan 

IEEE Transactions on Affective Computing, vol. 9 (1), pp. 14-20, Jan-March 2018.

PDF


Unsupervised Discovery of Character Dictionaries in Animation Movies

K Somandepalli, N Kumar, T Guha and S S Narayanan

IEEE Transactions on Multimedia, vol. 20 (3), pp. 539-551, 2018.

PDF


Learning Spontaneity to Improve Emotion Recognition in Speech

K Mangalam and T Guha

INTERSPEECH, Hyderabad, India, September 2018.

PDF


An Online Algorithm for Constrained Face Clustering in Videos

P Kulshreshtha and T Guha

Int. Conf. on Image Processing (ICIP), Athens, Greece, October 2018.

PDF        Code


A Dynamic Latent Variable Model for Source Separation

A Kumar, T Guha and P K Ghosh 

Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, April 2018.

PDF


Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking

R Sharma, T Guha and G Sharma 

Winter Conf. on Applications of Computer Vision (WACV), Lake Tahoe, US, March 2018.

PDF 


2017 and Older

Please see Google Scholar

Notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. In most cases, these works may not be reposted or reproduced in any way, in whole or in part, without explicit permission of the copyright holder.