I am Anuj Diwan (अनुज दिवाण), a Computer Science Ph.D. student at the University of Texas at Austin. My research interests lie broadly in Speech Recognition, Natural Language Processing, and Machine Learning.
Visit my website to learn more!
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
Code for 'Textless Speech-to-Speech Translation With Limited Parallel Data'
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
Code for 'When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants', ACL 2023