Shruti Bhargava

 

I am a Senior ML Research Engineer working on conversational AI in Apple's Siri Core Modeling team, where I develop advanced language understanding systems and LLM response strategies. I completed my Masters in Computer Science from the University of Illinois at Urbana-Champaign (UIUC), advised by Prof. David Forsyth, and my Bachelors from IIT Kanpur, India.

Shruti Bhargava Profile Photo
Apple Inc.
Research Engineer
Apple Inc.
2019 - Present
University of Illinois
MS, CS
UIUC
2017 - 2019
Apple Inc. Internship
Intern
Apple Inc.
2018 Summer
Coordinated Science Laboratory
Research Assistant
CSL
2017 Fall
Microsoft Research
Intern
Microsoft Research
2016 Summer
Max Planck Institute
Intern
Max Planck Institute
2015 Summer
IIT Kanpur
BTech, CS
IIT Kanpur
2013 - 2017

Patents

Publications

SynthDST visualization

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking
Atharva Kulkarni, Bo-Hsiang Tseng, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Hong Yu, Shruti Bhargava
EACL (Oral) 2024

Data generation framework that can efficiently generate synthetic data for dialogue schemas using countable templates. This bridges the gap between zero-shot and training data based few-shot prompting for dialog state tracking with LLMs.

Context LLM visualization

Can Large Language Models Understand Context?
Yilun Zhu, Joel Ruben Antony Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng
EACL Findings 2024

A benchmark by adapting four tasks and nine existing datasets, featuring prompts designed to assess the context-understanding abilities of LLMs. In the ICL setting, models struggle with understanding nuanced contextual signals compared to SOTA fine-tuned models. Assessment of quantized models provides promising insights on the 3-bit post-training quantization.

ScreenRef visualization

Referring to Screen Texts with Voice Assistants
Shruti Bhargava, Anand Dhoot, Ing-Marie Jonsson, Hoang Long Nguyen, Alkesh Patel, Hong Yu, Vincent Renkens
ACL Industry Track 2023

Novel experience for users to refer to data-detectable entities on their phone screens when interacting with voice assistants. Screen reference resolution data strategy and a lightweight, general-purpose model that only uses the text extracted from the UI. The proposed model is modular, offering flexibility, better interpretability, and efficient run-time performance.

CREAD visualization

CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues
Bo-Hsiang Tseng, Shruti Bhargava, Jiarui Lu, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Lin Li, Hong Yu
NAACL 2021
[code]

Resolving references and understanding ellipses are crucial for dialogue agents to generate coherent responses. A joint benchmark for the two tasks by annotating the dialogue-based coreference dataset, MuDoCo, with rewritten queries. A novel joint learning framework that boosts query rewrite and outperforms SOTA for coreference resolution.

Dialog State Tracking visualization

Conversational semantic parsing for dialog state tracking
Jianpeng Chen, ... , Shruti Bhargava, ... , Jason D Williams, Hong Yu, Diarmuid O Seaghdha, Anders Johannsen
EMNLP 2020
[Dataset]

Fresh perspective on dialog state tracking as a semantic parsing task over hierarchical representations, with compositionality, cross-domain knowledge sharing, and coreference. We present TreeDST, a dataset of 27k conversations with tree-structured states and system acts. Our encoder-decoder model leads to a 20% improvement over SOTA.

Gender debiasing visualization

Exposing and Correcting the Gender Bias in Image Captioning Datasets and Models
Shruti Bhargava, David Forsyth
arxiv 2019

The task of image captioning implicitly involves gender identification. MS COCO dataset contains blatant gender bias in captions, arising from two main sources: statistical variation in data and flawed annotations. Biased data leads to concerning predictions by models. We propose a novel framework for gender-neutral captioning and independent gender classification using masking, reducing contextual bias. On an anti-stereotypical dataset, our approach outperforms the SOTA gender-based approaches.

Dandelion++ visualization

Dandelion++ lightweight cryptocurrency networking with formal anonymity guarantees
Giulia Fanti, Shaileshh Bojja Venkatakrishnan, Surya bakshi, Bradley Denby, Shruti Bhargava, Andrew Miller, Pramod Viswanath
SIGMETRICS 2018
[code]

Bitcoin's networking stack is shown to have anonymity vulnerabilities owing to the mechanism for broadcasting transactions, leading to large-scale deanonymization attacks. We present Dandelion++, a first-principles defense with near-optimal information-theoretic guarantees.

Mentorship Experience

Awards and Scholarships

Teaching

I have served as a Teaching Assistant for graduate and undergraduate courses spanning optimization, machine learning, algorithms, and programming:

Academic Research

During my academic journey, I had the opportunity to collaborate with inspiring researchers across leading institutions.

[Web Cite]