Experience
- Applied Research Scientist, Thomson Reuters Lab, India
- Period: August 2024 - Current
- Legal AI Reasoning and Model Enhancement:
- Pioneered a legal synthetic data generation pipeline for Process Reward Models (PRMs), creating domain-specific training datasets that improved legal reasoning capabilities on LegalBench benchmark tasks.
- Evaluated different Test-Time Scaling paradigms for legal reasoning and optimizing inference-time computation.
- Architected an IRAC (Issue-Rule-Application-Conclusion) Knowledge Graph framework using Thomson Reuters’ Westlaw corpus and court case data, generating high-quality preference datasets that improved legal reasoning alignment in fine-tuned LLMs.
- AI-Powered Legal Document Update System:
- Built a comprehensive end-to-end LLM Workflow pipeline to update Word Document with XML parsing and intelligent contextual mapping for paragraph identification and edit, achieving 70%+ success rate while preserving complete document formatting integrity.
- Engineered human-in-the-loop validation interface with reasoning chains and alert-point mapping, resulting in 60% reduction in manual processing time while maintaining legal accuracy through strategic human oversight and audit trails.
- DocEvolver: Created an MVP for “Cursor for Word” as an Extension for updating and understanding MS Word Files for Lawyer-Editors
- Search-and-Replace Agentic System:
- Architected a multi-agent AI system for automated Word editing with comprehensive validation pipeline (schema enforcement, content integrity, audit logging), achieving 98% accuracy, resulting 65% reduction in manual content revision.
- Constructed error-resolving agents with function calling and multi-turn reasoning to fix XML issues, implementing few-shot learning and self-healing mechanisms, resulting in widespread adoption across teams, processing hundreds of documents monthly with sub-8 second processing time per section.
- Additional Tools:
- Truth Social Monitor: Created a monitoring system for Trump’s Truth Social posts with sub-3 second latency, generating automated alerts for Reuters journalists
- Page Flipper: Revived version of the Page Monitor extension for website tracking, eliminating Visual Ping subscriptions for the team.
- These tools provided Reuters with a critical competitive advantage over competitors.
- Research Intern, Microsoft Research India
- Period: January 2024 - July 2024
- Programming with Representations (PwR):
- Led backend development for Microsoft’s PwR Studio platform, focusing on the Natural Language to Domain Specific Language (NL2DSL) translation system using GPT 3.5 & 4, and developed a symbolic translation pipeline that generates finite state machines structured as custom DSL, achieving 85% reduction in hallucinations.
- Formulated Rubrics and Evaluations with error correction over DSL which resulted in valid DSL generation increasing from 65% to 95%.
- Jugalbandi-Studio-Engine:
- Architected Python-based platform that converts DSL into scalable finite-state-machine-based chatbot applications. This framework reduced development time by 80% and increased accessibility for government agencies and NGOs.
- The platform was featured in Satya Nadella’s keynote talks, and I represented Microsoft Research in the pilot project, enabling 15+ nontech organizations to develop AI-powered conversational bots.
- Jugalbandi(JB) Manager:
- Established a chatbot management platform supporting multiple channels (WhatsApp, Telegram, Web) with multilingual text and voice capabilities; integrated Bhashini Speech models with Azure service failover mechanisms, ensuring 70% faster deployment of new chatbots.
- Open-sourced work on GitHub:
- PwR-NL2DSL, a tool to convert Natural Language to Domain Specific Language.
- PwR-Studio - Studio environment for Programming with Representations
- Jugalbandi Studio - Open-source chatbot framework
- Jugalbandi Manager - Chatbot management platform
- Our complete system has been picked up by Bhashini to build Chatbots across government initiatives.
- Mentors: Sriram Rajamani, B. Ashok, Akash Lal, Sameer Segal
- Research Intern, AI Institute, University of South Carolina
-
Period: December 2022 - April 2024
- Master thesis on Knowledge Enabled Multimodal Ingredient Substitution, built knowledge graph incorporating 27K ingredients and 40K substitution pairs. Enabled precise ingredient recommendations using multimodal and constraint-based searches. Developed an LLM-based query module for the ingredient substitution knowledge graph. Also submitted this work at AAAI-25.
-
Resources: GitHub Repository Dataset on Kaggle Dataset used in UC Irvine + Stanford Health Hackathon 2024 - Formulated cross-modal Recipe Retrieval and developed cooking action recognition for recipe analysis, achieving 95% Recall score leading to paper Cook-Gen at IEEE SMC’23.
- Mentors: Revathy Venkataramanan Dr. Amit Sheth
-
- Visiting Researcher at Societal Computing, Saarland University (SIC)
- Period: May 2023 - August 2023
- Project: Time and Multispectral Domain Satellite Image Super-Resolution
- Worked on Satellite Image Superresolution, utilizing Temporal and Multispectral Information for super-resolution. Utilized high temporal frequency of low-resolution images for Wildlife Tracking. Enhanced disaster analysis through high-temporal frequency & low-resolution images using GAN & Diffusion models.
- Technologies: GANs, Diffusion Models, Computer Vision, Remote Sensing, PyTorch
- Mentors: Ingmar Weber, Ferda Ofli
- Research Intern, University of Maryland, Baltimore US
- Period: October 2022 - April 2023
- Project: Personalized AI Assistant, funded under HealthCareNLP grant.
- Worked on developing Personalized Response Generation models using reward scaling over BART & T5, paper K-PERM accepted at AAAI Symposium’24 and improved NUBIA score by 10%. The work has been focused on using Knowledge and Persona for loss scaling resulting better response generation.
- Technologies: NLP, Information Retrieval, Large Language Models, Conversational Models, Question Answering, Generative AI
- Mentors: Manas Gaur
- AI Intern, EdgeNeural.ai, Pune, India
- Period: June 2022 - August 2022
- Project: Accelerated inference, optimized models through quantization, CPU/GPU customization. Developed training and optimization pipelines for automatic model training and hosting.
- Technologies: Optical Character Recognition (OCR), Object Detection (YOLO, SSD), TensorRT, GPU Optimization, OpenVINO, Docker, AWS, PyTorch, TensorFlow
- Collaborators: Sarvesh Devi, Chidhambararajan, Dhanraj
- Research Intern, Video Analytics Lab, IISc, Bangalore
- Period: May 2022 - August 2022
- Project: Implemented StyleGAN-based architectures for disentangled video interpretation in multiple domains, enhancing image and video generation using Generative Adversarial Networks.
- Technologies: GANs, Recurrent Neural Networks, PyTorch, TensorFlow, Python
- Mentor: Rishubh Parihar
- Research Intern, Visual Learning and Intelligence Lab, Indian Institute of Technology, Hyderabad
- Period: November 2021 - April 2022
- Project: Researched Medical Image Processing with Prof. Dr. C. Krishna Mohan. Developed a Novel Architecture for improved classification of low-quality images and unbalanced datasets. Published SFFNet for Panoramic Dental X-ray Segmentation at IEEE APSCON 2023.
- Technologies: Medical Imaging, Healthcare, Deep Learning, TensorFlow, PyTorch
- Mentor: R Sai Chandra Teja
- Collaborators: Dhruv Makhwana, Rohit Pawar
- Computer Vision Engineer, AI Mage (WETHEKOO)
- Period: March 2021 - April 2021
- Project: Developed Fashion Tagging Engine using Deep Learning, optimized and deployed computer vision models on edge devices, improving customer satisfaction.
- Technologies: TensorFlow, Computer Vision, Siamese Neural Networks, Tagging Engine, Segmentation
- Software Engineer, Rhizicube Technologies
- Period: June 2021 - September 2021
- Project: Oversaw Server & REST API development and Database for Consumer Data Platform Using Golang(Gin). Built a real-time streaming data pipeline using Apache Kafka. Wrote LinkedIn Scrapper and Generalized Organization Website Crawler to scrap data using Selenium & Beautiful Soup.
- Technologies: Back-End Web Development, Relational Databases, Kafka, MySQL, Data Scraping, Go (Programming Language), Database Design, Selenium, Python
- Collaborators: Udit Sarin, Yash Goyal