Academic Background: I graduated with a master's degree in Computer Science and Engineering, specializing in Artificial Intelligence from The State University of New York at Buffalo. My academic journey had been diverse, covering a wide range of subjects, including NLP, Computational Linguistics, Pattern Recognition, Machine Learning, and Algorithms Design. My Bachelor's in Electronics Engineering from the esteemed Indian Institute of Technology (BHU) Varanasi, served as the foundation that inspired me to delve into the captivating fields of Artificial Intelligence and Machine Learning.
Practical Experience: Complementing my academic foundation, I've engaged in real-world projects that have further enriched my knowledge. Notably, my master's project involved the development of an innovative open-domain retrieval-augmented generation chatbot. Additionally, as a Research Assistant in our network lab under the guidance of Professor Hongxin Hu, I contributed to the creation of a cutting-edge Network Intrusion Detection System driven by deep learning.
Technical Skills: My skill set is versatile and robust, encompassing proficiency in various programming languages such as Python, Java, C++, PLSQL, and SQL. I also possess expertise in a plethora of frameworks and tools, including PyTorch, TensorFlow, Keras, Langchain, LlamaIndex, VectorDBs(ChromaDB, Faiss, elasticsearch), MLFlow, PySpark, Pandas, Flask, MongoDB, and SpringBoot. Furthermore, I have hands-on experience working with cloud platforms like Oracle, Azure, Google Cloud Platform (GCP), Amazon Web Services (AWS), Atlas, OpenAI.
Professional Roles: My journey in the realm of advanced NLP, generative AI, Large Language Models (LLMs), and machine learning has been shaped by a series of impactful roles:
Senior Data Scientist at Oracle: I lead the development of Generative AI initiatives and spearhead innovative use cases in statistical data science and machine learning
Senior Data Scientist at Hilabs: Here, I led development of innovative end-to-end system for contract document processing, integrating advanced AI technologies for entity extraction and indexing. Successfully implemented scalable solutions, ensuring security and privacy, with proven effectiveness through client validation and integration.
Machine Learning Engineer Intern at Apexanalytix: Here, I played a pivotal role in the development of a Knowledge-Specific Conversational Agent, refining my skills in crafting high-performance conversational systems powered by Large Language Models (LLMs).
Senior Application Engineer(ML) at Oracle: In this role, I engineered an in-memory multivariate anomaly prediction service with a focus on model explainability, delving into traditional machine learning, data analysis, and database optimization.
Project Engineer(ML) at Wipro Limited: Here, I implemented chatbot services that significantly reduced workloads and enhanced the user experience of the employee helpline system.
Summary: Experienced AI professional with a passion for leveraging advanced technologies like NLP, Machine Learning, and Large Language Models (LLMs) to solve real-world problems. Strong academic background complemented by over 5 years of industry experience, leading projects and implementing scalable AI solutions. Eager to continue contributing to groundbreaking advancements in AI and software development.
I am highly enthusiastic about exploring opportunities in this space. If you have relevant openings or are interested in having a conversation on these topics. Feel free to check out my
Resume
and drop me an
e-mail.
I'd be happy to connect for a quick call to discuss!
Joined Oracle as a full time Senior Data Scientist
Feb '24  
Joined Hilabs as a full time Senior Data Scientist
Jan '24  
Joined Apexanalytix as a full time Data Scientist
Jan '24  
Graduated with my Masters in CS (with a specialization in AI) from UB
Aug '23  
Fall Internship : Got internship extension at Apexanalytix as Machine Learning Engineer Intern (Part Time)
Aug '23  
Started my final semester(Fall 2023) at UB.
May '23  
Summer Internship : Joined Apexanalytix as Machine Learning Engineer Intern
Feb '23  
Started working as a Research Assistant in UB networking lab under guidance of Professor Hongxin Hu
Jan '23  
Started my second semester(Spring 2023) at UB.
Aug '22  
Started my graduate studies at State University of New York at Buffalo(UB) - Fall 2022 Semester.
Sept '20  
Joined Oracle (OFSS) as full time Senior Application Engineer (ML)
June '20  
Completed my 2 years working at Wipro Limited.
June '18  
Joined Wipro Limited as full time Project Engineer (ML)
May '18  
Completed my Bachelor of Engineering(BE/Btech) in Electronics Engineering from IIT BHU (Varanasi).
State University of New York at Buffalo
Master of Science | Computer Science and Engineering Specialization: Artifical Intelligence
Aug '22 - Dec '23'
Coursework:
•
CSE-635: Natural Language Processing and Text Mining
•
CSE-567: Computational Linguistics
•
CSE-574: Intro Machine Learning
•
CSE-555: Intro Pattern Recognition
•
CSE-587: Data Intensive Computing
•
CSE-531: Algorithms Analysis and Design
•
CSE-529: Algorithms for Modern Computing Systems
•
CSE-589: Modern Networking Concepts
•
MGO-665: Technological Entrepreneurship
•
CSE-705: Seminar on Recent Advances in Deep Learning & Reinforcement Learning
Indian Institute of Technology (Banaras Hindu University), Varanasi
Bachelor of Technology | Electronics Engineering
July '14 - May '18
Senior Data Scientist | Oracle Malvern, PA, US
Aug '24 - Present
ProSchedAI: Predictive Scheduling and Delay Mitigation •
Led a project at Oracle to address construction clients' concerns about schedule delays in their projects and activities, which were affecting operational efficiency and profitability
•
Developed a machine learning solution to predict both the occurrence and duration of delays, while providing explanations and mitigation strategies using large language models (LLMs)
•
Designed and implemented a multi-task learning model using classification and regression techniques, alongside LLM-generated reports to explain delay reasons and offer actionable insights
•
Achieved 92% accuracy in delay prediction, with an average error of ±2.5 days for delay estimates. Clients reduced project delays by 15% after implementing the recommendations provided
Senior Data Scientist | Hilabs Bethesda, MD, USA
Feb '24 - May '24
GenAI Driven Contract Analyzer: Streamlining Claim Verification •
Led development of scalable end-to-end system for processing contract documents of insurance providers, emphasizing entity extraction to facilitate pricing configuration in claim processing applications
•
Engineered custom document processing pipeline with LayoutLM extracting entities with images, tables, and text elements
•
Developed Langchain service for indexing extracted elements and managed metadata within OpenSearch to optimize retrieval
•
Implemented a robust infrastructure leveraging self-hosted fine-tuned Mistral AI LLM and asynchronous processing via SQS to efficiently handle requests, while prioritizing the utmost security and privacy of sensitive legal contracts
•
Successful POCs followed by integration for 3 clients to validate solution effectiveness and potential for widespread adoption
Data Scientist/ML Engineer Intern | Apexanalytix Greensboro, NC, USA
May '23 - Feb '24
Generative Knowledge Specific Chatbot •
Developed advanced Retrieval Augmented Generation(RAG) Chatbot with LLM for intelligent knowledge access across teams
•
Engineered efficient retrieval pipeline with Parent Child document indexing using LangChain and Chroma VectorDB
•
Designed agile RAG pipeline, integrating MMR scoring for chunk retrieval and Azure OpenAI LLM for response generation
•
Implemented user feedback collection to monitor chatbot performance and gather data for iterative refinement and fine-tuning
•
Achieved 89% approval in human evaluations and integrated technology into 12 internal and 18 external client applications
Senior Application Engineer(ML) | Oracle Bengaluru, India
Sept '20 - Aug '22
Preemptive Anomaly Prediction in Corporate Billing •
Designed and implemented in-memory multivariate anomaly prediction system for corporate billing, addressing complex monthly billing challenges
•
Utilized Oracle in-database ML for Semi-Supervised classification with both local and global model explainability
•
Optimized service with indexing and parallel processing, automating 1.2M bills and 5M segments in 20 minutes, achieving an impressive 92% anomaly class precision
•
Integrated system with both Cloud and On-premise services of Oracle Revenue Management and Billing (ORMB), filed innovation patent at USPTO
Project Engineer(ML) | Wipro Limited Bengaluru, India
June '18 - Sept '20
Chatbot Services for Employee Helpline Portal’s Ticketing System •
Developed Employee Helpline Portal Chatbot with effective retrieval of historical ticket resolutions for enhanced user support
•
Implemented query intent classification and BERT-powered semantic search to deliver precise responses
•
Successfully integrated the chatbot into the portal and achieved the target human agent intervention reduction of 70%
•
Achieved 79% accuracy score in evaluation and decreased wait times from 18 to 4 minutes, improving overall user experience
•
Implemented open-domain chatbot with retrieval-based and casual conversation capabilities.
•
Developed logical and rule-based dialog manager for effective query redirection based on user interaction
•
Integrated retrieval and generative models, improving conversational accuracy by 35% while maintaining resource efficiency.
Network-based Intrusion Detection System (NIDS)
Research Assistant, Supervisor: Dr. Hongxin Hu
Collaborators: Feng Wei
PyTorch, TensorFlow, Data Analysis, Deep Learning, Python, Flask, Jupyter Notebook | Feb. 2023 - May. 2023 [code][Results][Data Analysis]
•
Implemented an Intrusion Detection System using deep neural detectors to efficiently identify and respond to potential security threats in the network, mitigating the risk of data loss and downtime.
•
Conducted comprehensive analysis on 12 different network attack datasets and evaluated the performance of 4 deep neural detectors. Reported insights into the limitations and effectiveness of methods.
•
Designed, and implemented an LLM-powered SQL Database Agent, enabling intuitive natural language interactions with SQL databases..
•
Seamlessly integrated LangChain to extract comprehensive table descriptions and contextual information directly from SQL databases. This context was then leveraged to enhance the generative capabilities of the OpenAI model.
•
Introduced a robust query execution layer within the agent, proficiently managing SQL query execution and proficiently handling database validation errors to ensure accurate and reliable query results.
•
Independently designed and developed the Offers Search Engine, implementing hybrid semantic and exact search techniques for improved offer retrieval.
•
Spearheaded the integration of search and reranking pipelines, resulting in a more 40% more accurate search and 70% reduction in retrieval times for extensive datasets.
•
Successfully deployed the service with a user-friendly interface using Streamlit, enhancing user engagement and ensuring a seamless experience for offer searches.
•
Spearheaded the design and implementation of a Retrieval Augmented Generation (RAG) chatbot system, seamlessly integrating Large Language Models (LLMs) with retrieval mechanisms to enhance information quality and context relevance.
•
Optimized retrieval pipeline with Langchain and integrated OpenAI and LAMMA generative models, elevating user trust and experience through coherent responses from diverse sources of generation.
•
developed a user-friendly solution with a Streamlit-based UI interface, ensuring an intuitive and accessible user experience. Significantly, I enhanced the system's performance to provide seamless access to a vast database of 30 documents, with response times consistently under 5 seconds.
•
Developed a client-server chat application following the conventional client-server model, providing the capability for numerous clients to log in, establish their identity, and communicate with each other via the central server.
•
Utilized socket programming to establish robust connections between the server and clients, ensuring reliable message transmission. Implemented a buffering system to manage message storage and retrieval, particularly for clients who were offline at the time of message receipt.
•
Conducted comprehensive testing of the application in a live production environment to assess its accuracy and efficiency, guaranteeing that it met the performance requirements and functioned reliably under real-world conditions
Graduate Research Assistant | CactiLab - UB
Jan '22 - Mar '22 | Jan '23 - Mar '23
Working as a graduate research assistant for SUNY research foundation(RF) under the supervision of Dr. Hongxin Hu, at the CactiLab - University at Buffalo. I collaborated with Feng Wei and utilized PyTorch, TensorFlow, Python, Flask, and Jupyter Notebook to develop a Network-based Intrusion Detection System (NIDS). This system effectively harnessed the power of deep learning to proactively identify and respond to potential security threats within network environments, thereby reducing the risk of data loss and network downtime. In addition to the implementation of the NIDS, I conducted in-depth analyses on 12 distinct network attack datasets, evaluating the performance of four different deep neural detectors. My research yielded valuable insights into the strengths and limitations of these methods, contributing to our understanding of effective network security measures.
Paper Title
Virtual Conversation with Real-Time Prediction of Body Moments/Gestures
This template is a modification to Jon Barron's website. Find the source code to my website here.
Do not scrape the HTML from this page itself, as it includes analytics tags that you do not want on your own website, use the github code instead and either remove the google tag or replace it with your own tag.