I am Abhishek

Name: Abhishek

Email: abhishek.f@northeastern.edu

Phone: (603) 800-3041

Skills
Python 100%
SQL 100%
R 75%
JAVA 80%

About me

As a data scientist with a strong software engineering background, I specialize in creating scalable, data-driven solutions across various disciplines including machine learning, cloud computing, and database management. My expertise spans efficient data pipeline design, cloud platforms (Azure, AWS), and real-time monitoring systems using technologies like Kafka and Elasticsearch.
My career highlights include:

  • Leading the development of NLP-based chatbots at DataworksAI, utilizing LLMs within a Retrieval-Augmented Generation (RAG) pipeline.

  • Developing a deep learning model for the multiclass image classification project, optimizing performance through data augmentation.

  • I developed predictive models utilizing both classical and quantum machine learning techniques for classification.

I am motivated by the challenge of converting complex data into actionable insights, consistently creating innovative solutions that deliver concrete value in the field of data science.

Resume

Summary

Seasoned AI/ML professional with 5+ years of experience, combining data science and software engineering expertise. Specializes in NLP-driven chatbots, big data warehousing, ETL automation, cloud-based AI/ML solutions, and technical team leadership. Proven ability to transform advanced AI/ML concepts into practical, high-impact applications across diverse industries.

Education

Masters in Analytics

Major Concentration: Applied Machine Intelligence
2022 - 2024

Northeastern University, Boston, MA

Courses: Data Mining, Data Management & Big Data, Enterprise Analytics, Intermediate Analytics, Predictive Analytics, Fundamentals of AI, Applications of AI, AI System Technologies

GPA: 3.89

Bachelor of Engineering in Computer Science

2013 - 2017

Rajiv Gandhi Technical University (RGPV), Bhopal, India

Courses: Data Structures, Database Management, Design and Analysis of Algorithms & Object-Oriented Programming

Professional Experience

Data Scientist

2024 - Present

DataworksAI, Boston, MA

  • Engineered LLM-powered chatbots using GPT-3.5 Turbo, achieving 87% accuracy in Text-to-SQL automation and contextual responses through LangChain and LlamaIndex frameworks.
  • Implemented advanced prompt engineering techniques (few-shot learning, zero-shot learning, chain-of-thought reasoning) to optimize LLM performance and accuracy.
  • Developed real-time conversation management using Redis and FastAPI, integrating vector-based storage with MongoDB for retrieval-augmented generation (RAG).
  • Deployed and scaled chatbot prototypes on Red Hat OpenShift AI using Docker and Kubernetes, implementing CI/CD pipelines and A/B testing.
  • Applied supervised fine-tuning (SFT) principles to enhance model performance through user feedback and prompt optimization.
  • Collaborated with analytics and AI professionals to identify automation opportunities and optimize LLM applications.

Senior Software Engineer

2018 - 2022

PowerSchool, LLC., Bengaluru, India

  • Developed a data warehouse using Databricks, integrating student data from Azure Data Lake, RESTful APIs, MongoDB, and AWS S3.
  • Automated ETL using RPA and Airflow for multiple portals, reducing project completion time by 35% with Docker and Git Actions CI/CD.
  • Developed SQL Server stored procedures using T-SQL to enhance RPA project data logging, boosting error-tracking efficiency by 40%.
  • Applied real-time monitoring system using SageMaker, Kafka, Elasticsearch, Kibana boosting automated ETL process efficiency by 15%.
  • Developed regression models using Python, Scikit-learn to forecast monthly CRM/ERP data migration requests. Visualized results with Tableau/Power BI, leading to a 20% improvement in data migration planning efficiency.
  • Mentored 20+ new hires in problem-solving and automating ETL pipelines within Agile framework, using JIRA for tracking.
  • Developed MySQL/Java web interface for database exports, improving productivity 9%. Containerized with Kubernetes for scalability.

Leveraged Technologies

8

Certifications

12

Projects

7

Recognition

2

Extra Curriculum Activities

Portfolio

University Explorer: Chatbot

PostgreSQL | LLMs | MongoDB | StreamLit |
Langchain

Out of Pattern Detection

Azure | Airflow | Kafka | ElasticSearch |
Kibana | Pandas | Numpy | Pyspark

Viral Rash Classification

Tensorflow | Pytorch | OpenCV | Pillow | Ultralytics | Numpy | Pandas | Seaborn

Glioma Grade Classification

Scikit-learn | Qiskit | Seaborn | Matplotlib

Message Distribution Analysis

Snowflake | Azure | Power BI | Python

NYC Trip Data Analysis

Pyspark | Scikit-learn | Tableau | Python

Boston Housing Dataset

AWS(S3) | Bokeh | Panel | Scikit-learn | Python

Contact

GitHub

Call Us

+1 (603) 800-3041

Email Us

abhishek.f@northeastern.edu

LinkedIn