Aveg Ganorkar

Data & AI Engineer · College Park, MD

I build data systems and AI products that work in production — streaming pipelines, LLM-powered applications, warehouse modeling, and the interfaces that sit on top of them. Comfortable owning projects end-to-end in startup and research environments.

Currently completing my MS in Data Science at the University of Maryland. Strong in Python, SQL, and cloud-native tooling with a track record of shipping reliable systems fast. I care about the details — from schema design to the dashboard stakeholders actually use.

Experience

Jun 2025 – Aug 2025
Drug Hunter

Data Science & Analytics Intern

  • Built Airflow-orchestrated NLP pipelines processing 1M+ biomedical records with 93% classification accuracy, reducing manual tagging by 50%.
  • Developed RAG systems using pgvector and LLM APIs for semantic search over large-scale biomedical datasets, enabling natural-language querying of research corpora.
  • Built LLM-powered workflows integrating Anthropic and OpenAI APIs for automated extraction, classification, and summarization of unstructured research content.
  • Designed Great Expectations validation checks and pipeline monitoring to maintain data integrity across production pipelines serving product and research teams.
Mar 2025 – Sep 2025
University of Maryland

Research Assistant

  • Built a full-stack analytics platform (Python + Streamlit) enabling self-serve simulation and data exploration for interdisciplinary research teams, replacing fragmented spreadsheet workflows.
  • Integrated OpenAI APIs and Airtable workflows to automate data collection and preprocessing pipelines supporting fraud detection research.
  • Developed automated validation pipelines and interactive dashboards that let technical and non-technical stakeholders monitor research results.
Aug 2023 – Jul 2024
Smart Tech Software

Data Analyst

  • Engineered automated ELT pipelines using Airflow and Snowflake, boosting pre-processing efficiency by 42% and consolidating data from multiple siloed systems.
  • Wrote complex SQL (CTEs, window functions, aggregations) and dbt models to define trusted metrics for Operations, Sales, and Finance.
  • Deployed real-time communication channels on AWS Lambda, with Slack webhook integrations for automated reporting and anomaly alerting.
  • Created Power BI and Tableau dashboards tracking KPIs, revenue trends, and operational performance for business leadership.
Jul 2022 – Jul 2023
White Heaven Entertainment

Data Engineer

  • Owned end-to-end data infrastructure as sole data engineer: ELT pipelines, API integrations, backend services, and warehouse architecture.
  • Automated ELT pipelines using SQL and AWS Lambda, improving pre-processing efficiency by 42% and enabling real-time delivery across internal systems.
  • Built monitoring, validation, and alerting layers for data consistency and pipeline reliability across production systems.

Education

Aug 2024 – May 2026
University of Maryland, College Park

M.S. Data Science & Analytics

Machine Learning, Big Data Systems, Data Systems, Statistical Modeling, Probability & Statistics, Data Visualization, Algorithms for Data Science, Computer Vision

Aug 2019 – May 2023
RCOEM, Nagpur

B.E. Electronics & Communication Engineering

Data Structures, Algorithms, Database Management, Operating Systems, Computer Networks, Computer Architecture, Software Engineering

Skills

Languages

Python, SQL, JavaScript/Node.js, TypeScript, R, Bash

AI & LLMs

Claude API, OpenAI API, Gemini API, RAG pipelines, pgvector, embeddings, prompt engineering, NLP

ML & Data Science

Scikit-learn, Pandas, NumPy, regression, classification, clustering, time-series forecasting, anomaly detection

Data Engineering

Apache Kafka, Apache Flink, Airflow, dbt, ETL/ELT design, dimensional modeling, CDC patterns, event-driven architecture

Warehousing & DB

Snowflake, BigQuery, PostgreSQL, MySQL, Redis, pgvector, Great Expectations

Automation

n8n (self-hosted), Airflow, webhook integrations, API orchestration, Google Sheets API

Backend & APIs

FastAPI, Node.js, RESTful API design, microservices, OAuth integrations

Cloud & Infra

AWS (S3, EC2, Lambda, RDS), Docker, Docker Compose, Oracle Cloud, Linux, Nginx, CI/CD

Visualization

Tableau, Power BI, Streamlit, Grafana, Prometheus

Computer Vision

MediaPipe BlazePose, OpenCV, pose estimation, image processing, stereo vision

What people say

He possesses exceptional technical expertise and demonstrates outstanding teamwork abilities. His problem-solving capabilities and collaborative spirit ensured our projects' success. His positive attitude and creativity were instrumental in generating innovative ideas.

Malhar Deshkar

SRE 2 @ Nutanix · Colleague

Aveg was an integral asset to our team, consistently bringing forward innovative solutions and embracing challenges with poise and determination. His management, leadership, and communication skills were consistently displayed in every initiative he led.

Shreya Chakravarty

Data Scientist @ Kinaxis · Manager

Working with Aveg on our final year project was an absolute pleasure. His initiative in getting things started, dedication to seeing them through, and his quick problem-solving skills made him an invaluable part of the team.

Meeti Khandelwal

Software Engineer II @ JPMorganChase · Colleague

Aveg's skills in Python and JavaScript played a key role in our success. Our joint work led to multiple research papers and presentations. Beyond his technical abilities, he stood out for his supportive and collaborative nature — always willing to help.

Sujal Agrawal

Software Engineer II @ Fivetran · Colleague