
Aveg Ganorkar
Data & AI Engineer · College Park, MD
I build data systems and AI products that work in production — streaming pipelines, LLM-powered applications, warehouse modeling, and the interfaces that sit on top of them. Comfortable owning projects end-to-end in startup and research environments.
Currently completing my MS in Data Science at the University of Maryland. Strong in Python, SQL, and cloud-native tooling with a track record of shipping reliable systems fast. I care about the details — from schema design to the dashboard stakeholders actually use.
Experience
Data Science & Analytics Intern
Drug Hunter
- Built Airflow-orchestrated NLP pipelines processing 1M+ biomedical records with 93% classification accuracy, reducing manual tagging by 50%.
- Developed RAG systems using pgvector and LLM APIs for semantic search over large-scale biomedical datasets — natural language querying over research corpora.
- Built LLM-powered workflows integrating Anthropic and OpenAI APIs for automated extraction, classification, and summarization of unstructured research content.
- Designed Great Expectations validation checks and pipeline monitoring to maintain data integrity across production pipelines serving product and research teams.
Research Assistant
University of Maryland
- Built a full-stack analytics platform (Python + Streamlit) enabling self-serve simulation and data exploration for interdisciplinary research teams, replacing fragmented spreadsheet workflows.
- Integrated OpenAI APIs and Airtable workflows to automate data collection and preprocessing pipelines supporting fraud detection research.
- Developed automated validation pipelines and interactive dashboards to monitor research results across technical and non-technical stakeholders.
Data Analyst
Smart Tech Software
- Engineered automated ELT pipelines using Airflow and Snowflake, boosting pre-processing efficiency by 42% and consolidating data from multiple siloed systems.
- Wrote complex SQL (CTEs, window functions, aggregations) and dbt models to define trusted metrics for Operations, Sales, and Finance.
- Deployed AWS Lambda real-time communication channels and Slack webhook integrations for automated reporting and anomaly alerting.
- Created Power BI and Tableau dashboards tracking KPIs, revenue trends, and operational performance for business leadership.
Data Engineer
White Heaven Entertainment
- Owned end-to-end data infrastructure as sole data engineer: ELT pipelines, API integrations, backend services, and warehouse architecture.
- Automated ELT pipelines using SQL and AWS Lambda, improving pre-processing efficiency by 42% and enabling real-time delivery across internal systems.
- Built monitoring, validation, and alerting layers for data consistency and pipeline reliability across production systems.
Education
M.S. Data Science & Analytics
University of Maryland, College Park
Machine Learning, Big Data Systems, Data Systems, Statistical Modeling, Probability & Statistics, Data Visualization, Algorithms for Data Science, Computer Vision
B.E. Electronics & Communication Engineering
RCOEM, Nagpur
Data Structures, Algorithms, Database Management, Operating Systems, Computer Networks, Computer Architecture, Software Engineering
Skills
Python, SQL, JavaScript/Node.js, TypeScript, R, Bash
Claude API, OpenAI API, Gemini API, RAG pipelines, pgvector, embeddings, prompt engineering, NLP
Scikit-learn, Pandas, NumPy, regression, classification, clustering, time-series forecasting, anomaly detection
Apache Kafka, Apache Flink, Airflow, dbt, ETL/ELT design, dimensional modeling, CDC patterns, event-driven architecture
Snowflake, BigQuery, PostgreSQL, MySQL, Redis, pgvector, Great Expectations
n8n (self-hosted), Airflow, webhook integrations, API orchestration, Google Sheets API
FastAPI, Node.js, RESTful API design, microservices, OAuth integrations
AWS (S3, EC2, Lambda, RDS), Docker, Docker Compose, Oracle Cloud, Linux, Nginx, CI/CD
Tableau, Power BI, Streamlit, Grafana, Prometheus
MediaPipe BlazePose, OpenCV, pose estimation, image processing, stereo vision
What people say
“He possesses exceptional technical expertise and demonstrates outstanding teamwork abilities. His problem-solving capabilities and collaborative spirit ensured our projects' success. His positive attitude and creativity were instrumental in generating innovative ideas.”
Malhar Deshkar
SRE 2 @ Nutanix · Colleague
“Aveg was an integral asset to our team, consistently bringing forward innovative solutions and embracing challenges with poise and determination. His management, leadership, and communication skills were consistently displayed in every initiative he led.”
Shreya Chakravarty
Data Scientist @ Kinaxis · Manager
“Working with Aveg on our final year project was an absolute pleasure. His initiative in getting things started, dedication to seeing them through, and his quick problem-solving skills made him an invaluable part of the team.”
Meeti Khandelwal
Software Engineer II @ JPMorganChase · Colleague
“Aveg's skills in Python and JavaScript played a key role in our success. Our joint work led to multiple research papers and presentations. Beyond his technical abilities, he stood out for his supportive and collaborative nature — always willing to help.”
Sujal Agrawal
Software Engineer II @ Fivetran · Colleague
