Skip to content
View DeepikaReddygari's full-sized avatar

Highlights

  • Pro

Organizations

@DATS6103-Team5 @LAiSER-Software @GDSC-GWU @DATS6401

Block or report DeepikaReddygari

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DeepikaReddygari/README.md

Hi - I'm Deepika Reddygari 👋

Data Engineer | MS Data Science @ GWU (GPA 3.9) | Washington, DC

I design and build robust data pipelines and warehousing solutions that transform raw data into actionable insights. With 2+ years of hands-on experience in ETL, data modeling, and real-time processing, I've optimized workflows to reduce latency by 30% and ensured data accuracy across enterprise systems.


About Me

I'm a data engineer passionate about building efficient, scalable data infrastructure. Starting my career at Tata Consultancy Services (TCS), I developed expertise in SQL, ETL pipeline design, and data warehousing using Star Schema. Now, I'm leveraging Python, PySpark, and advanced analytics to solve complex data challenges at scale.


Technical Skills

Data Engineering: ETL Pipeline Design · Data Warehousing (Star Schema) · Data Modeling · Real-time & Batch Processing · Automated Data Validation
Languages & Databases: SQL (Advanced) · Python (Pandas, PySpark, OpenCV) · Java · SQL Server · MongoDB
Tools & Platforms: Informatica Cloud · Git · Linux · Docker · Jira · Power BI
Certifications & Awards: Star of The Month (TCS) · OSPO Award (Real-Time Danger Detection System)


Featured Projects

🏆 Real-Time Danger Detection System

OSPO Award Winner

  • Engineered a real-time computer vision pipeline using Python, OpenCV, and YOLOv3 to detect hazardous conditions
  • Built automated notification system with instant alerts and visual evidence
  • Recognition: OSPO Award and cash prize for technical innovation in public safety

💾 SQL-Based Data Warehousing for EV Market Analysis

  • Designed a Star Schema data warehouse with optimized ETL procedures
  • Created query-ready datasets for business intelligence and analytics
  • Optimized data access and retrieval workflows for efficient query performance

📊 Operational Efficiency Analysis (NYC Taxi Dataset)

  • Analyzed 12+ million records using PySpark and Parquet to identify high-density traffic zones
  • Implemented complex spatial transformation logic within ETL workflows
  • Demonstrated capability with large-scale data processing and feature engineering

🤖 Reinforcement Learning for Pseudo-Labeling

GWU Data Science Capstone · Aug 2025 – Present

  • Designed a Reinforcement Learning (RL) pipeline to perform pseudo-labeling, improving downstream model accuracy compared to standard self-training baselines
  • Automated the generation of high-confidence labels for unlabeled data, reducing reliance on manual annotation
  • Benchmarked RL against non-RL approaches to quantify trade-offs in accuracy and efficiency, delivering a reproducible workflow

📝 Classification of Mathematical Problems Using NLP

GWU, Washington, DC · Jan 2025 – May 2025

  • Applied machine learning and transformer-based NLP models to classify math problems into eight categories
  • Developed data preprocessing and evaluation pipelines for model comparison and reproducibility
  • Documented methodologies and findings for academic dissemination

🔥 Data-Driven Thermal Energy Storage Analysis

GWU

  • Built Python-based thermal models to analyze sensible–latent heat storage systems and assess discharge efficiency
  • Applied machine learning optimization to predict system performance under varying conditions
  • Achievement: Paper accepted for poster presentation at the 7th Battery and Energy Storage Conference, 2025

🎬 Smart Clip: AI-Powered Video Summarization & Content Generation

GenAI / LLMs

  • Developed end-to-end Generative AI pipeline using OpenAI models and LangChain to automatically generate transcripts, summaries, and viral clips from long-form video content
  • Optimized system prompts to improve summarization accuracy and context retention by 20%

Publications

  • Fabrication and Study of PCM-based Waste Heat Recovery System – POCER 2019, Nottingham University
  • Thermal Energy Storage System Performance Using PCM-Variable Heat – POCER 2019, Nottingham University
  • Paper accepted for poster presentation at the 7th Battery and Energy Storage Conference, 2025

Achievements & Awards

  • 🏆 OSPO Award – Real-Time Danger Detection System with cash prize
  • Star of The Month – Tata Consultancy Services (TCS)
  • 🎓 Global Leaders Award – George Washington University (Scholarship)

Education

Master of Science, Data Science
George Washington University, Washington, DC · December 2025
GPA: 3.9 | Honors: Global Leaders Award (Scholarship)


Leadership & Community

Operations Lead | Google Developer Groups
August 2024 – Present

  • Directed the "Build with AI" conference with C-suite executives (CEOs/CTOs)
  • Moderated executive panel on emerging tech trends and enterprise scalability
  • Collaborated with university departments on ethical, inclusive technology learning

Volunteer | IEEE Student Chapter
January 2018 – May 2020 · JNTU, India

  • Coordinated technical workshops and hackathons for 200+ participants
  • Managed stakeholder communication and event scheduling

Let's Connect

📧 Email: [email protected]
📱 Phone: 571-274-9816
💼 LinkedIn: https://www.linkedin.com/in/deepikardy7129
📍 Location: Washington, DC

Popular repositories Loading

  1. Udacity-ML-Charity-Competition Udacity-ML-Charity-Competition Public

    Forked from phanindra-max/Machine-Learning-I

    Academic final project

    Jupyter Notebook

  2. extract-module extract-module Public

    Forked from s2t2/laiser-extract-module

    Jupyter Notebook

  3. SmartClip SmartClip Public

    Python

  4. transformers transformers Public

    Forked from huggingface/transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python

  5. DeepikaReddygari DeepikaReddygari Public

  6. deepika-cv-portfolio deepika-cv-portfolio Public