Skip to content
View Abhi0323's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Abhi0323

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Abhi0323/README.md


## 🚀 About Me

I build production-grade AI systems for complex regulated environments — where correctness, reliability, security, and explainability matter more than benchmarks.

My work sits at the intersection of:

  • 📄 Document Intelligence (OCR + layout + page understanding + field extraction)
  • 🧠 Multimodal AI (LLMs / SLMs / VLMs) for real healthcare workflows
  • ⚙️ AI Platform Engineering (on-prem, cloud, gov cloud; microservices; CI/CD)
  • GPU + inference systems (CUDA, NVIDIA drivers, custom builds, profiling)

I’m passionate about Responsible AI in healthcare — making care faster and easier while staying audit-friendly, verifiable, and compliant.


🏥 What I’ve built in Healthcare AI

✅ AI platform for medical claims review (HIPAA / PHI / PII)

A production platform that processes large medical claim PDFs end-to-end:

  • OCR + document splitting
  • Page classification (DistilBERT + rules) to categorize page types
  • Layout detection (YOLO) to locate page regions (titles, summary boxes, checkboxes, etc.)
  • Region cropping to improve OCR/VLM accuracy and reduce compute
  • Field extraction (beneficiary details, NPI/MBI, HCPCS/CPT, ICD codes, claim lines, etc.)
  • Decision support system using RAG, answering nurse review questions with evidence + traceability
  • Human-in-the-loop UI: nurses can verify exactly where an answer came from

✅ Prior Authorization automation

  • Built a full end-to-end service operations module (frontend + backend) rapidly using AI-assisted development (Cursor/Claude)
  • Implemented an OCR service using Azure Document Intelligence with custom-trained models for prior auth cover sheets
  • Deployed services using Azure Web Apps

🧩 Core Capabilities (What I do well)

Production Document Intelligence

  • OCR pipelines, layout detection, page classification, field extraction
  • Handling messy documents: handwriting, checkboxes, signatures, diagrams

LLM/SLM/VLM Systems

  • Model selection & benchmarking (LLaMA, Mistral, Phi-3, BioBERT, BioMedBERT, Bio-Mistral)
  • RAG with citations + evidence grounding
  • Guardrails using LangChain (safe inputs/outputs for regulated environments)
  • Fine-tuning + CPU-first optimization via 4-bit quantization (C++/BitNet-style)

GPU + Inference Engineering

  • Bare-metal GPU bring-up (drivers, CUDA toolkit, NVCC, kernels)
  • H100 / L40s / A40 / RTX 4090 / RTX 5090 / DGX environments
  • Inference runtimes: PyTorch, ONNX Runtime, llama.cpp
  • Custom builds aligned to CUDA versions + GPU compute capability

Platform Engineering & Deployment

  • Microservices architecture
  • On-prem deployment on RHEL-based systems
  • RPM packaging + systemd services for “install → services up”
  • Containers + Kubernetes + ACR
  • CI/CD (Azure DevOps) for RPMs + artifacts + deployments
  • Experience across on-prem + cloud + gov cloud

Reliability, Observability, Security

  • Performance monitoring stack: Telegraf + InfluxDB + Chronograf
  • System/service/GPU metrics and bottleneck analysis
  • Production tuning: Uvicorn workers, CPU/I/O concurrency, DB pooling
  • Security: HTTPS everywhere, cert-based auth, mTLS between services + to DB

Leadership

  • Led 15 interns (Label Studio + YOLO layout detection pipeline)
  • Mentored 3 associates (install/test/validation, architecture onboarding)
  • Acted as product-owner proxy / client-facing technical lead when needed

🧰 Tech Stack (Colorful + Organized)

🧠 AI / ML / NLP

📄 Document Intelligence

⚡ GPUs / CUDA

☁️ Platform / Deployment

🗄️ Databases / Vector Search

📈 Observability / Performance

🔐 Security / Compliance

📊 GitHub Stats


✍️ Writing & Content


🤝 Work With Me

If you're building:

  • AI platforms in regulated environments
  • Document intelligence at scale
  • Evidence-grounded RAG systems
  • GPU-backed inference services

Let’s connect: https://www.linkedin.com/in/abhishek-chandragiri/


⭐ I build AI that survives production constraints — compliance, performance, and real-world messy data.

Pinned Loading

  1. Real-Time-Customer-Segmentation-with-Scalable-Kubernetes-Deployment-and-CI-CD-Integration Real-Time-Customer-Segmentation-with-Scalable-Kubernetes-Deployment-and-CI-CD-Integration Public

    Developed and deployed a scalable machine learning model for real-time customer segmentation using FastAPI, Docker, Kubernetes, and GitHub Actions, with an end-to-end CI/CD pipeline on Azure Kubern…

    Python 10 6

  2. Agile-MLOps-Deployment-Docker-AWS-CI-CD-Pipeline Agile-MLOps-Deployment-Docker-AWS-CI-CD-Pipeline Public

    An end-to-end predictive maintenance application using machine learning to enhance industrial efficiency. This project employs robust modular architecture and advanced MLOps practices, including Do…

    Python 35 9

  3. End-to-End-Employee-Churn-Prediction-with-Azure-Databricks End-to-End-Employee-Churn-Prediction-with-Azure-Databricks Public

    Developed an end-to-end machine learning solution for predicting employee churn using Azure Databricks, leveraging Spark for data processing, MLflow for managing the ML workflow, and deploying the …

    Jupyter Notebook 9 5

  4. Fine-Tuning-LLaMA-2-with-QLORA-and-PEFT Fine-Tuning-LLaMA-2-with-QLORA-and-PEFT Public

    This project enhances the LLaMA-2 model using Quantized Low-Rank Adaptation (QLoRA) and other parameter-efficient fine-tuning techniques to optimize its performance for specific NLP tasks. The impr…

    Jupyter Notebook 13 5

  5. RAG-Powered-AI-Assistant-Transforming-Data-Retrieval-and-Analysis-Across-the-Web-and-PDFs RAG-Powered-AI-Assistant-Transforming-Data-Retrieval-and-Analysis-Across-the-Web-and-PDFs Public

    Harness the power of Retrieval-Augmented Generation with the Personal AI Assistant, an innovative tool designed to extract and synthesize information from web and PDF sources efficiently. This cutt…

    Python 32 12

  6. Machine-Learning-Based-Loan-Default-Early-Warning-System Machine-Learning-Based-Loan-Default-Early-Warning-System Public

    Developed an end-to-end ML system on Azure to predict loan defaults, leveraging advanced data preprocessing, feature engineering, and machine learning models to optimize accuracy. This project incl…

    Jupyter Notebook 22 8