Hi, I'm a Software Development Engineer specializing in high-throughput data systems, distributed scraping infrastructure, and cloud-native backend APIs. I have hands-on experience building and maintaining PySpark-based ETL pipelines that process 20M+ records daily across AWS and GCP, as well as mobile API reverse engineering for large-scale data extraction. Additionally, I have a strong foundation in the LLM ecosystem (LangChain, LangGraph, LangSmith) and am highly motivated to build Agentic AI applications and AI-driven workflows.
I'm adaptable, comfortable tackling complex data engineering, backend, or applied AI challenges, and looking for an opportunity to contribute to a great team. Feel free to reach out via email!
Remote: Yes/Hybrid/Onsite (can be flexible with the right opportunity)
Willing to relocate: Yes (India only)
Technologies: Python, SQL, C/C++, FastAPI, PySpark, Scrapy, AWS, GCP, Apache Kafka, Docker, LangChain
Résumé/CV: https://drive.google.com/file/d/1xGjgaJK7k8b-hY10f0YmJ2x8qxa...
Email: dheeraj25062003@gmail.com
Hi, I'm a Software Development Engineer specializing in high-throughput data systems, distributed scraping infrastructure, and cloud-native backend APIs. I have hands-on experience building and maintaining PySpark-based ETL pipelines that process 20M+ records daily across AWS and GCP, as well as mobile API reverse engineering for large-scale data extraction. Additionally, I have a strong foundation in the LLM ecosystem (LangChain, LangGraph, LangSmith) and am highly motivated to build Agentic AI applications and AI-driven workflows.
I'm adaptable, comfortable tackling complex data engineering, backend, or applied AI challenges, and looking for an opportunity to contribute to a great team. Feel free to reach out via email!