I'm a passionate Data Engineer with 3+ years of experience building scalable data pipelines and cloud-based solutions. I specialize in transforming raw data into actionable insights using modern data engineering practices.
- Building end-to-end data pipelines on Azure using Databricks and PySpark
- Developing serverless ETL workflows with Azure Functions
- Optimizing big data processing with Apache Spark
- Data Engineer @ Tata Consultancy Services (TCS)
- Working on large-scale data migration and transformation projects
- Specialized in Azure cloud services, PySpark, and modern ETL workflows
Languages & Frameworks
- Python | SQL | PySpark | Spark-SQL
Big Data Technologies
- Apache Spark | Hadoop | Hive | Sqoop
Cloud & Azure Services
- Azure Databricks | Azure Data Factory | Azure Data Lake Storage
- Azure Blob Storage | Azure SQL Database | Azure Functions
Databases
- Microsoft SQL Server | MySQL | PostgreSQL | HBase
Tools & Platforms
- GitHub | Azure DevOps | Jupyter Notebook | Delta Lake
- Databricks Certified Data Engineer Associate (May 2025 - May 2027)
- Python for Everybody - Coursera
- Programming, Data Structures and Algorithms Using Python - NPTEL
- PySpark for Data Engineers
Built a complete data pipeline to analyze flight delay trends using Azure services
- Improved data processing speed by 30%
- Reduced query execution time by 25%
- Technologies: Azure Data Factory, Azure Databricks, Azure Data Lake, PySpark
- Advanced Databricks features and Delta Live Tables
- Real-time streaming with Apache Kafka
- MLOps and data pipeline orchestration
- Data Pipeline Architecture
- Cloud Data Engineering
- Big Data Processing
- Data Quality & Governance
- ETL/ELT Best Practices
- π§ Email: [email protected]
- πΌ LinkedIn: Mayur Rajurkar
- π Location: Pune, Maharashtra, India
I've processed terabytes of data and migrated legacy systems to modern cloud platforms, turning data chaos into organized insights!
π¬ "Data is the new oil, and I'm here to refine it!"

