I'm passionate about creating and optimizing the technological foundations that businesses rely on, turning complex infrastructure challenges into elegant, efficient solutions that scale seamlessly and perform reliably.
With 8+ years of experience in large-scale infrastructure and reliability engineering, I've mastered the art of building and maintaining infrastructure at scale. From orchestrating distributed cloud architectures to deploying AI infrastructure and GPU clusters, I combine technical precision with innovative solutions to drive operational excellence.
- Advanced Data Observability solutions using GCP tools for real-time analytics
- Large Language Models (LLMs) deployment on GPU-accelerated environments
- Scaling distributed systems with high availability architecture
- Distributed Key-Value Store: Highly available system with consistent hashing, replication, vector clocks
- Advanced Data Observability: Comprehensive solutions using GCP tools for real-time analytics
- Healthcare Workflow System: Streamlined patient care workflow system with regulatory compliance
- Microservices Monitoring: Implemented OpenTelemetry for distributed tracing across microservices
- NLP and LLM Deployment: Leveraging large language models for unstructured data processing
- Enterprise Cloud Migration: Major cloud migration to AWS using Infrastructure as Code
- Cloud Computing (AWS, GCP)
- Kubernetes & Containerization
- Observability & Monitoring
- Big Data & Data Engineering
- CI/CD Pipelines
- Large Scale Distributed Computing
- Machine Learning & AI Infrastructure
- Infrastructure as Code
- Email: [email protected]
- Phone: +5518041964
- LinkedIn: linkedin.com/in/andrew-espira
Every great system scales with purpose, let's build your future.
βοΈ From espirado