Site Reliability Engineer II
Flywire Corp · New York, NY
Aug 2023 — Present
Owns reliability work that connects infrastructure architecture to business continuity: cloud connectivity, modernization, observability, and incident practices.
- Architected hybrid cloud connectivity between on-premise systems and AWS using IPsec, BGP, and hub-spoke VPC design, enabling critical legacy workloads to move into ECS during a legacy infrastructure failure event.
- Reduced annual infrastructure cost by 30%, decommissioned $250K in legacy infrastructure, and preserved $500K+ ARR by preventing disruption to revenue-critical systems.
- Led modernization from legacy .NET to .NET 8.0+ and built CI/CD pipelines with ECR and CodeArtifact, helping decouple a code monolith into 20+ cloud microservices.
- Reduced SRE manual intervention and deployment MTTR by >80% through modernization and pipeline improvements.
- Migrated manually configured telemetry into Datadog with Terraform-based IaC, giving teams version-controlled monitoring across 400+ resources.
- Defined incident metrics, SLO/SLA standards, and postmortem patterns that drove self-healing workflows and reduced repeated common incidents by 50%.