Site Reliability, Cloud & DevOps Engineer with 10+ years of experience managing high-volume, customer-facing services in dynamic environments. Expertise in Amazon Web Services (AWS), Akamai, Linode, and Infrastructure as Code (CloudFormation, Terraform & Ansible). Passionate about improving service reliability through automation and monitoring with tools such as NewRelic, App Dynamics & Datadog. Proficient in Python, PHP, and Bash. Experienced technical lead and mentor with a strong background in incident response, GitOps practices (IaC, SCM, CI/CD), and performance testing.
- 🔭 I’m currently working on security automation for Ubuntu.
- 🌱 I’m currently learning FIDO2, U2F, Smart card, OTP, OpenPGP 3 standards and protocol implementation for hardware devices using SoloKeys and Yubico products.
- 💬 Ask me about anything, really, and if I don't know we can learn together.
- 📫 How to reach me: thesweeps@scourgethetracker.org
- Led a microservice migration from on-premises to AWS, driving cloud transformation for improved scalability and performance.
- Reduced vulnerability remediation time by 99% in the first 30 days through leadership in internal and external security audits.
- Served as the Technical Lead for cross-team support, ensuring smooth collaboration and reducing downtime for production systems.
- Reduced incident detection and response time by 87%, improving site reliability by 80% within the first 180 days.
- Led the implementation of CI/CD pipelines, reducing deploy times by 90%.
- Developed the first DevOps and Developer Relations Wiki, improving collaboration across teams.
- Managed a $28-million revenue-generating infrastructure at Rue La La, demonstrating leadership in high-stakes environments.
- Cloud Platforms: AWS (EC2, ECS, Lambda, S3), Akamai, Linode
- Infrastructure as Code (IaC): CloudFormation, Terraform, Ansible
- Programming Languages: Python, PHP, Bash
- DevOps Practices: CI/CD, GitOps, Incident Management
- Monitoring Tools: NewRelic, AppDynamics, Datadog
- Linux System Administration: Red Hat Enterprise Linux (RHEL), Networking
- Security: PCI, ISO Audits, Vulnerability Management
- Automation: Ansible, ETL Processes
- Version Control & CI/CD: Git, Jenkins, GitOps
SAP Concur
09/2013 – 06/2024
- Led a microservice migration from on-premises to AWS, driving cloud transformation and improving scalability.
- Reduced vulnerability remediation time by 99% in the first 30 days.
- Served as Technical Lead for cross-team support, collaborating across development teams to improve uptime and performance.
- Reduced incident detection and response times by 87% through automation and monitoring solutions.
- Led the implementation of CI/CD pipelines, reducing deploy times by 90%.
Path
11/2012 – 09/2013
- Led virtualization platform migration to ESX, improving infrastructure stability.
- Developed configuration management solutions that reduced repeat problem incidents by 80% in the first 180 days.
- Served as the technical lead for PCI and ISO audit teams, ensuring security compliance.
FuseSource
2010 – 2012
- Completed 70% of a stabilization project on aging infrastructure, reducing Java, Rails, and connectivity errors.
- Developed custom DevOps tooling using Python, Bash, and Ruby to automate key infrastructure processes.
GSI Commerce (Rue La La)
2012 – 2012
- Managed a $28-million revenue-generating infrastructure, ensuring operational excellence.
- Architected load balancer, firewall, and RHEL deployment methods for 0% downtime.
- Led PCI DSS v2.0 compliance initiative to secure customer data and maintain regulatory compliance.
- Led two datacenter rebuilds, optimizing infrastructure capacity.
- Contributed to various DevOps and Cloud projects on GitHub, including tools for automation, infrastructure management, and performance monitoring.
Feel free to reach out to me for collaborations or if you need guidance in the DevOps and clo infrastructure space. Let's innovate together!