Resume

Gable

Senior DevOps Engineer -- July 2023 - Present

  • Owner of DevOps function for seed-stage startup, ensuring organizational needs and customer setups are delivered responsively and reliably.
  • Responsible for in-house platform of AWS infrastructure and customer deployment using Github Actions and Terraform with Atlantis automation.
  • Provide assistance consultation concerning development team's branching strategy and software development/delivery lifecycle.
  • Develop and maintain ad-hoc tooling to address AWS service shortcomings.
  • Operation and management of services including Github, AWS Organization/Control Tower, AWS Cognito, Amazon ECS/Fargate,

Highspot

Senior DevOps Engineer -- Aug 2020 - June 2023

  • Terraform Automation Infrastructure Code Lead, supporting a multi-pillar/department Terraform deployment consisting of several dozen Terraform versioned modules across more than a dozen AWS accounts using Github and Atlantis, integrated with Okta SSO, S3 Terraform State File Storage, and standardized IAM roles. Terraform deployments manage resources consisting of multiple millions of dollars in annual AWS spend.
  • Develop and Deployed service-focused Terraform system using Terragrunt and Atlantis, providing Developer teams with a standardized self-service Infrastructure as Code experience oriented towards their needs.
  • Provide depth and operational experience to the Kubernetes build-out teams operating the Engineering Unified EKS Platform.
  • Develop and maintain Python-based automation tooling to solve critical organization needs regarding secrets management and deployment.
  • Build and Maintain AWS Simple Email Service (SES) infrastructure for mission-critical sales pitch application functionality.
  • Developed DevOps Onboarding program to reduce engineering ramp-up time from 6-8 months to 4 weeks.
  • Participated in organization-wide on-call rotations, responding to incidents as primary on-call or as part of a cross-functional team, preparing root cause analyses for major production incidents for infrastructure owned by my team.
  • Coordinated with Engineering and Product Managers to balance prioritization for incoming feature requests and internally generated site reliability work.
  • Operation and management of various automation and tooling such as Spinnaker, Buildkite, AWS MSK/Kafka, AWS SSO & IAM integration, ArgoCD, Managed Chef Server.

Conversica

Site Reliability Engineer III -- Nov 2019 - Jul 2020

  • Lead SRE for a Site Reliability group in transition; balancing sprint planning, operational needs, vendor management, and coordination of long-term goals with Engineering Management.
  • Cared for existing multi-tier Kubernetes deployments using AWS EKS and assisting engineering staff in troubleshooting various AWS dependencies and Gitlab pipelines.
  • Led and built user management processes in furtherance of SOC2 certification efforts.
  • Mentored new hire Engineers, Directors, and Managers on existing infrastructure, risks, and gaps.
  • Performed in-place update of Infrastructure maintenance code from Ansible 2.0/2.2 to Ansible 2.9

Sprout Social

Staff Site Reliability Engineer -- Nov 2019 - Nov 2020

  • Synthesized and Implemented Blameless Incident Post-Mortem Process to enable product development teams to clearly communicate incident impact, scope, technical triggers, action item status, and non-technical underlying causes.
  • Mentor SREs and provide feedback to management regarding sustainable processes and approaches.
  • Build effective working relationships with engineers working in distant locales, providing feedback via code reviews, squad retrospectives, management feedback, and 1:1 technical pairing.
  • Implement, Maintain, and Decommission Infrastructure as a Service for the Product Teams using Chef and Terraform.
  • Assist Product Teams to enable management and configuration of their Infrastructure using IaaC concepts and Configuration Management.
  • Provide technical depth for incident analysis and resolution in legacy systems.

Skytap

Senior Operations Engineer (Nov 2014 - Feb 2017)

Principal Operations Engineer/Operations Engineering Manager (Feb 2017 - Nov 2019)

  • Identify business requirements and deliver mission-critical solutions in-scope, on-time, and on-budget to enable internal customers to meet and exceed Skytap's customer SLAs.
  • Manage services at scale with automation and tools to ensure that services are not only highly available but may be serviced and correctly understood by all members of a team. (Puppet, Ansible, Python, Jenkins)
  • Develop policies, training, and reporting to appropriately support and manage services throughout the product lifecycle.
  • Develop metrics and monitoring to enhance observability for stakeholders and improve planning for future capacity requirements.
  • Provide Technical Supervision, Product Management, Project Management, and HR Supervision for service teams of up to 5 engineers of varying backgrounds, skill sets, and experience.
  • Developed, supported, and evangelized use of automation that resulted in reduction of datacenter capacity deployment time from 4 hours per server to 17 minutes as measured by delivery to carrying revenue-generating customer load.
  • Experience with Zabbix, Gerrit/Git, Mercurial, Puppet (Open Source), PowerDNS, Foreman, NGINX/Apache, DHCP/DNS/TFTP and similar TCP/IP based network and provisioning services.