Robert J. Lambert III

New York · [email protected]

Hi there, I'm Robert Lambert. An experienced Site Reliability and Systems Engineer.

I wrote my first computer program over 20 years ago on a Commodore 64 in BASIC, which inspired me to gain my college accreditation in Computer Information Systems, and pursue a professional career in this field. Now, with over 13 years of professional, real-world experience, I find myself constantly in search of knowledge and new skill sets. Most recently, I’ve taken pride in working on multiple new teams as the need arises to assist in improving the stability and reliability of systems at DigitalOcean.

I am currently working as a Senior Infrastructure Engineer II @ DigitalOcean
My GitHub Overview

Socialization

skills

Professional and Social

Effective communication Team player Strong problem solver

Languages and Tools

C/C++ Rust Go Java Python Nix git linux bash javascript

Operating Systems

NixOS Fedora Debian/Ubuntu

Configuration and Infrastructure Management

Chef Ansible Terraform Consul Vault Nomad

Data Management

MySQL/MariaDB Sqlite postgresql prometheus Victoria Metrics

Observability and Monitoring

grafana kibana Lightstep sentry Victoria Metrics

Architecture

Distributed Systems System Resiliancy Secure System Design REST RPC

Containers & Cloud

Docker DigitalOcean

opensource

A collection of efforts to which I contributed, but did not create. Contributing back to Open Source projects is a strong passion of mine, and requires a considerate approach to learn norms, standards and approach for each community for a successful merge!

Chipate is a simple CHIP-8 emulator I created as a quick introduction in to building emulators. I also used this project to help me learn the Rust Programming language.

Rust Emulation

A prometheus exporter for CINC Server and CINC compatible configuration management solutions.

Rust CINC

experience

DigitalOcean

• Built my first rust crate to fill a gap in monitoring in our multi-region CINC server environment
• Designed and Built a next-gen bare metal deployment system built on top of Nomad
• Help maintain the internal fork of our terraform provider
• Setup the internal Terraform Provider Repository in Artifactory
• Built pull based distributed CINC model to reduce load on CINC servers
• Integrated Sentry in to numerous services and tools under our ownership
• Integrated Lightstep in to numerous services and tools under our ownership

DigitalOcean

• Monitor, troubleshoot, maintain, extend and improve the legacy Mesos stack
• Deploy changes to thousands of containers running on this cluster using Ansible, and Chef
• Built custom automated reports against this platform using Bash, Ansible and Python
• Managed deployment of configs via Ansible the current stack runs multiple high priority internal workloads
• Improvement in monitoring with custom Hadoop/HDFS exporters
• Provide support for one of our legacy Kafka platforms as well as refresh monitoring while developers work to migrate to a newer Kafka platform
• Create design documents, architecture designs, and other high level design documents to provide information to management to help make critical design decisions for our environments
• Determine RCA on issues regarding Chef builds and deployments
• Built Chef Cookbooks for the deployment of Chef Automate HA cluster which included ElasticSearch, PostgreSQL and Nginx components
• Built internal service utilizing Okta and Vault allowing new DigitalOcean Developers to self service deploy their own chef user
• Built Chef module using Optimizely for scheduling and tracking Chef deployment changes across our entire fleet of machines
• Built automation around tasks that wasted engineer time and that we’re prone to error
• Built reusable Terraform modules for internal users to deploy private virtual machines to mange internal tooling, complete with
• Built generic workflow for Atlantis allowing internal users to move to a more secure (using Vault) automated Terraform provisioning experience

• Represented LinkedIn’s Espresso Team in cross-organizational stand-ups to provide both executions of complex tasks, and act as a single point of contact for our team
• Maintained multiple internal products which included upgrading multiple legacy codebases to Python3 and Go
• Designed and implemented global availability, cross data center consistency, and cross data center latency metrics and SLAs for all Espresso clusters
• Trained and onboarded new Espresso SREs, which enabled them to contribute to our ecosystem in under 3 months
Above and beyond
• Completed the Machine Learning course offered through Stanford Online: Verification
• Member of the interview and hiring committees of SREs, Senior SREs, and Managers
• Organized, hosted, and presented at the first Espresso SRE Hackday
• Taught ongoing SRE Bootcamp series, as part of the new hire onboarding process

• Collaborated across teams to build the Espresso Data layer for a new logical data center
• Provided documentation improvements for Espresso cluster creation, consuming tribal knowledge from multiple sources into useful runbooks
• Designed, built and maintained 120+ node ELK cluster handling all of Espresso’s logging data across 4 data centers
• Built service overview dashboards for each Espresso owned service reducing both MTTR and MTTD
• Acted as lead on 80% of all Espresso GDPR related horizontal initiatives, overseeing both the design and resolution
• Oversaw the data migrations of Espresso clusters, which involved coordinating between teams both onsite and offshore

CDPHP

• Maintained a balance of operational and development work to ensure critical business services were responsive
• Provided engineering and development teams with custom tooling built off known frameworks
• Designed Ansible and CFengine code to automate operational and system level tasks to improve performance, provide higher reliability and scale when needed
• Designed and implemented ELK log monitoring solution, with FileBeats, NXLog, Kafka, Logstash, ElasticSearch, Kibana

CDPHP

• Designed and implemented custom integration systems between our enterprise monitoring solutions, CMDB, operator runbook, ITSM tool, and CFEngine our configuration management platform
• Integrated APM solution AppDynamics with existing Opsview monitoring environment
• Focused on the design and implementation of the Mulesoft Enterprise Service Bus, which had both on-premise and AWS components

CDPHP

Built CFEngine solution backed with Ansible ad-hoc tasks for constructing simple reports

• Replaced Tivoli Enterprise Portal with Nagios fork Opsview
• Handled day to day operational work such as: diagnosing web performance issues, code misconfigurations, as well as other internal infrastructure problems

Xerox Litigation Services

• Installed and maintained servers in multi-site tier 3 and 4 data centers located internationally
• Traveled overseas to work directly with clients on site to provide support for the custom virtualized appliance
• Designed custom software solutions to reduce costs of operational expenses by utilizing open source solutions rather than purchasing an enterprise solution
• Engineered IT solutions to accommodate changing environments and project requirements from clients while actively working with other departments to resolve any issues in a timely manner

Xerox Litigation Services

• Provided 24-hour support for Clients and Client Managers for all of the XLS products and infrastructure
• Monitored logs and custom alerting software to initiate the resolution of any potential or incipient issues
• Programmed enhancements for internal monitoring software such as upgrading the web application to use AJAX over the old static layout