Senior Virtualization Engineer - Remote

Department: Engineering
Location: Remote, New York
Updated on: August 09, 2022

Back to Open Positions

This role can be done, remotely. 

MediaMath’s infrastructure footprint is reflective of the challenges of a real-time advertising
stack – a hybrid environment of multiple globally placed datacenters and the AWS cloud. We
support a broad range of services, from low latency bidding processes handling millions of
transactions per second, through to big data storage and analytics, as well as client-facing UI and
reporting solutions. Each has its own unique operational challenges, and the Infrastructure team
is a key partner in ensuring these workloads are managed in scalable, maintainable, cost-
effective ways.
As a key member of the Infrastructure team, the Virtualization SME will be responsible for the
design, build, and ongoing development of MediaMath’s on-premise virtualization platform,
growing it from inception to full production scale. This engineer will work directly with
development groups, gathering requirements and supporting the migration of business-critical
workloads from bare metal and existing cloud platforms to new virtualized environments. The
SME will also be involved in the continued development and management of MediaMath’s on-
premise & AWS-based Kubernetes offerings.
Take a lead role in the formulation of MediaMath’s virtualization strategy, working with
stakeholders across the organization to establish pain points, requirements, and priorities
for this new platform.
Assist in the design, build and testing of potential on-premise virtualization solutions.
Develop standards and guiderails around the production usage of virtualized resources, to
aid in onboarding and controlled growth of the platform.
As a business-critical offering, implement comprehensive monitoring & alerting for the
various components of the virtualization platform.
Regularly assess capacity needs for the virtualization platform, with consideration to
upcoming roadmap items and regular growth.
Act as a Virtualization SME within the Operations organization, providing guidance and
oversight to others in-team and across the development community.
Conduct training sessions to share knowledge with peers and development groups.
Work with Operations leadership to build out internal processes supporting our
virtualization offering, including - growth & onboarding, resource tracking & capacity
management, maintenance events, and incident mitigation.
Assist in the continued development and adoption of MediaMath’s Kubernetes clusters.
Participate in on-call rotation as part of the larger operational team.
Role specific experience:
Advanced-level technical knowledge of VMware virtualization technologies such
as vCenter, vSphere, ESXi, DRS, vMotion, vSphere Replication, HA, FT, SRM,
vSAN, NSX, VMware Life Cycle Manager.
Knowledge of VMware virtualization concepts, including provisioning and
automation methods using the API.
Experience overseeing a major infrastructure project, from initial planning &
testing, through to production.
Hands-on involvement in the end-to-end build of a virtualization solution,
preferably in a globally-distributed on-premise environment.
Experience working with business-critical applications deployed within a
virtualized environment.
Experience supporting low-latency, globally distributed services at scale.
Experience working with private datacenter infrastructure (“on-premise” servers)
Working knowledge of Kubernetes-based deployments.
General Ops experience:
Proficiency with Linux system administration (Debian, Ubuntu, CentOS)
Proficiency with basic AWS administration (IAM, EC2, Networking, cost
Proficiency with scripting & basic coding (Python, Ruby, Golang, Bash)
Understanding of networking fundamentals, including application layer protocols
(HTTP, SSL, SSH), load balancing solutions (LVS, NGINX), and DNS
Additional preferable qualifications:
Experience leveraging configuration management and deployment automation
toolsets (Chef, Salt, Ansible)
Experience leveraging common build and continuous-integration workflows
(CircleCI, Jenkins, Artifactory)
Experience collecting and analyzing metrics for service-level monitoring using
Prometheus and Grafana
Experience managing Kubernetes, PKS, or Tanzu environments.
Skills & Approach:
Apply practical solutions to real world problems, with a “hands-on” approach.
Think strategically, understand business context, and make collaborative decisions.
Gather information & recommend courses of action clearly and confidently, with a high
degree of comfort presenting ideas to departmental & organizational leadership.
Foster open communication, speak with impact, listen to others, and write effectively.
Partner with internal clients in development teams to address issues, consult on solutions
and plan for future needs.
Communicate often complex ideas to non-technical stakeholders, including product and
support teams.
Mentor and provide guidance to junior engineers, both technically and professionally.
Adhere to, streamline and help improve team processes for work tracking, knowledge
sharing, incident response, and cross-org communication.