< Back to Careers

Site Reliability Engineer

The Media Trust is on a mission to make the Internet safer for everyone. Everyday we help identify and prevent sophisticated malware from infecting users all over the globe. Our products and services are relied on by some of the world’s most popular brands to secure their sites and ensure compliance with the growing number of consumer protection laws. The Engineering team needs you!

The Media Trust Engineering team is looking for an experienced Site Reliability Engineer to join our team responsible for building, managing, maintaining and scaling operational infrastructure on which our mission-critical services are deployed. We prize engineering professionals who are equipped with great technical skills and attitudes that motivate those around them. We seek individuals with a sense of purpose and for whom detail and craftsmanship are of their essence; individuals that are curious, demonstrate initiative, and love solving problems. The qualified candidate will have cross-discipline knowledge of networks and firewalls, DNS and load balancing, web servers and SSL, storage systems and databases. They must have extensive hands-on experience with Linux systems, navigating the command line like a pro. 


Responsibilities:

  • Installation, configuration and maintenance of production, QA and development environments in which TMT applications are deployed; environments include servers, operating systems, databases, network devices, and automation tools
  • Development of deployment automation tools using Ansible, Terraform, et al
  • Coordination with other technical staff to implement systems and software
  • Performance of daily operations support functions, including problem isolation and resolution
  • Development of monitoring and alerting systems (grafana/prometheus etc.)
  • Support of Infrastructure on-site at Data Centers
  • Shift rotational support for after hours alerting and problem resolution
  • Documentation of processes, procedures, configurations, and deployment plans
  • Regular reporting of progress to leadership

Qualifications:
 
Our ideal candidate will have a unique blend of technical skills and a desire to work as part of a team to grow themselves and those around them. The candidate must have a strong combination of the following:

  • Bachelor's degree in IT, Engineering or related field (equivalent experience/training will be considered)
  • 6+ years of experience developing automation and operating mission-critical systems
  • 2+ years of public/private/hybrid-cloud server builds, management, automation, and maintenance (AWS experience preferred) in an SRE/DevOps role 
  • Excellent understanding of Linux configuration and administration
  • Experience with configuring DNS servers (i.e. Bind)
  • Experience with configuring switches, firewalls and NAT
  • Understanding of layer 7 Load balancing
  • Knows how to install SSL certificates.
  • Python and/or PHP coding experience
  • Automation experience using Ansible and Jenkins
  • Understanding of infrastructure-as-code
  • Written and verbal communication skills – able to clearly and succinctly describe complex issues

Preferred qualifications:

  • Experience with PostgreSQL Databases
  • Experience racking and cabling of physical systems in a Data Center environment
  • Knowledge of storage systems and understanding of RAID
  • Familiarity with networking protocols and systems