Company: Paycom
Location: Oklahoma City, OK
Career Level: Mid-Senior Level
Industries: Technology, Software, IT, Electronics

Description

Site reliability engineers are dedicated full-time to creating software tools, metrics, and processes that improve the reliability of applications, sites, and systems in production. They are primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. Additionally, the Senior Site Reliability Engineer will mentor junior team members.

RESPONSIBILITIES

  • Architect solutions that proactively reduce or eliminate errors and incidents in production systems. Review and approve software and processes created by junior site reliability engineers.
  • Review code and approve error logging and monitoring in new software development across all company-developed applications.
  • Take responsibility for removing, isolating, or remediating errors, debugs, warnings, or other kinds of messages from existing logs to improve overall log content and usefulness.
  • Establish, implement, and track reliability metrics (MTTR, MTTD, MTBF).
  • Effectively respond to escalated site reliability issues at any time of day while on call.
  • Conduct regular research on best practices and new technology for monitoring, alerting, error tracking and detection, and application performance.
  • Mentor and guide junior site reliability engineers.


Qualifications

Education/Certification:

  • Bachelor's degree in Computer Science, MIS or related field

Experience:

  • 5+ years' experience utilizing alerting and telemetry tools such as Grafana, Prometheus, Splunk, Dynatrace and others
  • 3+ years' experience with Splunk SPL
  • 3+ years' software development experience with at least one programming language such as PHP, Python, Java, or .NET
  • 2+ years' experience creating and tuning analytical tools in Splunk

PREFERRED QUALIFICATIONS

Experience:

  • 2+ years' experience with CI/CD
  • 2+ years' experience with containers and container orchestration tools such as Docker and Kubernetes
  • 2+ years' experience with Prometheus PromQL
  • 2+ years' experience with SQL

Skills/Abilities:

  • Troubleshooting in a large-scale networked environment
  • Knowledge of Paycom's applications, systems, and databases

Paycom is an equal opportunity employer and prohibits discrimination and harassment of any kind. Paycom makes employment decisions on the basis of business needs, job requirements, individual qualifications and merit. Paycom wants to have the best available people in every job. Therefore, Paycom does not permit its employees to harass, discriminate or retaliate against other employees or applicants because of race, color, religion, sex, sexual orientation, gender identity, pregnancy, national origin, military and veteran status, age, physical or mental disability, genetic characteristic, reproductive health decisions, family or parental status or any other consideration made unlawful by applicable laws. Equal employment opportunity will be extended to all persons in all aspects of the employer-employee relationship. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, training, promotion, discipline, compensation, benefits, and separation of employment. The Human Resources Department has overall responsibility for this policy and maintains reporting and monitoring procedures. Any questions or concerns should be referred to the Human Resources Department. To learn more about Paycom's affirmative action policy or equal employment opportunity, or to request an accommodation, see paycom.com/careers/eeoc

#LI-Hybrid

