New
Site Reliability Engineer
![]() | |
![]() | |
![]() | |
![]() United States, Texas, Dallas | |
![]() | |
*Description*
The Site Reliability Engineer (SRE) involves identifying and delivering automation solutions to ensure high availability and resiliency, leveraging expertise in software development, complexity analysis, and scalable system design. The SRE will work closely with other engineering teams to ensure services and systems are highly stable and performant, meeting the expectations of business partners and end users. JOB DUTIES * Work with architecture and development teams to ensure applications are highly available, reliable, and performant at a global scale. * Partner with the architecture team to ensure operability, measurability, and manageability are integrated into business features and enablers. * Collaborate with product owners and managers to establish service level objectives (SLOs) for applications and define consequences if objectives are not met. * Work with development team members to identify monitoring gaps, improve application performance, and assist with troubleshooting issues. * Drive Root Cause Analysis (RCA) of production issues and other failures within the product software, pipeline, or other DevOps support processes or technology. * Design, build, and advocate for automated solutions to optimize application/service/platform uptime with minimal human intervention. * Participate in an on-call rotation to support troubleshooting and communication efforts outside of normal business hours. * Create and implement standards and best practices, driving adoption across development teams and external vendors as applicable. * Perform other duties as assigned. * Ensure compliance with all company policies and procedures. *Skills* Devops, Python, Cloud, Terraform, Kubernetes, Azure *Top Skills Details* Devops,Python,Cloud,Terraform,Kubernetes,Azure *Additional Skills & Qualifications* What makes you a dream candidate? * Strong collaboration and communication skills. * A proactive approach to problem-solving and continuous improvement. * Passion for automation and operational excellence. * Deep expertise in cloud technologies and software development, with a strong technical background. * Expertise in defining, implementing, and evaluating Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and associated consequences. * Strong skills in performing Root Cause Analysis (RCA) and Problem Management. * Extensive experience in cloud native applications Azure/AWS (monitoring, networking, containerization, infrastructure). * Proficiency in containerization technologies such as Azure Kubernetes Service, Kubernetes (open source), and Docker. * Knowledge of metrics and monitoring tools like Azure Application Insights and Azure Monitor. * Familiarity with networking technologies relevant to Azure and AWS, including Azure DNS, Virtual Networks, Azure API Manager, Azure Application Gateway, Akamai WAF/CDN, AWS Route 53, AWS VPC, AWS API Gateway, and AWS CloudFront. * Strong experience with Terraform for infrastructure as code. * Ability to establish and maintain a culture of learning through the development and sharing of skills, knowledge, processes, and tools; combat traditional silos that create "us and them" environments. *Experience Level* Intermediate Level *Pay and Benefits* The pay range for this position is $60.00 - $75.00/hr. Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following: * Medical, dental & vision * Critical Illness, Accident, and Hospital * 401(k) Retirement Plan - Pre-tax and Roth post-tax contributions available * Life Insurance (Voluntary Life & AD&D for the employee and dependents) * Short and long-term disability * Health Spending Account (HSA) * Transportation benefits * Employee Assistance Program * Time Off/Leave (PTO, Vacation or Sick Leave) *Workplace Type* This is a hybrid position in Dallas,TX. *Application Deadline* This position is anticipated to close on Sep 5, 2025. h4>About TEKsystems: We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company. The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law. About TEKsystems and TEKsystems Global Services We're a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We're a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We're strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We're building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com. The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law. |