Plano, TX
Site Reliability Engineer
Cognizant’s Digital Engineering practice is seeking a highly qualified Site Reliability engineering experience. As a Site Reliability Engineer (SRE) we want the person to be responsible for both uplifting and maintaining our evolving technology platforms, infrastructure and technology controls. As a valued colleague on our team, you will collaborate with team in designing, producing, testing, or implementing moderately complex software, technology, or processes, as well as create and maintain IT architecture, large scale data stores, and cloud-based systems. You will apply your expertise in software and systems engineering to ensure that both our internally critical and externally visible systems meet the appropriate performance needs of our users. You will serve as a champion of service availability, efficiency, automation, monitoring, and capacity management. Specifically, you will leverage your skills and experience in Amazon Web Services, software development with Java and/or Python, customization in Splunk and/or Dynatrace, and automation in Selenium and/or Blue Prism (among others) to enable increased feature velocity and continuous improvement. Service Reliability Engineering (SRE) Senior Associate role will offer you the flexibility to make each day your own, while working alongside people who care, so that you can deliver on the following responsibilities: Independently determine the needs of the customer and create solution frameworks. Design and develop moderately complex software solutions to meet needs. Use a process-driven approach in designing and developing solutions. Implement new software technology and coordinate end-to-end tasks across the team. May maintain or oversee the maintenance of existing software. Location: Plano, TX or Remote You must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future. Cognizant will not sponsor H-1B or other U.S. work authorization for this role. Minimum Required Experience: 2+ years of relevant work experience Required Experience: Bachelor’s Degree in Computer Science, Management Information Systems (MIS), Systems Engineering, or related field Certification in AWS Solutions Architect Associate or Developer Associate, Splunk Certification Developer, or Sun Certified Java Developer Experience with Scaled Agile Framework (SAFe) and Jira / Confluence Experience with AWS Elastic Container Service (ECS) and Fargate Experience with AWS CloudWatch, Splunk, Dynatrace, CatchPoint, and / or Datadog Experience with application production / operations support, including incident response, problem management, runbooks, and knowledge articles using tools such as ServiceNow, Moogsoft, StatusHub, and / or Blameless Experience with post-mortems, root-cause analysis (RCA), and / or AWS Correction-of-Errors (CoE) Understanding of error budgeting and toil reduction Experience creating disaster recovery plans and executing failover tests Experience with capacity planning and performance testing / engineering tools, such as JMeter and / or LoadRunner Experience with Failure Mode Effect Analysis (FMEA) and Chaos testing / engineering tools, such as Gremlin, Chaos Monkey, Chaos Toolkit, AWS Fault Injection Service (FIS) Experience with CI/CD / DevOps deployment tools, such as Jenkins, Terraform, UrbanCode Deploy (UCD), and / or GitLab Experience with programming in Java and / or Python Understanding J2EE frameworks, such as JavaScript, Spring Boot / Spring Cloud, and REST Understanding of Java performance monitors (JVM, GC, Heap Size, Message Broker) Experience with building automation solutions using tools such as BluePrism and / or Selenium Understanding of fault tolerant / resilience architectural design patterns, such as Bulkhead, Circuit-breaker, Retry, Timeout, etc Skills Required: 2+ years of experience supporting cloud applications and technologies, including containerization, virtualization, microservices, and server-less architecture 2+ years of experience working in an Agile, Scrum, or Kanban environment 2+ years of experience with application monitoring / observability, including building dashboards, establishing service level indicators / objectives / agreements (SLIs / SLOs / SLAs), and logging / tracing Excellent problem-solving skills and proactivity in resolving issues / blockers Excellent verbal / written communication skills, relationship management skills, and ability to collaborate with multiple stakeholders Eagerness to learn and ability to work independently with minimal guidance Understanding of IT Service Management (ITSM) Understanding of DevOps and CI/CD pipelines Why Choose Cognizant? It takes a lot to succeed in today’s fast-paced market, and Cognizant Technology Solutions has become a leader in the industry. We love big ideas and even bigger dreams! We stand out because we put human experiences at the core. Our associates enjoy robust benefits and training opportunities from our industry-recognized, award-winning Academy team. You will have access to hundreds of technical trainings to keep your skillsets fresh and have opportunities to acquire certifications on the newest technologies. Everything we do at Cognizant we do with passion—for our clients (fortune 100 companies), our communities, and our organization. It’s the defining attribute that we look for in our people. If you love ambiguity, excited by change, and excel through autonomy, we’d love to hear from you! #LI-DJ3 #CB #Ind123 Technical Skills SNo Primary Skill Proficiency Level * Rqrd./Dsrd. 1 Resilience PL3 Required * Proficiency Legends Proficiency Level Generic Reference PL1 The associate has basic awareness and comprehension of the skill and is in the process of acquiring this skill through various channels. PL2 The associate possesses working knowledge of the skill, and can actively and independently apply this skill in engagements and projects. PL3 The associate has comprehensive, in-depth and specialized knowledge of the skill. She / he has extensively demonstrated successful application of the skill in engagements or projects. PL4 The associate can function as a subject matter expert for this skill. The associate is capable of analyzing, evaluating and synthesizing solutions using the skill.
Recommended Skills
- Agile Methodology
- Architecture
- Automation
- Blue Prism
- Building Automation
- Business Informatics
Browse other jobs