Site Reliability Engineer (SRE) at TMS San Jose, CA (San Jose) Job at Itlearn360, San Jose, CA

NjNNKzlMYThPVWdYSk5raHp2aUlGZ1lldlE9PQ==
  • Itlearn360
  • San Jose, CA

Job Description

Site Reliability Engineer (SRE) job at TMS. San Jose, CA.

Job Title: Site Reliability Engineer (SRE) AWS & DevOps
Location: San Jose, CA (Local Only)
Duration: 12+ Months
Experience Needed: 12+ Years

Job Overview:

We are looking for a skilled Site Reliability Engineer (SRE) with strong expertise in AWS Cloud and DevOps practices to join our growing engineering team. This role is critical in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. You will focus on automation, observability, infrastructure as code, and operational excellence, working closely with development and operations teams to deliver a seamless production environment.

Key Responsibilities:

  • Design, implement, and manage highly available and scalable infrastructure on AWS .
  • Develop and maintain Infrastructure as Code using tools such as Terraform , CloudFormation , or AWS CDK .
  • Build, manage, and optimize CI/CD pipelines for fast and reliable application delivery using tools like Jenkins , GitLab CI/CD , CircleCI , or AWS CodePipeline .
  • Monitor system performance and availability using CloudWatch , Prometheus , Grafana , ELK , or Datadog .
  • Define and implement observability , logging , and alerting to ensure system health and quick incident resolution.
  • Improve system reliability through monitoring, incident response, post-mortems, and automation.
  • Support containerized applications using Docker , ECS , EKS , or Kubernetes .
  • Collaborate with development teams to ensure operational readiness and incorporate SRE principles into the software development lifecycle.
  • Implement security best practices across AWS resources, including IAM , encryption , VPC configuration , and compliance monitoring .
  • Participate in an on-call rotation and lead incident response and root cause analysis.

Required Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related technical field (or equivalent experience).
  • 3+ years of experience in a Site Reliability Engineering, DevOps, or Cloud Engineering role.
  • Strong hands-on experience with AWS services including EC2, S3, RDS, Lambda, IAM, VPC, CloudFront, etc.
  • Proficient in scripting/programming (e.g., Python, Bash, Go).
  • Expertise in CI/CD tools and automation frameworks .
  • Experience with monitoring and observability tools .
  • Strong understanding of networking , DNS , firewalls , and load balancing .
  • Solid grasp of DevOps methodologies and agile practices .
  • Familiarity with containerization and orchestration tools.

Preferred Qualifications:

  • AWS Certifications (e.g., AWS Certified DevOps Engineer , Solutions Architect ).
  • Experience with Kubernetes , Helm , or Service Mesh .
  • Background in security and compliance (SOC2, ISO 27001, etc.).
  • Exposure to chaos engineering , SRE principles , and error budgets .
  • Experience with configuration management tools like Ansible , Chef , or Puppet .
#J-18808-Ljbffr

Job Tags

Full time, Local area,

Similar Jobs

Quality Truck Care Center

TRACTOR SHOP FOREMAN 2ND SHIFT Job at Quality Truck Care Center

 ...Care Center is a Family-owned business in The Fox Valley since 1986! We represent Western Star, and Autocar. Certified in Detroit, Cummins, CAT engines, Allison and Detroit Transmissions and Axles. Our De Pere location is currently looking for a 2nd shift Working Foremen... 

Solomon Page

Travel Case Manager (Utilization Review) Job at Solomon Page

 ...by our people. Offering a comprehensive benefits package, travel nurses have immediate access to medical coverage and ReviveHealth virtual care. Additionally, you are offered access to dental and vision coverage, commuter benefits, a 401(k) plan, flexible spending, referral... 

FastSigns

Graphic Designer Job at FastSigns

 ...Graphic Designer This part-time Graphic Designer position is responsible for creating computer-generated full-color graphics and/or vinyl output that can be weeded, cut and applied or printed and mounted to a substrate. This may involve various levels of artistic creativity... 

PeopleReady

Siding Installer Job at PeopleReady

Job DescriptionAs Siding Installer with PeopleReady Skilled Trades, you'll support on large scale commercial projects. In this full-time role, you'll enjoy weekly pay, health insurance benefits and a flexible work schedule. Plus, the more hours you work, the more meaningful... 

Continuum Pediatric Nursing Services

Licensed Practical Nurse (LPN) Pediatric Private Duty Job at Continuum Pediatric Nursing Services

*Make a Difference. One Child at a Time.**Pediatric Home Care Nurse Chesapeake, VA| RN or LPN | Flexible Shifts* Are you a compassionate...  ...For over *30 years*, Continuum has been a trusted leader in *Private Duty Pediatric Nursing*. With *11 offices across 9 states*, we...