Sr. Site Reliability Engineer Job at ASK IT Solutions, Phoenix, AZ

WlFBNU9nbEVPdTVGTEJMTnhWUStnU01BNkE9PQ==
  • ASK IT Solutions
  • Phoenix, AZ

Job Description

Site Reliability Engineer

Location: Phoenix, AZ

(SRE) to join Cloud Operations and Observability team. You'll be instrumental in driving resiliency, performance, automation, and AI-driven observability across hybrid cloud environments (Azure and GCP). You will design, implement, and manage infrastructure with a strong focus on Kubernetes, and integrating AI/LLM solutions into observability and operational workflows.

Key Responsibilities:

  • Build and operate scalable, secure, and highly available infrastructure in Azure and GCP.
  • Design and maintain observability platforms leveraging Splunk, OpenTelemetry, and cloud-native monitoring tools.
  • Develop and support AI/LLM-driven automation solutions to improve incident triage, alert correlation, and root cause analysis.
  • Partner with application and data teams to define SLOs, SLIs, and error budgets.
  • Drive operational excellence through automation, chaos testing, and proactive reliability improvements.
  • Optimize Kubernetes environments (GKE/AKS) for performance, security, and cost-efficiency.
  • Integrate observability data pipelines with LLMs for anomaly detection, summarization, and proactive remediation.
  • Participate in on-call rotations, incident response, and postmortem reviews.
  • Implement runbooks, auto-remediation scripts, and AI copilots for operations.

Required Qualifications:

  • 8+ years of experience as an SRE.
  • Strong expertise in Azure and GCP cloud platforms (certifications a plus).
  • Proficient in Splunk (Enterprise + Observability) for monitoring, alerting, and log analytics.
  • In-depth knowledge of Kubernetes (AKS, GKE), Helm, and container lifecycle.
  • Familiarity with AI/ML and LLM-based tools (e.g., OpenAI, Hugging Face, Azure OpenAI) for observability or automation use cases.
  • Experience with CI/CD pipelines, GitOps, and secure deployment practices.
  • Programming/scripting skills in Python, Go, or Bash.
  • Strong understanding of SRE principles: SLAs, SLIs, SLOs, error budgets, and incident management.

Preferred Qualifications:

  • Experience building AI-enabled runbooks or copilots.
  • Exposure to FinOps or cost-optimization strategies in cloud environments.
  • Knowledge of distributed tracing and event correlation using OpenTelemetry.
  • Familiarity with Kafka, Pub/Sub, or other messaging systems for observability data.

Job Tags

Similar Jobs

Southeastern Archaeological Research, Inc. "SEARCH"

Principal Investigator (Terrestrial Archaeology - Southwest) Job at Southeastern Archaeological Research, Inc. "SEARCH"

 ...Job Title: Principal Investigator (Terrestrial Archaeology Southwest) Job Location: Remote - Southwestern US Job Code: PI-SW-2025 Job Link: SEARCH Job Postings - Direct Applications Position Information SEARCH is seeking a Principal Investigator to... 

Headhunter Insider

Certified Dialysis Technician/Administrative Assistant - Full-time Job at Headhunter Insider

Role Description This is a full-time, hybrid role located in Chandler, AZ, for a Senior Director of Internal Audit. The Senior Director of Internal Audit will oversee the internal audit function, manage audit projects, and ensure compliance with regulations and best...

Unilever

Senior Sales Manager - Walmart Job at Unilever

 ...ORGANIZATION - THE COLLECTIVE As part of working together to achieve these goals we are...  ...the different operating companies. OUR HOME-BASED APPROACH: While working for the...  ...ACCOUNTABILITIES ? Reporting to our Director of Walmart for the Wellbeing Collective (WBC... 

Taylor & Francis Group

Publisher (STM) Job at Taylor & Francis Group

 ...Reference #: 744000091453736 Company Description Informa is a leading academic publishing, business intelligence, knowledge and events business, creating unique content and connectivity for customers all over the world. It is listed on the London Stock Exchange and... 

Huntington Learning Center of Turnersville

Tutor/Teacher Job at Huntington Learning Center of Turnersville

 ...study skills, and SAT/ACT Prep. NOTE: Tutoring occurs in-person at our center in Turnersville, NJ. Are you a current or retired teacher looking for an additional paid way to be involved in the education field? Are you a graduate student working on your Master's...