SRE aligned Infrastructure Architect Location : Milwaukee - WI
- Maintain high availability of Data Center infrastructure (compute, storage, network, backup).
- Design and implement robust monitoring and alerting systems (e.g., using Prometheus, Grafana, Dynatrace, etc.).
- Define and measure SLOs, SLIs, and SLAs for infrastructure services.
- Identify repetitive manual tasks (toil) and implement automation using tools like Ansible, Terraform, or scripting (Python, Shell, PowerShell).
- Enable infrastructure-as-code (IaC) practices for provisioning and configuration.
- Lead or participate in on-call rotations for Data Center incidents.
- Drive post-incident reviews using blameless retrospectives to uncover root causes and implement systemic fixes (RCAs with actionable insights).
- Develop runbooks, dashboards, and self-healing routines to reduce MTTR.
- Collaborate with architecture and engineering teams to design scalable and fault-tolerant infrastructure solutions.
- Lead capacity planning, hardware refresh cycles, and cost optimization initiatives.
- Evaluate and adopt emerging technologies for better efficiency and sustainability.
- Perform installation, configuration and technical administration activities for all client data center and cloud systems that include Unix/Linux operating systems, middleware instances, cloud compute and OpenShift nodes
- Carry out performance tuning for all client data center and cloud systems; activities include developing tuning plans, performing system tuning and load balancing to improve performance, correct transmission slowdowns, congestion, malfunctions, and abnormalities.
- Carry out capacity planning, forecasting and implementation for data center and cloud systems
- Simplify composite business tasks into tactical actions by implementing appropriate technologies, applying reasonable problem solving and logical thinking skills
- Enable a smooth liaison with clients in understanding, identifying areas of improvement and procuring new tasks that could add as additional business.
Qualifications
- Possess a strong understanding of SRE aligned infrastructure support operating model.
- Demonstrate knowledge of database basics to support infrastructure needs.
- Show proficiency in managing storage platforms for optimal performance.
- Understand network basics to ensure reliable connectivity.
- Be skilled in using migration tools for efficient data and application migrations. Should Have experience in migration planning to ensure smooth transitions.
- Experience in compute platforms to support business applications.
- Understand cloud basics to design scalable cloud solutions.
- Have excellent troubleshooting skills to resolve infrastructure issues.
- Possess strong documentation skills for recording infrastructure processes.
- Be able to collaborate effectively with cross-functional teams.
- Demonstrate the ability to monitor and maintain infrastructure performance
Certifications: Good to have cloud platform architecture certification, SRE certified professional or SRE foundation
|
Skill Proficiency |
|
(Top 5 Keywords or skills) |
Years of Experience |
Basic Knowledge |
Medium |
Expert |
Data Center Infrastructure |
6-8 |
|
|
X |
Cloud Systems & SRE |
4-5 |
|
|
X |
Linux Unix Middleware |
4-5 |
|
|
X |
Cloud Compute & Open Shift |
4-5 |
|
|
X |
|