Guide and shape the future of technology at a globally recognized firm, driven by pride in ownership.
As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Infrastructure & Production Management sector of Consumer & Community Banking, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team's strategic planning, driving continual improvement in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation of the software in your area. You act in a blameless, data-driven manner and navigate difficult situations with composure and tact.
Job responsibilities
- Demonstrates expertise in site reliability principles and an understanding of the fine balance between features, efficiency, and stability
- Effectively negotiates with peers and executive partners to ensure optimal outcomes for all
- Drives the adoption of site reliability practices throughout the organization
- Ensures your teams apply site reliability best practices and can demonstrate this empirically through stability and reliability metrics
- Drives a culture of continual improvement and solicits real-time feedback to improve the customer's experience
- Ensures your team collaborates with other teams within your group's specialization and avoids duplication of work where possible
- Follows blameless, data-driven, post-mortem strategies and conducts regular team debriefs to enable learning from both successes and mistakes
- Provides personalized coaching for entry to mid-level team members
- Ensures your team documents and shares their knowledge and innovations via internal forums, communities of practice, guilds, and conferences
Required qualifications, capabilities, and skills
- Formal training or certification in software engineering concepts and 5+ years of applied experience; plus 2+ years leading technologists to manage and solve complex technical items within your domain.
- Advanced proficiency in SRE culture and principles, with a track record of implementing SRE practices across application and platform teams while avoiding common pitfalls.
- Strong observability fundamentals: define and measure SLIs, set and manage SLOs and error budgets, build actionable alerting and dashboards; hands-on experience with Dynatrace and Splunk.
- Proven resiliency engineering: capacity planning, failure mode analysis, fault-tolerant design (circuit breakers, retries, bulkheads), disaster recovery strategies, and running game days.
- Proficiency in at least one programming language (e.g., Python, Java Spring Boot, .NET) to build production-grade automation and tooling; deeper coding skills are a plus but not a hard requirement.
- Proficiency in CI/CD and Infrastructure as Code (e.g., Jenkins, GitLab, Terraform), including pipeline design, environment promotion, and secrets/artifact management.
- Experience with containers and orchestration (e.g., Docker, Kubernetes, ECS), including image hardening, Helm, and operational runbooks.
- Ability to troubleshoot common networking technologies and issues (TCP/IP, DNS, HTTP, proxies, load balancers, TLS, routing, VPCs/subnets, firewalls).
- Demonstrated proficiency operating cloud-scale, distributed systems within a technical discipline (e.g., cloud platforms), with experience at firmwide or similarly large scale.
- Ability to influence team culture by championing innovation and change; experience mentoring and leading technologists (including hiring, developing, and recognizing talent) as an individual contributor.
- Automation mindset focused on reducing toil (target ~25% of time), building self-service capabilities, and codifying operational procedures into code.
Preferred qualifications, capabilities, and skills
- Experience in banking/financial services and familiarity with risk and control expectations in regulated environments.
- AWS experience; AWS Certified Solutions Architect (Associate or Professional) preferred.
- Advanced observability ecosystem knowledge beyond Dynatrace/Splunk (e.g., OpenTelemetry, Prometheus, Grafana, ELK).
- Experience scaling SRE practices across multiple teams/platforms, including playbooks, SRE onboarding, and maturity assessments.
- Exposure to payments concepts and platforms (e.g., ISO 20022, SWIFT, real-time payments) with willingness to learn; not required for the role.
- Experience with chaos engineering tools (e.g., Gremlin, Litmus, Chaos Mesh) and integrating resilience tests into CI/CD pipelines.
- Proven cloud cost/performance optimization in production (autoscaling, caching, capacity management, and efficiency tuning).
Chase is a leading financial services firm, helping nearly half of America's households and small businesses achieve their financial goals through a broad range of financial products. Our mission is to create engaged, lifelong relationships and put our customers at the heart of everything we do. We also help small businesses, nonprofits and cities grow, delivering solutions to solve all their financial needs.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
Equal Opportunity Employer/Disability/Veterans