This role sits within the Site Reliability Engineering team and is part of the wider Cloud Services team. Our suite of SaaS, distributed systems and product integrations help our internal stakeholders run their critical business operations and provide customers in turn with industry leading threat detection technology products. You'll play a key role in the formation of a new area within Kroll: that aims to drive operational excellence and customer focus into the operation of our SaaS hosted application suite.
As an Associate Cloud Engineer, you will be using your skills and expertise on cloud platforms to maintain and improve our cloud infrastructure running on Azure & AWS, orchestrate deployments and support our industry leading SaaS solution. As part of the SRE team you will be an integral part of ensuring our platforms are highly available and resilient, through continual monitoring and providing improvement suggestions. You will work closely with engineering teams in Development and Delivery to uphold contracted Service Level Objectives (SLOs). You will be tasked with ensuring our internal and externally available systems have reliability, and uptime appropriate to user needs.
Day-to-day responsibilities: - Working in a team to provide third line support for the infrastructure and application
- Take responsibility, ownership, and coordinate fault resolution. Work alongside a team of engineers where necessary to fix faults that are raised against the supported elements, networks, or applications,
- Own the deployment process to provide regular service improvements delivered by our engineering team
- For service impacting incidents lead the investigation into the RCA, producing any reports and co-ordinating the delivery of any fixes to mitigate further occurrences
- Use and maintain our monitoring platforms
Essential traits: - Strong hands ‑ on experience with Linux operating systems, including Debian, Ubuntu, and CentOS.
- Proven ability to work within distributed, cloud ‑ based environments across Azure, AWS, or GCP.
- Proficiency with monitoring and observability tools such as Nagios, Zabbix, or New Relic.
- Exceptional troubleshooting and fault ‑ finding skills in complex technical environments.
Nice to have Skills: - Working knowledge of Oracle databases and related tooling.
- Experience with IBM QualityStage for data quality and transformation workflows.
- Solid understanding of SQL for querying, analysis, and data manipulation.
- Familiarity with Software ‑ Defined Networking concepts and technologies.
- Awareness of cloud and platform security best practices.
- Background in incident management processes and frameworks.
- Possession of Azure or AWS cloud certifications.
- Experience with release management and deployment automation tooling.
- Exposure to containerization technologies, particularly Docker.
About Kroll Join the global leader in risk and financial advisory solutions-Kroll. With a nearly century-long legacy, we blend trusted expertise with cutting-edge technology to navigate and redefine industry complexities. As a part of One Team, One Kroll, you'll contribute to a collaborative and empowering environment, propelling your career to new heights. Ready to build, protect, restore and maximize our clients' value? Your journey begins with Kroll.
Kroll is committed to creating an inclusive work environment. We are proud to be an equal opportunity employer and will consider all qualified applicants regardless of gender, gender identity, race, religion, color, nationality, ethnic origin, sexual orientation, marital status, veteran status, age or disability .
In order to be considered for a position, you must formally apply via careers.kroll.com.
Salary range for this role is $70, 000 - $90, 000 USD
#LI-Remote
Please see the job description for required or recommended skills.
Please see the job description for benefits.