This job posting has expired and no longer is available. Please explore other opportunities.

Cloud Operations Engineer

Who we are

At iTMethods, our mission is to provide the leading integrated DevOps SaaS Platform that enables companies to compete by securely delivering software faster and with higher quality. We’re focused on providing a better, revolutionary way for companies to accelerate their business innovation and software delivery.


Our DevOps SaaS Platform provides a flexible and integrated toolchain that allows development teams to focus on building software without having to maintain an ever-evolving set of DevOps tools and capabilities. The platform automates access to industry leading applications from Atlassian, CloudBees Jenkins, GitHub, GitLab, Sonatype, JFrog, SonarQube and many more. It delivers enterprise security, hybrid integration, reliability, scalability and a host of platform add-ons that streamline the on-boarding of users and teams across the enterprise.


Today leading companies in many industries including finance, software, media, retail and others rely on iTMethods’ DevOps SaaS Platform and expertise for providing the best in DevOps and Cloud automation, applications services and management. Over the years we’ve won numerous awards, passed audit and certifications and gained the trust of our valued customers and partners worldwide.


At iTMethods, we live and breathe 3 core values; we are customer obsessed, take no shortcuts and we act as one team. Our culture is driven by the belief of customers above everything. We embrace new and exciting practices to better serve and advance our customers. We have a dynamic and rich history and a bright future ahead! Our team is united by a common purpose to provide the best DevOps Saas Platform possible, and an endless ping pong tournament.


The opportunity

Reporting to the Senior Operations Manager, we are adding a Cloud Operations Engineer to the team. This is an opportunity to work closely with customers, engineers, technology consultants, and various delivery teams to ensure maximum uptime for customers on our platform. This is an opportunity to work with big-name customers, using a variety of tools as projects continuously enter and exit the pipeline. It’s a chance to create repeatable processes and procedures, and anticipate opportunities for automating the operations to minimize human error and move towards a self-healing environment with automatic recovery. 


Who you are

You are a Technology Operations professional with hands-on knowledge of AWS, Terraform, Ansible, Jenkins, Git, and Artifactory. You describe yourself as an innovator, curious, an automator, a multitasker, obsessed with continued education and an excellent communicator. You are addicted to problem-solving, you own your work, and execute flawlessly and independently. You like to share knowledge, mentor others, and thrive in high-growth, fast-paced environments. You are curious and enjoy experimenting with a range of technologies to constantly improve efficiency in how we respond to incidents and run our operations.


What’s in it for you

  • Impact. You are ready to play a critical role in ensuring the overall stability and resiliency of our platform. You want to work on multiple initiatives using a range of tools instead of just focusing on operational issues and support requests. You want to engage in and improve the whole lifecycle of services from inception and design, through deployment, operation, and refinement.
  • Exposure. You want to deliver high-quality experiences for users on our platform. You want to use your expertise to advise and provide functional support to our customers on their DevOps tools (Jenkins, GitHub, Artifactory and similar products) and cloud workloads. 
  • Growth. You want to apply and expand your technical expertise including professional certification in AWS and Jenkins. You are eager to keep up to date with cutting edge technologies impacting the solutions being operated as well as best practices in software delivery.


Your day-to-day:

  • Ensure a sustained focus on engineering with a goal of exposing faults and applying engineering to address root cause of faults so they do not re-occur
  • Demand Forecasting and Capacity Planning – Creating and maintaining good visibility to usage and demand and planning and executing the changes for provisioning the capacity through rigorous change management protocols and ensuring efficient use of resources
  • Eliminating toil related to manual, repetitive, tactical solutions with no enduring value which can either be eliminated or automated for a more sustained and scalable solution
  • Develop automation solutions to improve and optimize operational processes and services.
  • Act as a subject matter expert and implement DevOps best practices for our customers.
  • Assist in the configuration and support of customer environments; code deployments, optimization, and various tools.
  • Architect and implement monitoring and logging solutions, identify issues proactively, and mitigate them to improve the customer experience.
  • Test/Audit/Review solutions to ensure we deliver a resilient, monitored, highly secure, and complete solution.
  • Troubleshoot and resolve escalated software and infrastructure related issues and challenges.
  • Contribute to the continuous improvement of all operations collaterals and services to efficiently manage and maintain deployments
  • Explore and evaluate new and emerging software tools and technologies.


What you bring

  • The mindset. You have a site resiliency engineering mindset constantly looking to avoid and eliminate faults.
  • The drive. You thrive on developing solutions to open-ended business problems. You can work within a team and independently on multiple concurrent initiatives.
  • The certifications. You are certified in AWS and Jenkins, or other DevOps tools.
  • The expertise. You have at least 2 years hands-on knowledge of and experience with:
  • AWS and Jenkins
  • Configuring or customizing Jenkins and other DevOps tools
  • Linux or Windows administration
  • Continuous integration of best practices
  • Managing and supporting AWS or other cloud environments.
  • Configuration management, using Ansible, Terraform or similar tool
  • Use of performance monitoring, and alerting tools to assess service as it relates to service levels
  • In depth experience developing application-level monitoring using tool such as DataDog. Understanding of how to identify the meaningful metrics to monitor so that user-impacting problems can be identified before they occur.  
  • Functional knowledge and expertise in common DevOps Tools
  • Using code to automate so that mistakes can be avoided
  • The flexibility. You are available to work rotating daytime shifts and participate in an after-hours on-call schedule.


Join us.

Apply here or learn more on our websiteMedium or LinkedIn.


iTMethods is committed to fostering an inclusive and accessible environment where employees feel valued and respected, and where every employee has the opportunity to realize their potential. We are committed to providing reasonable accommodations, if required, and will work with you to meet your needs.  

Subscribe to Job Alerts