Junior Site Reliability Engineer


Houston - Telecommute

iland is looking for a Junior Site Reliability Engineer to join our team of Site Reliability Engineers, with a focus on daily operation of distributed storage software, storage hardware, monitoring/metrics/logging, and test/development environments. This is a full-time remote position within the North American region reporting to the Director of Site Reliability Engineering.


  • Day to day operation and management of distributed storage architecture
  • On-going configuration and operation of monitoring and log centralization system
  • Configuration of new, existing, and custom metric exporters
  • Development of dashboards for metrics and monitoring
  • Perform QC/QA on new clusters and software deployments
  • Troubleshooting of new and existing hardware
  • Application lifecycle management to include installation, testing releases, as well as performing upgrades
  • Deployment and configuration of test/development environments
  • All other duties as assigned


  • Experience with Linux, to include system and networking troubleshooting
  • Experience with hardware troubleshooting

Preferred Skills

The following skills represent additional proficiencies preferred to be successful in this position:

  • Experience with network storage
  • Experience with Ceph
  • Experience with both Prometheus as well as Grafana for metrics collection and display
  • Experience with Loki for logging
  • Experience with RESTful APIs
  • Experience with data interchange formats such as JSON and YAML
  • Experience working with the open source community/open source projects
  • Familiarity with container-deployed applications
  • Experience with Git in a distributed multi-user environment
  • Experience with automation/configuration management
  • Familiarity with Saltstack
  • Experience with object storage systems
  • Process troubleshooting using strace/gdb


  • Competitive Salary
  • 401k Plan with Company Match
  • PPO Healthcare Insurance Plan
  • Dental Insurance
  • Vision Insurance
  • Life Insurance
  • Short-Term Disability Insurance
  • Long-Term Disability Insurance
  • Paid Vacation & Holidays
  • Extensive Training

About iland

iland has been in business for over 25 years, and is an industry leader in the areas of Secure Disaster Recovery as a Service (DRaaS), Secure Cloud Backup (BaaS), and Secure Infrastructure as a Service (IaaS). iland differentiates itself and maintains its market leadership by investing heavily in its proprietary Cloud Console, which is an orchestration tool for its cloud services offered in the US, Canada, Europe, Australia, and Singapore. The result of this investment is a rapid development cycle with up to four product releases per year. We provide an exciting, fast-paced environment that has been recognized by these industry leaders and more:

  • Gartner Magic Quadrant "DRaaS" Leader: 2016, 2017, 2018, & 2019
  • The Forrester Wave "DRaaS" Providers: 2014, 2017, & 2019
  • Veeam Impact Partner of the Year: 2015, 2017, 2018, 2019, & 2020
  • Veeam Innovation Award: 2018, 2019, & 2020
  • Zerto Cloud Partner of the Year: 2016, 2017, 2019, & 2020
  • CRN Partner Program Guide Winner: 2018, 2019, & 2020
  • Best of VMworld 2018 Gold Award: 2018
  • Houston Business Journal #1 Best Place to Work: 2012 & 2013
  • Nine Lives Media Inc. Talkin’ Cloud 100: 2011, 2012, 2013, & 2016
  • Houston Business Journal Houston Fast 100: 2012 & 2013
Apply here

Hell yes, please send me new remote jobs by email!

© 2018 Mike's Remote List