Hays
Location: Montreal, QC
Availability/Duration: ASAP + 1-YEAR CONTRACT (possible extension)
Working organization: Hybrid (3 days on site/ 2 days on remote)
Language: English
RESPONSIBILITIES
- Level 3 production management of Autosys, Gemfire, and Redis plants. This includes handling escalations from Level 1 and Level 2, incident management, request fulfillment, problem management, and change management.
- Deep-diving into complex troubleshooting, implementing changes, and serving as an escalation point for the Level 2 teams.
- Improving operational processes and automation by proactively identifying, analyzing, and improving upon existing processes and tools.
- Work in a global environment in a team that has members across the globe and provides support 24/7 in a follow the sun manner.
- Creating and maintaining best practices / policies and ensuring that they are followed.
- – Work with External vendors and internal key stake holders to plan and execute changes.
- – Participate in weekly review meetings and the squads’ agile ceremonies, and actively engage with various engineering teams to review the infrastructure.
QUALIFICATIONS
- Experience with Linux System Administration (preferably Red Hat)
- Hands-on data caching support (preferably Gemfire or Redis) experience
- Familiarity with batch scheduling (Autosys preferred)
- Familiarity with scripting and orchestration (Perl, Python, Ansible, GitOps)
- Experience with monitoring and alerting tools and modern observability concepts and tools
- Excellent written and oral English communication skills. The candidate must be capable of writing documentation, making presentations to an internal audience and interacting positively with upper management, colleagues, and customers
- – Independent problem-solving skills, self-motivated, and a mindset for taking ownership
- – Knowledge of operational and agile, DevOps, and SRE concepts such as SLA, metrics, toil, SLIs/SLOs, observability, and automated deployment pipielines
- A minimum of 3-5 years of infrastructure production support experience, preferably in a regulated environment (e.g. finance IT)
OTHER SKILLS
- Public/private Cloud experience
- Experience with containerization technologies (Kubernetes/Openshift)
- Experience with work management tools such as Jira, ServiceNow, Git
- Experience with troubleshooting tools such as TCPdump and Wireshark.
- – Interest and understanding of emerging IT trends