Senior Platform Operation Manager - VP
Place of work
Work from home
Job details
Job description, work day and responsibilities
JR016595
This position is for a Senior Platform Operation Manager of Snowflake/Gen AI/Postgres CET based in Glasgow office with Site Reliability Engineering (SRE) oversight responsible for managing and improving the global database infrastructure services.
Position Description:
The Snowflake/ Postgres Customer Engagement Team (CET) is part of the Enterprise Computing Data Services Organization in Morgan Stanley. It is part of the Data & Analytics Technology (DAT) fleet, responsible for managing mission critical distributed database platforms like Snowflake, Postgres and Greenplum on public-cloud and on-prem
The successful candidate will be also be designated incident and escalation manager for the global production Data and Analytics infrastructure during EMEA time zone.
The person will also lead run-the-bank type of projects such as data center migration , plantwide version upgrade , release management , plant automation, database design and architecture, performance monitoring and optimization.
In addition, the person would also participate at least one squad as SRE, following Agile practice and contributing to the infra modernization and automation.
10+ years of overall enterprise level IT experience.
Strong domain expertise related to distributed database platforms both on-prem/cloud like Snowflake /Postgres or Greenplum.
Strong shell scripting and python programming skills for SRE related work.
Advanced Linux / Unix skills
Experience on using Splunk OR Grafana/Prometheus/Loki stack
General understanding of Project Management , Database design and architecture , Data Integrity and security , Disaster recovery and backup.
Knowledge on Agile methodologies
Effective oral and written communication skills, and interpersonal skills to work well in a team environment required.
Strong organizational and coordination skills with the ability to manage multiple tasks and high-pressure situations for outage handling, management, or resolution.
Strong Incident Management Skills with proper understanding of ITIL procedures.
Be available for weekend work.
Key Responsibilities:
Deploy Optimize and manage enterprise scale distributed database platforms like Greenplum , Snowflake/Gen AI and Postgres.
Respond to incidents, troubleshoot issues, and conduct root cause analysis.
Design, implement, and maintain disaster recovery and high-availability solutions.
Automate plant wide operational tasks related to provisioning, monitoring, backups, scaling, and recovery.
Monitor system health, identify performance bottlenecks, and implement optimizations.
Collaborate with development teams to support schema design, query optimization, and database best practices.
Ensure data security, compliance, and access controls are enforced.
Participate in on-call rotations and incident response.
You will be redirected to another website to apply.
Offer ID: #1246715,
Published: 20 hours ago,
Company registered: 1 month ago