Job Description
We are looking for a seasoned Senior System Architect with a strong foundation in Snowflake and AWS architecture, who also brings hands-on expertise in production support and operational reliability. This role will be responsible for designing robust, scalable data warehouse solutions while ensuring high availability, performance, and supportability of production environments.
The ideal candidate combines strategic architectural vision with the technical depth needed to troubleshoot, optimize, and maintain complex data systems in real-time
Key Responsibilities:
- Lead end-to-end solution design for data warehouse platforms using Snowflake on AWS.
- Architect and oversee data ingestion, transformation, and provisioning pipelines that are performant, secure, and maintainable.
- Design and implement monitoring, alerting, and logging frameworks for production data environments.
- Provide hands-on production support, including issue triaging, root cause analysis, resolution, and post-mortem reporting.
- Establish and manage SLA-driven support models, escalation paths, and on-call processes.
- Optimize Snowflake performance, warehouse sizing, and cost management strategies for production use.
- Collaborate with DevOps, platform, and engineering teams to maintain high system uptime and reliability.
- Ensure security, governance, and compliance standards are enforced across the entire lifecycle.
- Document architecture, production support runbooks, and incident resolution procedures.
Required Qualifications:
- Minimum 15 years of experience in data architecture, enterprise systems design, and cloud data platforms.
- In-depth experience with Snowflake, including query optimization, virtual warehouse tuning, and monitoring in production environments.
- Strong hands-on expertise with AWS services, especially S3, Glue, Lambda, CloudWatch, and IAM.
- Experience setting up and managing production support processes, on-call rotations, and incident response in large-scale environments.
- Strong SQL skills, with ability to troubleshoot and optimize long-running or failed queries in production.
- Familiarity with tools like dbt, Airflow, Matillion, and monitoring tools such as Datadog, Splunk, or CloudWatch Logs.
- Demonstrated ability to work under pressure in fast-paced environments and maintain system reliability.
Preferred Qualifications:
- SnowPro Advanced Architect certification or equivalent.
- Experience with automated testing, CI/CD pipelines, and infrastructure as code (IaC).
- Exposure to real-time data streaming (e.g., Kinesis, Kafka) and event-driven designs.
- Background in production support for regulated industries (e.g., finance, healthcare, insurance) is a plus.
