职位描述
As part of an AWS managed services engagement, we are looking for an AWS L3 Expert to provide advanced technical support, drive operational excellence, and contribute to infrastructure evolution in their respective region. This role requires a strong expertise in AWS services, automation, and troubleshooting complex incidents while ensuring compliance with the technical standards defined by the central AWS team.Key Responsibilities:Operations & Incident Management (Run)1) Handle complex incidents and critical production issues, ensuring minimal service disruption.2) Conduct Root Cause Analysis (RCA) and implement corrective and preventive ****s.3) Continuously improve monitoring, ****ing, and troubleshooting processes.4) Collaborate with L1 and L2 teams to optimize the escalation process and incident resolution.Infrastructure Evolution & Automation1) Participate in architectural changes and optimizations within the defined AWS Landing Zone ****work.2) Implement Infrastructure as Code (IaC) solutions using Terraform and Ansible.3) Enhance CI/CD pipelines with GitLab CI/CD to streamline deployments.4) Ensure that all infrastructure modifications align with Hermès' security and compliance requirements.Standardization & Compliance1) Apply and enforce the AWS standards and best practices defined by the Technical Lead AWS.2) Ensure regional infrastructure compliance with security and governance policies.3) Contribute to the harmonization of AWS operations across different regions.Collaboration & Knowledge Sharing1) Work closely with the Technical Lead AWS to escalate and resolve major incidents.2) Engage with other AWS L3 Experts to share best practices and maintain consistency across regions.3) Provide technical guidance and mentoring to L1/L2 teams within the region.Continuous Improvement & Innovation1) Proactively suggest improvements to AWS environments, automation tools, and operational processes.2) Stay up to date with AWS innovations and assess their potential impact on current operations.Required Technical Skills1) Strong expertise in AWS services: Compute (EC2, Lambda), Storage (S3, EBS), Networking (VPC, Route 53), Security (IAM, KMS, Security Groups), and Databases (RDS, DynamoDB).2) Infrastructure as Code (IaC): Mastery of Terraform for provisioning and managing AWS infrastructure.3) Configuration Management: Proficiency in Ansible for automation and orchestration.4) CI/CD & DevOps: Experience with GitLab CI/CD for deployment automation.5) Incident & Problem Management: Expertise in diagnosing, troubleshooting, and resolving critical infrastructure issues.6) Security & Compliance Awareness: Good understanding of AWS best practices for security, networking, and governance.Certifications (Recommended)1) AWS Solution Architect Associate (highly recommended).2) AWS SysOps Administrator Associate (nice to have).3) AWS Specialty Certifications (Security, Networking) are a plus but not mandatory.Soft Skills & Language Requirements1)Problem-solving mindset with a proactive approach to improving operations.2) Strong communication skills to interact with internal teams and stakeholders.3) Ability to work in a global environment with distributed teams.4) Good client relationship management, ensuring high-quality support.5) Fluent in English (operational level required, fluent English is a plus).
企业介绍
凯捷是全球性的企业合作伙伴,利用技术的力量改造和管理企业业务。其宗旨是通过技术释放人类能量,创造一个包容和可持续的未来。凯捷是一个负责任的多元化组织,在50余个国家拥有超过34万名团队成员。自1997年进入中国市场以来,始终以人才为核心,扎根并服务于中国市场,近10年来业务持续飞速增长。凭借其55年的悠久历史和深厚的行业专业知识,在快速发展的云、数据、人工智能、互联连接、软件、数字工程和平台的创新世界推动下,凯捷深受客户信任,能够满足客户从战略、设计到运营的全方位业务需求。