Job Description

Role:
- Operate and maintain high availability of software for multiple products running in AWS/Azure/Alibaba Cloud
- Establish a stable, reliable, secure, and high-performing cloud platform
- Troubleshoot complex incidents in staging and production environments
- Evaluate existing DevOps practices and tools, identify improvement opportunities, and drive the adoption of best practices
- Establish, implement, and analyze infrastructure-wide SLOs, KPIs, and metrics
- Co-develop infrastructure and engineering standards, documentation, processes, and procedures with cloud and backend engineers
- Infuse site reliability and security into all areas of Lenovo cloud infrastructure; understand modern software security and secure software systems with cloud-based infrastructure
- Remove roadblocks and ensure your team consistently meets commitments to deliver value by actively supporting cross-functional teams
- Motivate your team to deliver high-impact work; facilitate collaboration with application/solution teams and business stakeholders
- Work with team members on career development by understanding their goals and helping them find stretch opportunities
- Collaborate with senior leadership on defining and executing a DevOps strategy that aligns with business goals and objectives
Position requirements:
Basic qualifications:
- Bachelor's degree in computer science, mathematics, or a related field.
- 10+ years of experience in managing cloud infrastructure, building and supporting cloud-native applications, DevOps, site reliability engineering, or IT engineering
- 5+ years of hands-on technical leadership and people management experience
Preferred qualifications:
- Experience in building cloud infrastructure, including network, security, container, and identity and access management mechanisms and structures.
- Experience with modern engineering tools and techniques to build systems at scale
- Demonstrated experience in managing global DevOps teams, offshore squads, and third-party vendor teams
- Experience with observability tools for cloud services and infrastructure, infrastructure-as-code, and configuration management
- Demonstrated fluency in at least one development language such as Java, C#, Perl, PHP, or Python
- Experience managing AWS/Azure/Alibaba Cloud or distributed VMware environments; demonstrated knowledge of at least one container technology such as Docker
- Experience with source code management tools such as GitHub/GitLab and CI/CD pipeline tools such as Jenkins, Argo CD, or Bamboo.
- Demonstrable experience in infrastructure automation and configuration management across a multi-cloud platform using Terraform, Ansible, Puppet, or Salt.
- Experience with Level 1 troubleshooting across a range of areas including cloud infrastructure (e.g., AWS, Azure), LAN/WAN networking, Unix/SAN, and security.
- Experience in working with front-line technical operations teams (e.g., NOC)
- Strong written and verbal communication skills.
- Experience in designing and creating controlled chaos in production systems.
- Prior experience in working with cross-functional teams to identify and fix issues that affect system reliability and performance.
- Demonstrable leadership, influencing, and communication skills, as well as the ability to drive continuous performance improvements with stakeholders, vendors, and technology subject matter specialists.
- Establishing a stable, reliable, secure, and high-performing cloud platform.
- Designing, building, maintaining, and controlling staging and production infrastructure-as-code (IaC) pipelines.
- Establishing production system health and monitoring/adjusting capacity as needed.
- Documenting SOPs (standard operating procedures) and performing knowledge transfer to others.
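Work Location
Address: Room 902, No. 168, Debi Tianfu 5th Street WE (Chengdu International Digital Technology Center), Hi-Tech Zone, Wuhou District, Chengdu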
