Lead Operations Engineer(Cloud infrastructure)

勤務形態
正社員/Fulltime
業種
Web/インターネットサービス
企業資本
日系企業/Japanese Company
勤務地
東京都/Tokyo
給与
740万円〜960万円
日本語力
なし/None
英語力
ビジネスレベル/Business Level
その他語学スキル
なし(None)

仕事内容

Who we are: GlobalSign is the leading provider of trusted identity and security solutions enabling businesses, large enterprises, cloud service providers and IoT innovators around the world to secure online communications, manage millions of verified digital identities and automate authentication and encryption. Its high-scale PKI and identity and access management (IAM) solutions support the billions of services, devices, people, and things comprising the Internet of Everything (IoE). The company has offices in the Americas, Europe, and Asia. Our customers: Intel, Adobe, CISCO, HSBC, Google, Microsoft, Yahoo, Ford, PWC and many more. What we do: TrustLogin is a division in GlobalSign offering IDaaS solutions (single sign on, users management, secure authentication etc.) to enterprise customers in Japan and abroad. The aim of the product is to provide a rich set of features stably and reliably 24/7/365. Who we are looking for: We are looking for an experienced operations engineer with thorough understanding of modern cloud technologies and how web applications work. You will help us ensure further stability and reliability of our product in production environment and facilitate usage of test environments by development and QA teams. One more thing about testing – everything, and we mean it, is tested in staging environments (including deployment process itself) before being rolled out to production – no shortcuts here. We are looking for a person who would be able to become operational quickly. Interview process will include a deep technical interview to assess candidate’s level. That being said we realize that finding a perfect match can be quite difficult, so it is okay if some of the listed things are new for you, as long as you are willing to embrace the responsibilities that come with the position and learn. We believe that this position is a perfect opportunity for your further growth as an experienced operations engineer where you can use all modern technologies (as long as it makes practical sense) to build and support an automated highly reliable infrastructure for feature rich product used by tens of thousands of users 24 hours a day, 7 days a week, all year around. Responsibilities Cloud infrastructure ■Design, implementation, and maintenance using Terraform Security ・Review and update of the current design ・Secure design and implementation of new features Stability (availability, consistency) ・Easy deployments without service interruption ・Auto-recovery Scalability ・Stable performance independently of number of users ■Ownership of production and test environments Development and QA teams support ■Implementation of tools (pipelines, etc.) to allow Development and QA teams create, reset, destroy test environments and to allow deployments to those environments BCP ■Planning ■Execution of exercises ■Incidents’ investigation Monitoring ■Design and implementation of monitoring mechanism ■Alerts’ setup Systems logs ■Collection of system logs ■Error detections and alerts mechanism setup ■Anomalies detection mechanism setup Infrastructure costs ■Regular optimization of infrastructure costs Periodical maintenance ■SSL certificate updates ■DNS assets management Software update ■Detection of outdated and/or vulnerable software and update (on infrastructure level) ■Management of AWS managed services updates

求められる
スキル

Required skills ・Good knowledge of AWS (managed services, ECS, KMS, RDS, EC2, networking etc.) ・Good knowledge of Docker ・Good understanding of HTTP protocol and how web applications work ・Fair understanding of SSL, PKI, including mutual SSL (client certificates), asymmetric and  symmetric encryption and signing ・Good understanding of networking ・Fair understanding of how DNS works ・Some experience with Terraform ・Some experience with pipelines (Jenkins or Bitbucket or Gitlab or GitHub etc.) ・Be comfortable with some scripting language ・Understanding of common types of attacks and vulnerabilities in web applications ・Some experience with Nginx Nice to have skills ・Kubernetes ・Knowing what terms like Raft and CAP are. ・Golang or Ruby or Java programming, or any other mainstream language