InComm

  • Site Reliability Engineer - Japan

    Job Locations JP-13-Tokyo
    Type
    Full-Time
  • Overview

    Leveraging deep integrations into retailers’ point-of-sale systems, InComm provides connectivity to a variety of service providers that allow consumers to conduct everyday business at more than 450,000 points of retail distribution worldwide. Whether those consumers are activating prepaid products, paying bills, enjoying real-time discounts through a membership card, purchasing digital goods in-store or adding funds to an online account, InComm is there to provide unique gift-gifting opportunities, cater to on-the-go shoppers, deliver added value through loyalty programs and serve cash-based consumers. With 186 global patents, InComm is headquartered in Atlanta with a presence in over 30 countries in North and South America, Europe and the Asia-Pacific region. Learn more at www.incomm.com or connect with us on www.twitter.com/incomm, www.facebook.com/incomm, www.linkedin.com/company/incomm or www.incomm.com/blog.

     

    Inside InComm from InComm on Vimeo.

     

    About This Opportunity

    InComm Japan is entering into exciting times and we are building sophisticated systems to meet the demands of our growing industry. These endeavors will require a highly skill professional with experience in cloud infrastructure, automation, subject matter expertise in dev ops, and communication skills necessary to work with various internal and external clients. 

     

    Why InComm?

    InComm offers an opportunity to work in the interesting niche of fin-tech. We are producing technologies and services that impact consumer shopping in most parts of the world and partner with many of the world’s well-known brands and retailers. This is an opportunity to bring your experience to a sector that is constantly evolving, fast paced, and unique.

    Responsibilities

    • Scale our cloud infrastructure
    • Deploy reliable and maintainable distributed systems
    • Adhere to industry standard security best practices
    • Write automation, monitoring, diagnostic and debugging tools or leveraging existing global standard or localized tool.
    • Develop and manage infrastructure/application stack, monitoring, and performance metrics
    • Manage, diagnose, and resolve system incidents and internal technical escalations
    • Engage with U.S. Teams, Software Engineers, Product Managers, and Analysts to diagnose and resolve service problems
    • Collaborate with Product Managers and Engineers on new customer implementations and to develop enhancements
    • Work with Agile development teams to ensure smooth promotion of code, configuration and Docker images to production
    • Effectively communicate and engage with all service stakeholders, both internal and external
    • Act as subject matter expert across development, deployment and operation of Applications (and supporting services) with performance, visibility, ease of maintenance and uptime as key areas of focus. 
    • Resolve and find root-cause to difficult production issues across services and all levels of the hardware, network and software layers.
    • Assist in improving and replacing legacy services with best practices.
    • Monitor and alert on symptoms and not on outages.
    • Consult on system design, framework, and capacity planning
    • Scale our cloud infrastructure to support our growing ecosystem
    • Deploy reliable and maintainable distributed systems
    • Write automation, monitoring, diagnostic and debugging tools

    Qualifications

     

    • 7+ years of experience in SRE, DevOps, or similar role
    • Participate in on-call rotation
    • Familiar with design principles of monitoring and alerting systems
    • Experience implementing industry standard security best practices
    • Strong operating system experience (Linux or Windows)
    • 3+ years in a technical role such as: software engineer, systems engineer, or cloud engineer and related networking (TCP/IP, load balancing, etc)
    • Experience working with large enterprise partners
    • Experience in incident management, problem management, and change management
    • Strong desire to develop new skills and evolve within an entrepreneurial organization
    • Experience with various types of storage (network, block, etc.)
    • Can debug network and performance issues in large scale distributed systems
    • Can identify and mitigate reliability risks
    • Experience on below areas
      • Plan - Jira (or equivalent)
      • Create - GitHub (or equivalent)
      • Verify - Jenkins (or equivalent CI/CD)
      • Configure - Ansible
      • Monitoring, Dashboards, Alerts - Dynatrace, Splunk, OpsView
      • Containerization - Docker, Kubernetes
      • CDN, DDoS, Web Security - Incapsula, Akamai
      • Scripting - Python, Bash, Powershell, Groovy
      • Cloud – Azure / GCP / AWS
      • SQL - SQL Server , SQL Sentry
      • Methodology - Agile (Scrum, Kanban)
      • Application Performance Tuning - Java, Spring, Spring Boot
      • Preferred skill:  API management - Apigee (or equivalent)
    •  Must be bilingual in English and Japanese

     #LI-TL1

     

    InComm provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity or national origin, citizenship, veteran’s status, age, disability status, genetics or any other category protected by federal, state, or local law.

     

    *This position is eligible for the Employee Referral Bonus Program

     

    Options

    Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
    Share on your newsfeed