KOMOJU (by Degica) is the leading cross-border payment gateway for Japan. We power payments for companies like the video game distribution platform Steam and the popular mobile app TikTok. Today we help thousands of merchants by providing the payment infrastructure they need, from developer-friendly APIs to integrations on popular platforms like Shopify and Wix, and we help our merchants grow in every market they expand into.
As our systems grow in complexity, scale, and traffic, maintaining their reliability and availability becomes increasingly challenging, and increasingly critical. We're looking for a Site Reliability Engineer (SRE) with a focus on observability to help us meet these demands.
In this role, you'll be at the forefront of ensuring that our infrastructure is not just running, but understandable and measurable. Observability is a core pillar of our reliability strategy: it's how we detect issues before they impact our merchants and users, quickly understand the root causes of incidents, and continuously improve our systems' performance and reliability.
You’ll design and evolve our observability platform, including metrics, logging, tracing, and alerting, and partner with development teams to embed observability into every stage of the software lifecycle. Your work will directly impact our ability to scale confidently and respond to incidents swiftly.
This is a key role for someone who wants to build resilient systems, empower teams with actionable insights, and make a real difference in how we operate at scale.
While we are a remote-first company, this position is based in Tokyo, and we expect candidates to be willing to relocate to Japan.
Design, implement, and maintain our observability stack (metrics, logging, tracing, dashboards).
Define and monitor SLIs/SLOs to ensure service health and reliability.
Collaborate with engineering teams to instrument applications for better visibility.
Build and maintain dashboards and alerts that provide actionable insights and minimize alert fatigue.
Troubleshoot system performance and reliability issues using observability data.
Educate and guide engineering teams on best practices in monitoring, alerting, and incident response.
Contribute to postmortems and continuously improve system transparency and resiliency.
Knowledge of CI/CD pipelines and experience integrating observability into build and deploy processes.
Familiarity with incident response, on-call rotations, and post-incident reviews.
Business-level Japanese.