Software Developer - Production Engineering Watson Orders
IBM
Mountain View, California
software
production
production engineering
engineering
watson
ibm
ai
management
software
telemetry
cloud
watson
learning
Apply with Tarta Assistant 🤖
Unleash the power of automation for your job search (Paid option) Apply Manually(Free)
I have time, I'll manually find and apply for jobs
Unleash the power of automation for your job search (Paid option) Apply Manually(Free)
I have time, I'll manually find and apply for jobs
90% of users say Tarta.ai Assistant helps them save time applying for jobs.
Not a member? Click
here to subscribe.
December 21, 2022
IBM
Mountain View, California
597455BR
- Work closely with other Watson Orders development teams in an embedded SRE model to help define & implement key metrics for uptime, reliability, and performance of these services and develop runbooks for incident management.
- Develop deep service telemetry through metric collection, distributed tracing, visualization, and reporting via Open Telemetry, Prometheus, and related tooling.
- Implement stability and performance optimizations in Python.
- Participate in the definition and management of SLIs, SLOs and error budgets for infrastructure and production services.
- Design, develop and maintain CI\\CD pipelines for integration and edge Kubernetes clusters.
- 5+ Years Linux experience configuring, supporting, and optimizing
- 2+ Years experience architecting, deploying, and supporting edge k8s environments.
- 2+ years experience designing and supporting distributed systems.
- 2+ years Experience in one of more languages such as Python, Java, Go - ability to debug, optimize, and write scalable code.
- Experience implementing telemetry frameworks (Open Telemetry, prometheus) and infrastructure (Prometheus, Jaeger, and similar tools)
- Experience designing and implementing infrastructure as code pipelines
- Familiarity with AWS DevOps (Roles, VPCs, S3, Terraform)
- Familiarity running distributed ML workloads in cluster orchestrated environments
- 2+ Years PubSub Experience (Kafka, MQTT, SQS)
- 12 weeks of paid parental bonding leave. Family care options are also available to support eligible employees during COVID-19.
- World-class training and educational resources on our personalized, AI-driven learning platform. IBM's learning culture supports your restless attitude to grow your skills and build the depth and scale of knowledge needed to achieve your career goals.
- Well-being programs to support mental and physical health.
- Financial programs that empower you to plan, save, and manage your money (including expert financial counseling, 401(k), IBM stock discount, etc.).
- Select educational reimbursement opportunities.
- Diverse and inclusive employee resource groups where you can network and connect with IBMers across the globe.
- Giving and volunteer programs to benefit charitable organizations and local communities.
- Discounts on retail products, services, and experiences.
Report this job