Striving for excellence is in our DNA. Since 1993, we have been helping the world’s leading companies imagine, design, engineer, and deliver software and digital experiences that change the world. We are more than just specialists, we are experts.
Our customer is a revolutionary skincare company providing dermatologist-inspired, clinically tested products that work. They’re all about offering life-changing skincare and life-changing opportunities. Founded by world-renowned Stanford-trained dermatologists it is #1 premium anti-aging and acne brand and #2 premium skincare company in the U.S. It is also the most fast-growing skincare brand in the United States over the past 5 years.
Company is redefining an entrepreneurial sales model in the direct selling channel which allows Independent Consultants to establish their own businesses with products they use and love and be rewarded for their volume of Customer sales.
We are currently searching for an experienced Site Reliability Engineer to work as a part of existing customer. DevOps and Infrastructure team with be focused on applications performance monitoring.
Work onsite face to face with client in San Ramon in role of Site Reliability Engineer;
Work on design, implementation and support of key APM configurations for portfolio of applications (continuous perf monitoring);
Set up, monitor and improve application performance metrics;
Implement and advocate application monitoring infrastructure by maintaining installation, configuration and ongoing health of business-critical applications;
Focus on New Relic (or similar APM like Dynatrace, Wily Introscope, etc.) tool to provide expertise in multiple tools and focus areas impacting application monitoring;
Collaborate with cross functional stakeholders to create and maintain dashboards for each system and explain key application health metrics and maintain overall system stability;
Analyze history of monitored applications to ensure applications health and stability;
Provide timely and accurate estimates for deliverables;
Be a part of operations and maintenance team and ensure continuous availability and stability of the systems.
5+ years’ experience across entire SDLC, DevOps and tools, with site reliability management, and deployments;
2+ years’ experience in working with any of APM tools (ideally New Relic) including planning, installation, configuration and maintenance across multiple environments;
Knowledge of TCP/IP networking, load balancers, HA architecture, DR principles;
Good understanding and strong hands-on expertise in DevOps stack (including CI/CD, build tools, release engineering etc.);
Good understanding of key architectural components of typical systems, and other IT engineering skills;
Shell scripting and Linux administration;
1+ year of experience in working hands-on with AWS and/or other cloud hosting provider;
Experience in Splunk, Grafana or ELK stack is a plus;
Understanding of SDLC, experience in working with Agile, using JIRA or other tools;
Proficiency in operating Java-based applications in a large-scale environment;
Experience in working with Windows servers and .NET is a plus;
Strong interpersonal and organizational skills with focus on working in team environment and following defined policies and procedures;
Outstanding verbal and written communications skills.
Competitive compensation depending on experience and skills;
Individual career path;
Social package - medical insurance, sports;
Sick leave and regular vacation;
Partial coverage of costs for certification and IT conferences;
EPAM использует cookie (файлы с данными о прошлых посещениях сайта) для персонализации сервисов и удобства пользователей. Продолжая использовать данный сайт, вы подтверждаете свое согласие на использование файлов cookie. EPAM серьезно относится к защите персональных данных — ознакомьтесь с условиями и принципами их обработки.
Вы можете запретить сохранение cookie в настройках своего браузера.