Associate Principal Engineer, Performance and Site Reliability at Nagarro | Torre

Associate Principal Engineer, Performance and Site Reliability

You'll engineer peak performance for billions of transactions, shaping the digital future.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Remote (for Sri Lanka residents)
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted 6 months ago

Requirements and responsibilities


Company DescriptionWe are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (15000+ experts across 26 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!Job DescriptionThe ISV practice is focused on building new offerings targeted towards the diverse market segments in the horizontal tech space, creating new competencies in niche technology areas and supporting sales team to make convincing pitches of Nagarro’s capabilities in the ISV market. As a rapidly growing company, we are seeking geeks who want to squeeze the last bit of performance and maintain every second of uptime for an application. We are looking for a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects where performance, scalability, reliability and resiliency are critical to support billions of online transactions everyday.Profile requirements:Strong knowledge of Java/.NET and SQL/NoSQL, event driven architecturesHands-on experience with Locust, JMeter, Gremlin and other performance and reliability toolsPerformance engineering experiences with a distinguished track record on technically demanding projectsExperience with performance testing, analysis, troubleshooting, and optimizing enterprise software - BS, MS, or PhD in Computer Science or equivalent work experienceOutstanding attention to detail and communication skills, ability to analyze / interpret technical data, ability to identify / resolve system bottlenecks, ability to identify potential limitations in software architectures, ability to write clear and concise technical documents, excellent troubleshooting and reasoning skillsUnderstanding of throughput, latency, memory and CPU utilization, concepts of multi-threading, cache, memory management, Java Virtual Machine, JITWorking knowledge of HTTP load generation / measurement software (e.g. LoadRunner, JMeter, TestMaker, Silk Performer, etc)Working knowledge of APM and Observability tools like Splunk, NewRelic, Prometheus and Grafana - Working knowledge of AJAX, HTTP, REST, JSONWorking knowledge of database tuning and administration (e.g. Oracle, DB2, Oracle AWR / STASPACK)Working knowledge of operating system design / tuning / tools (e.g. Linux, Solaris, OProfile, VTune)Roles and responsibilities: The Performance and Reliability engineering architect must demonstrate the ability to understand novel system architectures, pinpoint problem areas, and develop architectures and succinct test plans across multiple hardware and software tiers.Design modern scalable cloud-native architecture capable of transacting billions of online transactions every dayOrchestrate SLO/SLI design workshops and set performance goals with product owners and engineering leadersWork closely with management, product owners, developers and quality engineers to understand product requirements and design appropriate performance and reliability test plans to verify features performance and scalability requirementsProvide accurate scoping, clear strategies, and milestones for performance and reliability testsCommunicate status, risks to test complete dates, and offer mitigation strategies to ensure on-time delivery of all committed projects to cross functional teamsParticipate in discussions and planning with development teamsDesign and implement scalability, stability, stress, reliability, availability, and longevity tests using industry standard tools and custom developed software written in Java or . NETReview automation implementations and encourage sound design principlesReview test plans, results analysis, capacity planning guides for in-flight releasesWork closely with the Production Success Performance Team to provide product and automation expertise for use in patches, research projects, sizing, and capacity planningCoordinate technical presentations and training for capacity planning guidesReport weekly trends highlighting product performance and team status against committed milestones and projects
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.