Site Reliability Engineer

Site Reliability Engineering (SRE), Python, Django, FastAPI, Flask, SQL, RESTful, pytest
Description

GSPANN is hiring a Site Reliability Engineer with to ensure high availability and performance of critical systems using tools like Prometheus and Nagios. The role involves developing reliable Python code, managing APIs, and optimizing system efficiency across teams.

Who We Are

GSPANN has been in business for over a decade, with over 2000 employees worldwide, and servicing some of the largest retail, high technology, and manufacturing clients in North America. We provide an environment that enables career growth while still interacting with company leadership.

Visit Why GSPANN for more information.

Location: Hyderabad / Any Offshore Location
Role Type: Full Time
Published On: 17 December 2024
Experience: 8+ Years
Description
GSPANN is hiring a Site Reliability Engineer with to ensure high availability and performance of critical systems using tools like Prometheus and Nagios. The role involves developing reliable Python code, managing APIs, and optimizing system efficiency across teams.
Role and Responsibilities
  • Ensure critical systems' high availability and performance.
  • Design and implement monitoring systems using tools like Prometheus or Nagios.
  • Analyze system performance and optimize efficiency.
  • Design, build, and maintain efficient, reusable, and reliable Python code.
  • Follow best practices in Python development, including frameworks like Django, Flask, and FastAPI.
  • Develop and maintain RESTful APIs using frameworks such as DRF and FastAPI.
  • Manage and optimize relational (like MySQL) and NoSQL (like MongoDB) databases.
  • Maintain data integrity and optimize database performance through effective design and management.
  • Implement and manage CI/CD pipelines to automate deployment processes.
  • Develop scripts and tools to automate repetitive tasks and reduce manual effort.
  • Write and maintain unit tests using Python testing frameworks like pytest, PyUnit, and Unit Test.
  • Debug and resolve software issues to ensure smooth application operation.
  • Respond to and resolve incidents, minimizing downtime and impact.
  • Conduct root cause analysis and implement preventive measures.
  • Manage web servers, deployment processes, and activities.
  • Oversee queuing mechanisms and background tasks for efficient processing.
  • Collaborate with development and operations teams to improve system reliability and performance.
  • Communicate effectively with stakeholders to understand requirements and provide updates.
Skills and Experience
  • Bring 8+ years of experience as an SRE.
  • Demonstrate hands-on experience in Python 3.
  • Design, build, and maintain reusable, reliable Python code.
  • Adhere to best practices in Python, Django, Flask, FastAPI, etc.
  • Possess good knowledge of databases, including relational and NoSQL.
  • Gain experience in building RESTful APIs using frameworks like DRF, FastAPI, etc.
  • Demonstrate proficiency in unit testing and debugging using Python testing frameworks.
  • Understand web servers, deployment processes, queuing mechanisms, and background tasks.
  • Communicate clearly and effectively.

Key Details

Location: Hyderabad / Any Offshore Location
Role Type: Full Time
Published On: 17 December 2024
Experience: 8+ Years

Apply Now