Senior/Lead Technical Support Engineer

Hyderabad, TS, India
Full Time
Senior Executive
Experience: 5-10 years
Role Overview:
This role is responsible for troubleshooting and resolving production incidents. This role acts as a bridge between the support and development teams, handling technical investigations, applying quick fixes, and escalating critical issues. By managing and resolving incidents effectively, this role allows the development team to focus on R&D and feature development.


Key Responsibilities:
  1. Incident Management and Troubleshooting
    • Take ownership of production incidents, perform deep-dive investigations, and provide immediate resolutions or workarounds.
    • Monitor production alerts, logs, and error notifications in real-time to ensure rapid incident response.
    • Escalate unresolved issues to the development team only when necessary, minimizing their involvement in routine incidents.
    • Document all production issues, resolutions, and lessons learned to improve troubleshooting efficiency.
    • Develop and maintain incident response plans to ensure a structured troubleshooting approach.
  2. Collaboration and Support Enablement
    • Work closely with the support team to assist with technical escalations and ensure customer issues are addressed quickly.
    • Coordinate with the development team to report recurring issues that need long-term fixes while reducing their direct involvement in incident handling.
    • Communicate incident status, impact, and resolution progress to key stakeholders and leadership.
  3. System Monitoring and Performance Optimization
    • Monitor support emails, process failure notification emails, and Prometheus alerts to proactively detect or prevent incidents before they occur.
    • Work with DevOps to improve observability, logging, and alerting strategies.
  4. Suggest Workarounds and Implement Quick Fixes
    • Understand the product and customer use cases to provide workaround solutions when needed.
    • Execute minor SQL queries and data fixes to resolve customer issues without requiring development team intervention.
  5. Leadership and Team Management
    • Lead and mentor a team of junior support engineers, ensuring they follow best practices in incident handling.
    • Train the support team on troubleshooting common production issues.
    • Establish clear ownership of incident response to reduce ad-hoc escalations to the development team.

Required Qualifications:
Technical Skills:
  • 5+ years of experience in production support, incident management, or site reliability engineering.
  • Good expertise in Linux/Unix systems and troubleshooting.
  • Experience with monitoring tools such as ELK Stack, Grafana, Prometheus, and CloudWatch.
  • Proficiency in SQL (MySQL, PostgreSQL, or Oracle) for running queries and applying minor data fixes.
  • Hands-on experience with log analysis and debugging using ELK Stack.
  • Knowledge of scripting languages such as Shell, Python, or Groovy to automate incident handling.
  • Familiarity with microservices, REST APIs, and message queues like RabbitMQ and Kafka.
Soft Skills and Leadership:
  • Strong problem-solving and troubleshooting skills under pressure.
  • Ability to mentor junior engineers and effectively lead small teams.
  • Excellent communication skills for collaboration with engineering, CS and DevOps teams
  • Proactive mindset to reduce developer involvement in incident handling and improve overall system reliability.
Share

Apply for this position

Required*
Apply with Indeed
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated. Learn more.

Voluntary Self-Identification of Disability
Voluntary Self-Identification of Disability Form CC-305
OMB Control Number 1250-0005
Expires 04/30/2026
Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Please check one of the boxes below:

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

You must enter your name and date
Human Check*