Position Summary
We are seeking a dedicated MIS Executive to join our 24/7 website monitoring team.
This role is critical in ensuring optimal performance, availability, and security of our web applications and digital infrastructure. The successful candidate will be responsible for real-time monitoring, incident response, and maintaining comprehensive reporting on system performance metrics.
Key Responsibilities Primary Monitoring Duties
Continuous Surveillance: Monitor multiple websites, web applications, and digital platforms 24/7 using advanced monitoring tools and dashboards
Performance Tracking: Track key performance indicators including uptime, response time, page load speed, and user experience metrics
Alert Management: Respond to automated alerts and notifications within defined SLA timeframes (typically 2-5 minutes)
Incident Detection: Proactively identify potential issues, anomalies, and performance degradation before they impact end users
Status Reporting: Maintain real-time status updates and communicate system health to stakeholders
Technical Operations
Tool Management: Utilize monitoring platforms such as Nagios, Zabbix, New Relic, Pingdom, or similar enterprise monitoring solutions
Log Analysis: Review server logs, application logs, and error reports to identify patterns and root causes Database Monitoring: Monitor database performance, query execution times, and connection pools Security Surveillance: Watch for security threats, suspicious activities, and potential cyber attacks Backup Verification: Ensure automated backups are running successfully and data integrity is maintained
Incident Response & Escalation
First-Level Support: Provide immediate response to critical incidents and system outages
Escalation Management: Follow established escalation procedures for complex issues requiring specialized expertise
Documentation: Create detailed incident reports, including timeline, impact assessment, and resolution steps Communication: Coordinate with development teams, network administrators, and third-party vendors during incidents
Post-Incident Analysis: Participate in post-mortem reviews to prevent recurring issues
Reporting & Analytics
Performance Reports: Generate daily, weekly, and monthly performance reports for management review
Trend Analysis: Identify trends in system performance and recommend preventive actions
SLA Compliance: Track and report on Service Level Agreement compliance metrics
Dashboard Management: Maintain and update monitoring dashboards for various stakeholder groups
Capacity Planning: Assist in capacity planning by analyzing usage patterns and growth trends
Education & Experience
Bachelor's degree in Computer Science, Information Technology, MIS, or related field 2-4 years of experience in website monitoring, system administration, or IT operations Experience with 24/7 shift operations and on-call responsibilities
Proven track record in incident management and problem resolution
Technical Skills
Monitoring Tools: Proficiency with monitoring platforms (Nagios, Zabbix, SolarWinds, New Relic, Datadog) Web Technologies: Strong understanding of HTTP/HTTPS, DNS, CDN, load balancers, and web server technologies
Database Knowledge: Familiarity with MySQL, PostgreSQL, Oracle, or SQL Server monitoring
Scripting: Basic scripting skills in Python, Bash, or PowerShell for automation tasks Network Fundamentals: Understanding of TCP/IP, firewalls, and network troubleshooting Cloud Platforms: Experience with AWS, Azure, or GCP monitoring services
Soft Skills
Attention to Detail: Exceptional attention to detail and ability to spot anomalies quickly Communication: Excellent written and verbal communication skills for incident reporting Problem-Solving: Strong analytical and troubleshooting skills
Stress Management: Ability to work effectively under pressure during critical incidents
Time Management: Excellent prioritization skills to handle multiple concurrent issues
Team Collaboration: Strong teamwork skills for coordinating with cross-functional teams
Preferred Qualifications
Industry certifications (CompTIA Network+, ITIL Foundation, AWS Certified) Experience with DevOps practices and CI/CD pipelines
Knowledge of security monitoring and SIEM tools
Experience with containerized environments (Docker, Kubernetes) Familiarity with log management tools (ELK Stack, Splunk)
Previous experience in e-commerce or financial services environments
Working Conditions & ScheduleShift Requirements
24/7 Coverage: Rotating shifts including nights, weekends, and holidays Shift Duration: 8-12 hour shifts depending on operational requirements On-Call Duties: Participate in on-call rotation for critical incidents
Position Summary We are seeking a dedicated MIS Executive to join our 24/7 website monitoring team.
This role is critical in ensuring optimal performance, availability, and security of our web applications and digital infrastructure. The successful candidate will be responsible for real-time monitoring, incident response, and maintaining comprehensive reporting on system performance metrics.
Key Responsibilities Primary Monitoring Duties
Continuous Surveillance: Monitor multiple websites, web applications, and digital platforms 24/7 using advanced monitoring tools and dashboards
Performance Tracking: Track key performance indicators including uptime, response time, page load speed, and user experience metrics
Alert Management: Respond to automated alerts and notifications within defined SLA timeframes (typically 2-5 minutes)
Incident Detection: Proactively identify potential issues, anomalies, and performance degradation before they impact end users
Status Reporting: Maintain real-time status updates and communicate system health to stakeholders
Technical Operations
Tool Management: Utilize monitoring platforms such as Nagios, Zabbix, New Relic, Pingdom, or similar enterprise monitoring solutions
Log Analysis: Review server logs, application logs, and error reports to identify patterns and root causes Database Monitoring: Monitor database performance, query execution times, and connection pools Security Surveillance: Watch for security threats, suspicious activities, and potential cyber attacks Backup Verification: Ensure automated backups are running successfully and data integrity is maintained
Incident Response & Escalation
First-Level Support: Provide immediate response to critical incidents and system outages
Escalation Management: Follow established escalation procedures for complex issues requiring specialized expertise
Documentation: Create detailed incident reports, including timeline, impact assessment, and resolution steps Communication: Coordinate with development teams, network administrators, and third-party vendors during incidents
Post-Incident Analysis: Participate in post-mortem reviews to prevent recurring issues
Reporting & Analytics
Performance Reports: Generate daily, weekly, and monthly performance reports for management review
Trend Analysis: Identify trends in system performance and recommend preventive actions
SLA Compliance: Track and report on Service Level Agreement compliance metrics
Dashboard Management: Maintain and update monitoring dashboards for various stakeholder groups
Capacity Planning: Assist in capacity planning by analyzing usage patterns and growth trends
Job Types: Full-time, Permanent
Pay: ₹11,807.99 - ₹33,137.63 per month
Benefits:
- Health insurance
Schedule:
- Day shift
- Night shift
- Rotational shift
Work Location: In person