NOC Engineer, Taguig
Taguig, Philippines
Posted: less than a month ago
1. Monitoring & Operations (Core Responsibility)
Provide 24/7 monitoring support for production systems and monitoring dashboards, including platforms such as Zabbix, Prometheus, Grafana, and Huawei Cloud Monitoring.
Continuously monitor key business metrics (QPS, error rate, response time, success rate) and infrastructure health indicators (CPU, memory, disk, and network usage).
Perform routine inspections of online systems, data center environments, and network connectivity.
Maintain shift logs and inspection reports according to operational standards.
2. Alert & Incident Handling
Respond immediately to alerts received via phone call, SMS, Microsoft Teams, or similar channels, and perform initial troubleshooting based on SOPs.
Identify alert severity levels and execute the corresponding response procedures.
Independently resolve common issues when possible, such as:
- Restarting services
- Cleaning up disk space
- Resource scaling
- Traffic switching
Escalate unresolved issues to L2 support or development teams following the escalation process, and track incidents until resolution.
Continuously optimize alert rules to reduce false positives, missed alerts, and alert storms.
3. Daily Operational Tasks
Handle operational requests through the ticketing system, including:
- Account creation
- Permission requests
- Resource allocation
- Firewall policy changes
- Domain and SSL certificate applications
Support deployment and release processes, including version releases, rollbacks, and configuration changes.
Assist SRE, DBA, network, and security teams with routine tasks such as:
- Backup verification
- Slow query checks
- Vulnerability scan follow-ups
Maintain CMDB asset information to ensure accuracy of servers, IPs, applications, and ownership records.
4. Incident Response & Postmortem
Act as the first responder during incidents by coordinating communication channels, incident bridges, and status updates.
Participate in incident review and postmortem meetings.
Help document timelines and improve SOPs and knowledge base articles.
5. Documentation & Knowledge Management
Create and maintain operational manuals, emergency response plans, and SOP documentation.
Prepare weekly/monthly operational reports, including:
- Top alerts
- Incident counts
- Resolution times
- Other operational metrics
Qualifications
Diploma or above in Computer Science, Communications, Networking, or related fields.
1-3 years of experience in Operations, NOC, or system monitoring roles (strong fresh graduates are also welcome).
Familiar with common Linux commands and able to independently perform:
- Log analysis
- Process troubleshooting
- Network connectivity testing
- Disk and memory issue investigation
Basic understanding of TCP/IP, HTTP, DNS, and load balancing concepts.
Able to interpret outputs from tools such as ping, telnet, curl, and tcpdump.
Familiar with at least one monitoring platform:
- Zabbix
- Prometheus + Grafana
- Nagios
- Alibaba Cloud / Tencent Cloud Monitoring
Basic operational knowledge of common services and middleware (e.g., checking status, restarting services, reviewing logs):
- Nginx
- Tomcat
- Redis
- MySQL
Basic Shell scripting skills; able to read and make small modifications to scripts.
Strong sense of responsibility, attention to detail, and ability to work under pressure.
Willing to work rotating shifts, night shifts, and holiday coverage.
Good communication skills and ability to stay calm and organized during incidents.
Preferred Qualifications (Plus Points)
Knowledge of ITIL processes or related certifications.
Familiar with ITSM/ticketing systems such as:
- Jira Service Management
- ServiceNow
- ONES
- In-house ticketing systems
Experience with basic Docker/Kubernetes operations:
- Checking Pod status
- Viewing logs
- Restarting services
Experience supporting large-scale systems (e-commerce, finance, gaming, live streaming, etc.).
Ability to write small automation tools using Python.
Relevant certifications such as:
- RHCSA
- HCIA
- Huawei/Cisco networking certifications
- Cloud provider certifications
Company name: BB Wave Inc.
Job position: NOC Engineer
NOC Engineer has been posted in the Taguig Engineering category on Locanto.