Job Description
Role Overview
The Data Centre Engineer is responsible for monitoring, maintaining, and optimizing server infrastructure to ensure high availability, performance, and security within the data center environment. This role involves troubleshooting hardware and software issues, implementing infrastructure improvements, and collaborating with cross-functional IT teams.
Key Responsibilities
Monitoring and Maintenance
- Continuously monitor server and infrastructure performance using monitoring tools.
- Perform routine maintenance, upgrades, patching, and health checks to ensure optimal functionality and security.
- Maintain system uptime and reliability across production environments.
Troubleshooting and Issue Resolution
- Identify, diagnose, and resolve hardware and software issues related to servers and infrastructure.
- Troubleshoot complex incidents and collaborate with network, storage, and application teams when required.
- Document incidents, root cause analysis, and resolutions for future reference.
Implementation and Optimization
- Deploy new server infrastructure and configurations in coordination with engineering teams.
- Optimize system performance, resource utilization, and capacity planning.
- Support infrastructure upgrades, migrations, and enhancements.
Security and Compliance
- Implement and maintain security controls to protect server infrastructure.
- Ensure compliance with organizational security policies and industry standards.
- Support vulnerability remediation and audit requirements.
Documentation and Collaboration
- Maintain accurate documentation for configurations, processes, and procedures.
- Collaborate with network engineers, database administrators, and DevOps teams for seamless integration.
- Participate in change management and release processes.
Support and On-Call
- Provide technical support to internal teams and stakeholders.
- Participate in on-call rotations and respond to after-hours incidents when required.
Required Skills & Qualifications
- Experience in data center operations, server administration, or infrastructure support.
- Strong knowledge of:
- Server hardware and operating systems (Linux/Windows)
- Virtualization technologies
- Networking fundamentals
- Monitoring and troubleshooting tools
- Understanding of security best practices and compliance requirements.
- Excellent problem-solving and communication skills.
- Ability to work in a fast-paced, mission-critical environment.
Preferred Qualifications
- Certifications such as:
- CompTIA Server+
- RHCSA/RHCE
- Microsoft certifications
- Data center certifications (CDCP, etc.)
- Experience with automation or scripting (PowerShell, Python, Bash).
- Exposure to cloud or hybrid infrastructure environments.