Altenar is an international IT company founded in 2011, with offices in Malta, Greece, Georgia, the Isle of Man, and Uruguay. We specialize in high-load software development and provide one of the best technology solutions for the iGaming industry worldwide.
The Monitoring Infrastructure Engineer is a specialised system design, implementation and administration role intended to give direct and segregated focus to the systems that aggregate metrics, correlate activity and initiate alerting events from the Altenar production systems. The role is responsible for providing reliable infrastructure to which applications, systems and other infrastructure can send or expose their instrumented metrics or logs. The role is also responsible for providing visualisation systems that make those metrics and logs available for querying and for further calculation and aggregation.
This role works closely with the Security team because it fulfils a critical element of segregation. It is therefore separate in access and accountability from the teams that produce code or provide deployment and hosting services.
What we are responsible for:
- Providing a fantastic environment to work in
- Supporting your personal and technical development with training opportunities
- Providing career growth in a very interesting company with a global customer base
- Providing you with support to ensure you have a good working environment
- Providing guidance and technical direction
- Building a strong Technology Operations team around you to ensure we maintain agility whilst keeping compatible segregation of duties.
What you’d be responsible for:
- Design, creation and maintenance of shared monitoring, log aggregation and visualisation systems
- Infrastructure setup and configuration for alert routing, and supporting the Service Management team in implementing improvements and feature requests related to these systems
- Operating system and hosting infrastructure issue resolution for these systems
- Working closely with the teams in the DevOps collective on the selection of collection agents, the definition of log format, structure, labelling and tagging, and metric instrumentation and dimensionality, enabling as much input-side configuration as possible whilst adhering to security and regulatory compliance requirements
- Continuous improvement to reduce the overhead of collection, adjust the retention to achieve optimal efficiency and ensure as much value is derived from these systems as possible.
- Being guided by the principles of security by design and serviceability, so that agility does not come at the expense of disproportionate risk or the accumulation of technical debt
- Being trustworthy and accountable in the handling of company data and physical assets
- Acting with accountability in handling the technical intelligence stream from Altenar’s production systems
Desired requirements:
- Degree in a related major (Engineering / CS)
- Proven system administration experience in heterogeneous environments (Windows / Linux with RPM package management predominantly)
- Basic scripting skills (Bash, PowerShell, Python or other)
- Detailed knowledge of TCP/IP and related networking topics such as IP multicast and public IPv6
- Good understanding of process scheduling, virtual memory models, swap and troubleshooting performance in the operating system
- Good understanding of distributed/shared block storage, file storage, IO performance
- Good understanding of virtualisation using VMware, KVM and other technologies.
- Good understanding of Site Reliability Engineering concepts and approaches and the so-called golden signals of monitoring
- Good grasp of metric types, multi-dimensional metrics, time-series, units, aggregation and statistical concepts such as percentiles and distributions.
- Good knowledge of query languages
- Experience with monitoring systems (Zabbix, Grafana, Prometheus)
- Experience with log aggregation system infrastructure (Elastic / Graylog) in high-availability, high volume configurations
- Ability to draw up infrastructure blueprints for hierarchical collection and aggregation points, separated by environment
- Experience with Docker and Kubernetes
- Experience with DB administration (MSSQL and the NoSQL database MongoDB)
- Working knowledge of Git-based version control
- Working knowledge of Ansible
- Excellent organizational skills and familiarity with project planning approaches/tools
- Excellent communication skills suited to a highly accountable and sensitive role
- Strong documentation skills
Benefits:
- Stable and flexible working environment.
- Career growth opportunity.
- Training and professional development events.
- Health insurance for employees and close family members.
- Teamwork and accountability.
- Sense of community and defined company culture.
- International work environment.
- Diverse workplace.
- Gym reimbursement after successfully passing the probationary period.