Aegistech

Director of SRE and OBSERVABILITY

Take me back

Share this Opportunity

Location: NYC, New York

Salary/Pay Range: $250,000 - $275,000

Job Description

We are currently looking for a head of Observability development and engineering with enterprise wide responsibility for all the metrics, telemetry, events and logs across internal and external cloud and distributed computing.



Roles and Responsibilities




  • Managing and developing a team of over 50 people with a mix of employees and contractors.

  • Developing and enhancing full-stack observability for Hybrid, Multi-cloud deployments providing consistent view for various teams.

  • Strong development and agile process background

  • Reviewing existing Observability and Monitoring solutions to uplift technology solutions to provide scale-out, robust, cost effective solutions

  • Identify comprehensive risks and risk-mitigation-mapping matrix, Coordinating with the Security, Risk & Compliance teams

  • Develop high-level solution specifications with attention to integration and feasibility (technical, function and financial)

  • Ensure solution meets all requirements of quality, security, modifiability, extensibility and scalability.

  • Actively seek ways to improve business software processes and interactions

  • Proactive collaboration with team members to identify common challenges and by continually researching best practices in coding

  • Prepare an easy to understand report detailing achieved milestones and short-term and long-term project goals

  •  



Skills / Qualifications Required




  • Advanced years of experience from architecture and design to delivery and support of complex highly scalable robust solutions.

  • Experience in designing scalable enterprise solutions with high volume, high frequency data

  • Good knowledge of self-serviced Monitoring solutions from - Real User Monitoring, Synthetic Monitoring, Application performance monitoring, Endpoint Monitoring, Compute and Storage monitoring.  With solutions like Prometheus, Grafana, AppDymanics, Splunk, ElasticSearch

  • Experience in developing and coordinate cloud architecture across diverse areas including Application Development, Identity and Access Management, Network, Data management and Security to determine functional and non-functional requirements.

  • Experience in Infrastructure as Code, CI/CD tools (Jenkins, Bitbucket, Artifactory, JIR, ansible, Terraform, Cloud Formation Templates, Puppet etc.) 

  • Good knowledge of Operating Systems (Linux, Unix, Solaris, Windows, mainframe will be a plus) and Enterprise Computing

  • Good Understanding of Networking (TCP/IP, IP addresses, HTTP, and DNS is an added advantage)

  • Good understanding of security (knowledge of firewall and other security components) and open source technologies

  • Experience working on large-scale projects, leading teams in an agile methodology.

  • Experience developing software utilizing various languages including Ruby, Python, Java/J2EE, C++, PHP, .NET, GoLang, etc..

  • Familiarity with HTML/CSS, JavaScript and UI/UX design

  • Experience in a modern microservice framework (Spring Boot, Node.js/Express, Microprofile, Ruby on Rails, etc)

  • Experience of cloud platforms (AWS, Google Cloud, Azure, IBM) to design, create, and deploy solutions using Container/Kubernetes (Openshift, IBM Cloud Private, EKS...)

  • Experience designing secure software systems based upon industry-specific specifications

  • Oversee the technology budget and manage the budget and forecasts to ensure relevant targets are met

  • Utilize in-depth knowledge of how technology integrates within  and of direct competitors products and services

  • Make evaluative judgments based on analyzing information in complicated or unique situations, utilizing multiple sources of information

  • Negotiate with senior leaders across functions, and with external parties, as required



Education:




  • Bachelor’s degree/University degree or equivalent experience


Follow Us On