Senior Tech Lead Aiops Engineering
Company | TIAA |
---|---|
Location | Dallas, TX, USA, Charlotte, NC, USA |
Salary | $119500 – $169800 |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- Experience with event management, monitoring and observability toolsets like but not limited to Moogsoft/BigPanda/Service Now, SignalFX/Dynatrace, Splunk, etc.
- Experience creating custom integrations between Monitoring systems leverage automation and APIs.
- Experience with scripting languages such as Python, JavaScript etc.
- Experience developing and administering APIs (Kong or similar technologies) to integrate custom observability data (logs, metrics, events and traces) with other systems.
- Experience with AWS, GCP or other cloud technologies.
- Experience with Automation Tools such as Ansible, BigFix, power automate, etc.
Responsibilities
- Oversees the acquisition, installation, and any upgraded computer components and software and planning for service outages and other problems.
- Identifies, implements and monitors best practices for technology architecture, while providing expert advice on core infrastructure initiatives.
- Executes the engineering and operational roadmaps for domain services.
- Communicates project, operational, and strategy risks and opportunities to senior management, along with corrective action plans when required.
- Publishes metrics to qualify and quantify configuration success, and ensures that metrics measure appropriate operational and strategic goals.
- Approves documentation as it relates to domain configuration, routing, processes, and service records.
- Delivers on all project and operational commitments successfully, including quality and timeliness metrics.
- Coaches, mentors and delegates work to lower level professionals.
Preferred Qualifications
- Awareness and trainings in AI technologies is preferred.
- Experience in LLMs is preferred.
- Provide thought and strategic planning in mapping data elements in Moogsoft and work with support teams to build correlation rules with their data sources to provide value to their department and stakeholders.
- Develop and refine strategies to improve incident detection, correlation and root cause analysis.