Ref. nr: IT 00002261
WORK LOCATION - EXACT ADDRESS:
20 Prosta Str, Warsaw
DESCRIPTION/RESPONSIBILITIES:
Business Purpose / Background:
P&G has declared a Cloud First strategy and will be moving most of our workloads to the Cloud over the next few years. At the same time Core Data Lake and Data Hubs Program led by Data & Analytics Organization (D&A) is leveraging Cloud to transform BI and Analytics for the company, harmonizing various data types and allowing standard solutions tailored specifically for needs of specific Business Units.
As Operations Engineer you will be leveraging strong technical and management / soft skills to deliver Operational capabilities to meet internal D&A applications’ business cases in a PaaS way.
You will be responsible for designing and implementing E2E strategy for logging of infrastructure performance counters, various types of events and logs, ongoing monitoring, alerting and response automation, working together with D&A DevOps teams and D&A Platform Infrastructure/Network roles to drive automated responses to alerts.
Core focus is on all Azure resource logs availability, Network, IaaS S/W logs availability (e.g. Apache Airflow tool) and capabilities to monitor and alert based on them. In work you include “automate everything” mindset while providing secure, performant, stable, cost effective and flexible technical "bricks" to applications.
Responsibilities:
- Producing utility Blueprints for Logging & Monitoring tools
- Staying on top of Logging & Monitoring industry best practices, suggesting them to the team (or piloting new capabilities), defining Best Practices and on-boarding various application teams to Platform Best Practices and Available Capabilities
- Ensuring security compliance for Logging & Monitoring utility, working together with Security Engineer
- Developing automation scripts (PowerShell, Python, Azure Automation) to optimize cost & capacity / performance, drive compliance, or to understand overall Logging & Monitoring utility health
- Working with PG Corporate & Application teams, Cloud Service Partners and other vendors to understand, verify, improve and fix as needed shared Logging & Monitoring capabilities to be leveraged across D&A applications. This includes L3 support for major capability incidents escalations for own scope (may happen outside of work hours or weekends) and "deep" Problem Management support (investigate and find permanent fixes where workaround is used on production system due to capability not working correctly)
- Ensuring compliance / Desired State Configuration on IaaS and PaaS parts of the solution; defining compliance requirements with D&A teams
- With time extending scope other capabilities like Orchestration, ETL Metadata Management, CI/CD, Dashboards, automated testing tools
REQUIRED SKILLS:
Strong technical knowledge and demonstrated year experience in each of the following areas:
- Azure LogAnalytics, Azure Diagnostics, Metrics, Event Logs, Application Logs – ability to set up and query them (knowledge of Kusto Query Language)
- PowerShell and/or Linux environment and shell (bash) scripting
- Experience working with MS Azure Cloud Computing Platform & Services (different resources, offerings and tools) and ARM Templates
- Experience in Containers (Docker / Kubernetes) and their monitoring
Necessary technical skills (1 or more of each domain):
- ELK Stack (Elastic Search, Logstash & Kibana), or other tools enabling gathering and analysis of logs gathered from multiple sources
- Programming Languages: Python, Java, C++, PHP, Perl, Ruby
- Overall understanding of infrastructure and Platform components (network, servers), technical knowledge and experience in Hosting and related technologies (data center, cloud [IaaS, PaaS], computing, Windows, Linux, storage, backup, virtualization, etc.), have a passion for these domains, and can learn technologies quickly
- Ability to quickly penetrate technology areas and ask appropriate and relevant technical questions
Desirable skills:
- Experience in Apache Airflow development and/or operations/monitoring
- Knowledge of PowerBI
- Experience in some infrastructure automation/configuration tool – Ansible, Chef, Puppet or similar - you know how those can help with large-scale, complex deployments
- Stewardship experience to ensure services are compliant with relevant P&G policies (Security, Governance, etc.)
- Having a previous experience working on Information Security matters at an infrastructure platform level
- Basic Vendor Management Skills
Qualifications (completed, or to be built on the role)
- ITIL foundations knowledge; or demonstrated IT operations background
- Microsoft Certified: Azure Developer Associate
- Bachelors Degree - minimum
Personal qualities:
- Excellent communication skills (English) with both technical and non-technical colleagues from geographically distributed teams on all organizational levels
- Problem-solving attitude
- Proactive, initiative taking and not afraid to challenge the status quo
- Experience working with external companies
- Being a team player. Big projects aren’t developed by individuals
Skills You Can Expect to Learn/Build on this Job:
- Deep Technical knowledge and experience in cloud BI & Analytics relevant capabilities
- Big Picture understanding of the Global Business Services (GBS) organization and its Services
- Expertise working in global multi-functional team on Cloud Platforms
- Industry Certifications (ITIL, DevOps, Azure)