Professional Experience
Experience across platform engineering, developer experience, and automation,
focused on building systems that improve engineering velocity, enforce
standards, and scale across teams.
- Designing and building platform capabilities and developer experience systems through open-source projects and structured experimentation, focused on governance, automation, and AI-augmented engineering workflows.
- Built Policy Mesh, an open-source AI control plane that deterministically routes inference across local and cloud LLMs based on cost, data sensitivity, and policy, with rule-based evaluation, explainable decisioning, and structured audit logging for governance and observability.
- Built StackLayer, a reproducible platform lab that provisions a layered, enterprise-style environment including Kubernetes, CI/CD, control plane, developer experience, and observability, enabling realistic testing of platform patterns, pipelines, and governance models outside of cloud dependencies.
- Developed and applied a structured AI-assisted development workflow using tools such as Codex, Claude Code, and Cursor, orchestrating planning, implementation, review, and context management to accelerate delivery while maintaining control and reproducibility.
- Represented the Principal Engineering organization in a cross-team initiative spanning engineering, product, and delivery functions, driving alignment on platform architecture, testing strategy, and workflow standardization.
- Designed and led development of a testing and validation framework, including architecture, starter implementations, and a Python-based orchestration layer for ordered CLI and API workflows, enabling consistent validation patterns across teams.
- Built platform-level automation for deployment validation, including a Python CLI for Rafay environment health and status checks, integrating validation directly into delivery workflows.
- Architected and implemented a Change Management-as-Code platform on AWS using Python, Lambda, API Gateway, DynamoDB, IAM, Secrets Manager, and CloudFormation, enabling engineering teams to programmatically manage the full lifecycle of ServiceNow change records.
- Integrated change automation into CI/CD and platform workflows, enabling systems such as Rafay and Terraform pipelines to create, approve, implement, and close changes through synchronous and asynchronous APIs with DynamoDB-backed state tracking.
- Led Backstage from proof of concept to production as an internal developer platform, owning architecture, roadmap, and backlog, and delivering React plugin integrations for LDAP and ServiceNow to enable service onboarding and workflow visibility.
- Developed governance tooling for secrets management in collaboration with the HashiCorp Vault team, including Python CLI and SDK components that enforced naming standards, policy controls, logging, and auditability across teams.
- Led multiple engineering teams totaling 20+ engineers responsible for enterprise platform automation, hybrid cloud provisioning, and internal tooling across VMware vRealize, custom frameworks, and supporting services.
- Owned the enterprise automation platform, VMware vRealize, as both product owner and technical leader, driving modernization of hybrid cloud provisioning, policy models, and operating practices.
- Reduced infrastructure provisioning time from 1-3 days to under 2 hours by designing and implementing end-to-end automation integrated with ServiceNow change management, approval workflows, and CMDB recording.
- Established enterprise automation standards and CI/CD frameworks using Ansible and Jenkins to support day-2 operations, improve release velocity, and reduce manual intervention across teams.
- Partnered across engineering, enterprise architecture, security, and platform teams to deliver compliant, scalable platform capabilities aligned with governance and operational requirements.
- Built and introduced a platform metrics and visibility system using Python/Django, aggregating VMware and CMDB data to provide insight into resource usage, ownership, and automation health, improving transparency and operational decision-making.
- Led engineering and operations for a distributed private cloud platform supporting ~4,000 virtual machines across multiple geographically distributed datacenters, providing standardized infrastructure services to internal teams.
- Designed and delivered IaaS and XaaS platform capabilities, enabling fully configured, operational, and micro-segmented Windows and Linux environments as repeatable, service-based offerings.
- Built end-to-end automation across the infrastructure lifecycle by integrating systems including ServiceNow, VMware NSX, Active Directory, Citrix NetScaler, Tripwire, Infoblox, and IPAM platforms via REST APIs, enabling automated provisioning, compliance enforcement, and decommissioning workflows.
- Developed internal platform applications using Python/Django for compliance, cloud metrics, and operational visibility, including CMDB correlation, rule-based ownership assignment, and performance/capacity dashboards.
- Created a reusable automation framework for software-defined networking with VMware NSX, abstracting REST APIs and data models into Python-based libraries and significantly accelerating development of network automation workflows across the team.
- Oversaw cross-functional engineering and operational delivery, including internal and vendor resources, code reviews, escalation support, intake prioritization, and controlled release processes within regulated environments.
- Managed and supported enterprise infrastructure across Windows, virtualization, and core services, establishing a strong foundation in large-scale systems operations.
- Provided technical leadership to an offshore engineering team of 8 engineers, including mentoring, escalation support, operational guidance, and change control oversight.
- Led infrastructure initiatives including system upgrades, platform implementations, and disaster recovery exercises, ensuring readiness, reliability, and controlled execution in regulated environments.
- Introduced early automation and reporting capabilities using Python, C#, Perl, and QlikView, including ETL pipelines, data models, and operational dashboards to improve visibility and reduce manual effort.
- Designed and implemented enterprise security baselines for Windows Server environments using Active Directory Group Policy and monitoring systems, improving consistency and compliance across infrastructure.
- Progressed through roles in desktop, systems, and infrastructure engineering, building a foundation across enterprise platforms and operational environments.
- Supported and engineered large-scale systems across Windows, Citrix, VMware, Active Directory, storage, and disaster recovery, including high-availability trading platforms.
- Led and mentored engineering teams, coordinated infrastructure initiatives, and served as a liaison across operations, support, and business stakeholders.
- Developed early automation and reporting solutions using scripting and application frameworks to improve operational efficiency, data visibility, and system management.
- Delivered infrastructure modernization efforts including virtualization, datacenter consolidation, high availability, and business continuity initiatives.
Contact
Interested in working together?
I focus on platform engineering, developer experience, and AI-enabled
engineering systems, particularly where teams need to improve delivery speed
while maintaining strong governance, standards, and operational control.
Open to discussing Staff/Principal roles, consulting, and platform-focused
initiatives.
Contact Lior