About the job Infra/Dev Ops Engineer (Metal-as-a-Service Expert, LATAM)
Job Description
Location:
Fully remote,
LATAM timezone
Start date:
ASAP
Languages:
Fluent English is mandatory
Industry:
Cloud Computing / Web3 / AI European Saa S
Blackfluo.ai is a fully remote company with teams located around the globe. We specialize in developing Saa S solutions for businesses and consulting firms. Our innovative AI assistant is designed to support daily operations by taking over repetitive and time‑consuming tasks, allowing our clients to focus on what truly matters.
If you're excited about working on ambitious projects in a dynamic and flexible environment, we'd love to hear from you!
Professional Background:
We're looking for an
Infra/Dev Ops engineer with deep expertise in Metal-as-a-Service (MAAS)
and bare‑metal automation. You should have a strong Linux background and direct experience managing large‑scale on‑prem and cloud‑adjacent infrastructureshundreds of nodes across multiple sites.
Startup or hyper‑growth experience is a strong plus. Autonomy, speed, and problem‑solving are essential.
Your Responsibilities:
Maintain and support core infrastructure systems with deep knowledge of Linux (Debian/Ubuntu preferred).
Work close to the metal: BIOS, IPMI, RAID setups, and hardware‑level diagnostics are part of your comfort zone.
Design and maintain scalable networks using VLANs, L2/L3 routing, VPNs, and especially Uni Fi equipment.
Automate infrastructure provisioning and operations with Ansible, Bash/Python, and Git‑based workflows.
Set up and manage observability stacks, including Prometheus/Grafana for metrics and Graylog, ELK, or Loki for log centralization.
Build tooling for server discovery, config auto‑generation,
automated OS deployments , PXE/Preseed/Cloud‑init, and
strong MAAS‑based provisioning.
Integrate and/or develop internal APIs for tracking compute and GPU resource allocation, as well as external APIs (billing, monitoring, Open Stack, etc.).
Deploy and maintain virtualization and orchestration systems such as Open Stack (preferably with Kolla‑Ansible), Proxmox VE, or VMware ESXi.
Support container‑based workloads and isolate services efficiently.
What You Bring:
Expert‑level Linux administration (preferably Debian/Ubuntu).
Strong MAAS / Ironic / bare‑metal automation experience (mandatory).
Excellent networking fundamentals: VLANs, routing, VPNs.
Infrastructure as Code (Ansible), scripting (Bash/Python), Git Ops.
Experience with monitoring and logging tools like Prometheus, Grafana, ELK/Graylog.
Comfort with custom deployment automation (PXE, Preseed, MAAS/Ironic).
Familiarity with resource tracking, API integrations, and dashboard development.
Proven experience with Open Stack, Proxmox, VMware, and container orchestration.
Nice To Have
Familiarity with firewall rules, access control, and security policies.
Experience with Cloudflare API for DNS management and tunnel setups.
IT asset management or software license tracking exposure is a plus.
Why Join Us:
A European tech startup on a mission to reinvent cloud infrastructure through decentralization. By combining modular data‑center architecture with advanced automation, the goal is to deliver a sovereign, energy‑efficient, and high‑performance alternative to traditional cloud providers.
Expect a dynamic, challenge‑driven work environment where autonomy and problem‑solving are encouraged. Every contribution has a direct impact on the success of a bold and forward‑thinking project.