SR Technical Operations Engineer (Web3 Core Platforms)
Description
We at Pearster are looking for a SR Technical Operations Engineer (Web3 Core Platforms) to join a prominent American blockchain company through our team. This company is a cloud-based infrastructure provider that powers the global blockchain ecosystem. Its mission is to be the indispensable utility that enables companies and innovators worldwide to build next-generation, Web3-enabled businesses and applications using blockchain technology.
The Role
Make mainnet boring. We launch chains, keep them fast, and fix the stuff that shouldn’t break. You’ll run Kubernetes at scale, ship Infrastructure-as-Code (IaC), lead incidents, and write the tooling that keeps production sane. Occasionally patching a client or upstreaming small fixes is a plus.
This role gives you high-impact ownership of production systems, with a focus on reliability, automation, and performance in a fast-paced, remote-first environment.
Responsibilities
- Launch & Upgrade Chains: Standups, hard-forks, snapshots, pruning, clean rollbacks.
- Automate Everything: Modules, golden images, CI/CD, zero-touch deployments across regions.
- Run Kubernetes at Scale: Safe rollouts, HPA/VS/Ingress tuning, capacity and cost planning.
- Own Incidents: Lead SEV0–2, publish RCAs, and implement fixes that prevent recurrence.
- Build Signal, Not Noise: Define SLOs/error budgets, build useful dashboards, and write alerts that page only when users are impacted.
- Code Where It Counts: Write or extend tools for snapshots, replay/load, state sync checks; patch client bugs in production and upstream when appropriate.
Requirements
- Location: Latam / Europe (remote-based).
- Linux + Kubernetes: Debug real production issues including networking, storage, rollouts, and performance.
- IaC (Terraform, Helm, Ansible): Build repeatable, scalable infrastructure.
- Programming (Go or Python + Bash): Automate repetitive tasks and build small, precise tools.
- Blockchain Operations: JSON-RPC internals, running/tuning RPC and validator nodes, log analysis.
- Observability: Define and monitor SLOs/error budgets, Prometheus/Grafana instrumentation.
- Networking: Strong fundamentals in DNS, TLS, and load balancing; reasoning about anycast/BGP when needed.
Experience & Signals We Care About
- Clear production ownership: understanding blast radius and rollback plans.
- SLO thinking and measurable improvements in alert noise, latency, or MTTR.
- Real RPC/validator operations experience (beyond laptop demos).
- Tooling/code contributions that improve operations (Go/Python, IaC modules, or small upstream fixes).
- Ability to explain complex failures simply and leave systems simpler.
- Production experience with one or more: EVM (Geth/Erigon/Nethermind/Besu), Cosmos SDK/CometBFT, Solana (Agave/QUIC), Substrate.
- Multi-cloud experience, including capacity and cost modeling that survives real-world conditions.
Benefits
- Fully remote work arrangement as a contractor.
- Competitive salary in USD.
- PTO days per year.
- 100% company-covered international certifications.
- Access to coworking spaces worldwide.
- English classes.
- Engaging team-building activities.
- Personalized gifts.
- Welcome kit.
- Referral programs and much more!
Locations: United States
Remote status: Fully Remote