Job Title: DevOps / Site Reliability Engineer (SRE)
Location: Coimbatore, Tamil Nadu (Hybrid/Remote)
Job Type: Full-time
About the Role
We are looking for a DevOps / Site Reliability Engineer (SRE) to own the stability, deployment, and performance scaling of our real-time, low-latency meta-dispatch kernel. Unlike typical cloud-only roles, this position combines strong software engineering with hands-on management of bare-metal Linux infrastructure.
You will work directly with our core architecture team to ensure our concurrent Go ingestion layers, Python heuristic engines, and real-time gRPC communication pipelines operate with deterministic microsecond latency. You will design, implement, and maintain the infrastructure that keeps hundreds of high-throughput mobile and aerial assets synchronized across the country.
Key Responsibilities
- Infrastructure Ownership: Configure, optimize, and maintain our enterprise bare-metal Dell PowerEdge server environment running high-density Linux host distributions.
- Kernel & Network Tuning: Maximize packet-processing throughput by implementing advanced Linux system configurations, including CPU core isolation (isolcpus), interrupt affinity, socket re-use parameters (SO_REUSEPORT), and 1GB HugePages allocation.
- SRE Framework Implementation: Codify the reliability of our network core. Define and monitor precise Service Level Indicators (SLIs) and Service Level Objectives (SLOs) around memory allocation boundaries, network saturation, and gRPC payload latency.
- CI/CD Pipeline Architecture: Build and automate robust deployment pipelines that securely compile cross-platform, statically linked Go binaries and containerized Python workloads.
- Observability & Monitoring: Design and scale high-fidelity telemetry dashboards that track the "Four Golden Signals" (Latency, Traffic, Errors, Saturation) so that performance degradation is detected and mitigated early.
- Security & Fail-safe Engineering: Implement and maintain mutual TLS (mTLS) for traffic that crosses public wireless networks, and manage root privileges and security permissions within Linux systemd service units.
Required Technical Skills
- Systems & Infrastructure: 3+ years of experience in Linux System Administration managing dedicated bare-metal servers (compute sharding, hardware offloading, storage arrays).
- Programming/Scripting: Proficiency in Go (Golang) and Python for writing automation scripts and monitoring tools, and for understanding low-level execution paths.
- Networking Protocols: Deep understanding of high-concurrency network architectures, including low-level UDP sockets, TCP, gRPC, and Protocol Buffers serialization.
- Process Management: Strong experience deploying, performance-capping, and securing system services via Linux systemd.
- Databases: Hands-on experience scaling and managing high-frequency write workloads in MongoDB or PostgreSQL.
Preferred Qualifications
- Familiarity with spatial indexing libraries (specifically the Uber H3 spatial grid system or PostGIS).
- Experience configuring network elements over commercial 5G/LTE cellular backhauls or Machine-to-Machine (M2M) SIM communication channels.
- A strong background in blameless post-mortem culture and in reducing operational toil through automation.
What We Offer
- The opportunity to work on a cutting-edge, high-impact proprietary kernel platform.
- A tech-first environment where performance engineering takes priority over boilerplate cloud configuration.
- Competitive compensation and growth opportunities within a fast-scaling venture.
Pay: ₹160,446.92 - ₹567,232.34 per year
Benefits:
- Flexible schedule
- Internet reimbursement
- Work from home
Work Location: Remote