Technical RCA Annotator (Telecom Logs)
Place of work
Work from home
Job details
Job description, work day and responsibilities
Full job description
HireArt is helping our client find a skilled Technical RCA Annotator (Telecom Logs) to support the development of an LLM system specialized in automated log analysis and root cause analysis (RCA) for software systems.
In this role, you will be responsible for annotating complex telemetry data, creating training datasets, and ensuring high-quality ground truth labels for machine learning model development.
Responsibilities Include:
Data Annotation and Telemetry Analysis
Annotate telemetry data to identify failure origin, start time, and root cause (e.g., CPU overload, disk I/O, network issues).
Work across multi-modal telemetry sources including logs (INFO, WARN, ERROR), metrics (CPU usage, response time), and traces (service interactions and dependencies).
Create contextual annotations that capture both high-level service and low-level component failure details.
Domain-Specific Annotation Expertise
Apply knowledge of telecommunications systems (e.g., Data, IMS, LC, LTE, MMCP, NR, QMI-DFC, QMI-LANCER, RF) to annotate modem failures.
Annotate service topologies and system architecture, including component dependencies and failure propagation.
Perform temporal analysis to label cascading or time-sensitive failure sequences.
Quality Assurance and Collaboration
Ensure consistent, high-quality annotations with strong inter-annotator agreement.
Identify edge cases and out-of-distribution failure scenarios.
Collaborate with engineering teams to validate annotation accuracy against known incidents.
Bachelor's degree in Computer Science, Software Engineering, Telecommunications, or a related technical field
3+ years of experience in one or more of the following:
System administration and troubleshooting
Network operations center (NOC) operations
DevOps/SRE practices
Software debugging and incident response
Domain Expertise & Analytical Skills:
Experience with telemetry systems and observability tools (e.g., DataDog, Splunk, ELK, Prometheus), including log parsing, system monitoring, and root cause analysis
Strong understanding of Linux/Unix systems, including kernel-level debugging and system-level troubleshooting
Familiarity with telecommunications protocols and standards
Skilled in analyzing large-scale datasets (~300k tokens, millions of log lines), identifying anomalous patterns, and conducting systematic failure investigations
Professional-level fluency in English
Preferred Qualifications:
Familiarity with 3GPP specifications, cellular network technologies, and modem systems (e.g., Qualcomm); experience with distributed systems including microservices architecture, service mesh technologies, and time-series databases
Experience annotating data for ML projects; understanding of large language models (LLMs), their capabilities and limitations, and evaluation metrics such as precision, recall, F1-score, BLEU, and BERTScore
Proficiency in scripting languages (Python, Bash) and working with structured data formats like JSON, XML, CSV, and log files
Proficient with Git workflows, annotation platforms, and collaboration tools used for team coordination
Experience querying and analyzing large datasets in support of ML training or system analysis tasks
Commitment: This is a fully remote independent contractor position that will be staffed directly with HireArt's client.
You will be redirected to another website to apply.
Offer ID: #1246218,
Published: 10 hours ago,
Company registered: 1 month ago