Summary: The KDB+ / Q Engineer role involves developing monitoring and alerting capabilities for a large installation of KDB+ databases used by quant and trading firms. The position focuses on assessing current server deployments, evaluating existing monitoring tools, and designing analytics dashboards to optimize performance. The role is remote and requires expertise in KDB+ and Q programming. The contract duration is expected to exceed 10 months.
Key Responsibilities:
- Assess current KDB+/Q server deployment, usage patterns, and SLAs.
- Capture inventory of available metrics, logs, and traces.
- Analyze data ingestion and query pathways, including latency and throughput.
- Evaluate existing monitoring tools and identify gaps in monitoring coverage.
- Design and implement analytics dashboards for visualizing Service Level Indicators (SLIs).
- Generate automated reports on server environment profiles and resource utilization.
- Identify and profile slow or resource-intensive queries with optimization suggestions.
- Provide recommendations for tuning and optimization to the client team.
Key Skills:
- Expertise in KDB+ and Q programming language.
- Experience with monitoring tools such as Cerebro and Telegraf.
- Strong analytical skills for assessing server performance and resource utilization.
- Ability to design and implement dashboards for performance monitoring.
- Knowledge of data ingestion and query optimization techniques.
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
Role: KDB+ / Q Engineer
Location: NYC
Duration: 10+ Months
Primary skill: KDB+ / Q. KDB+ is an ultra-low-latency, high-performance time-series database used by quant and trading firms that depend on millisecond-level efficiency. Q is the programming language used to interact with KDB+ databases.
Scope: The Client has a huge installation of on-premises instances and servers (e.g., the KDB+ deployment). The vision is to develop a monitoring and alerting capability that senses when bottlenecks might occur, so that they can be addressed proactively, and build state-of-the-art
Discovery
- Current KDB+/Q server deployment, usage patterns, and SLAs
- Capture an inventory of available metrics, logs, and traces
- Capture data ingestion and query pathways, including latency and throughput
Assessment
- Evaluate existing monitoring tools (e.g., Cerebro, Telegraf, custom tools) used to monitor and manage the KDB+ ecosystem and custom code
- Identify gaps or inefficiencies in current monitoring coverage
- Analyze the integration points and data consumption patterns of applications interacting with KDB+
- Understand key application dependencies and performance considerations
Analytics/Telemetry Dashboard
- Design and implement dashboards, first agreeing with the Client on the level at which they will be aggregated, to visualize Service Level Indicators (SLIs) such as disk space utilization, query latency, system availability, CPU/memory usage, and error rates
- Recommendations for SLI thresholds and alerting
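As an illustration of what SLI thresholds and alerting could look like in practice, here is a minimal Python sketch. The metric names and the warn/critical values are hypothetical placeholders, not Client requirements; actual thresholds would come out of the discovery and assessment phases.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    warn: float
    critical: float


# Hypothetical SLI names and limits, for illustration only.
THRESHOLDS = {
    "disk_used_pct": Threshold(warn=80.0, critical=90.0),
    "query_latency_ms_p99": Threshold(warn=250.0, critical=1000.0),
    "cpu_used_pct": Threshold(warn=75.0, critical=90.0),
    "error_rate_pct": Threshold(warn=1.0, critical=5.0),
}


def classify(name: str, value: float) -> str:
    """Return 'ok', 'warn', or 'critical' for one SLI reading."""
    t = THRESHOLDS[name]
    if value >= t.critical:
        return "critical"
    if value >= t.warn:
        return "warn"
    return "ok"


def evaluate(sample: dict) -> dict:
    """Classify every known SLI in a sample; unknown metrics are skipped."""
    return {
        name: classify(name, value)
        for name, value in sample.items()
        if name in THRESHOLDS
    }
```

For example, `evaluate({"disk_used_pct": 85.0, "cpu_used_pct": 95.0})` classifies disk usage as "warn" and CPU usage as "critical". All SLIs here are "higher is worse"; an availability SLI would need the comparison reversed.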
Deliverables
Overall Optimization of Telemetry:
- Server environment profile report: Automated report generated from programmatically collected information on server specs, KDB+ versions, process topology, and deployment architecture
- Data aggregation (Process & Session): List of in-scope running processes, user sessions, and their configurations, with the data aggregated for analysis
- Resource Utilization and Server Log Analysis: Report and dashboard summarizing latency, CPU, memory, disk, and network usage patterns, with tuning and optimization recommendations for the Client team to carry out
- Query Performance Analysis: Identification and profiling of slow or resource-intensive queries, with optimization suggestions
- Performance Dashboard: Visual dashboard for real-time monitoring of defined metrics
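To give a sense of the Query Performance Analysis deliverable, the following is a rough Python sketch of ranking queries by total time consumed. It assumes query logs have already been exported as (query text, elapsed milliseconds) pairs; that record shape, the function name, and the 500 ms "slow" cutoff are hypothetical and would depend on how the Client's KDB+ processes log queries.

```python
from collections import defaultdict
from statistics import mean


def profile_queries(records, slow_ms=500.0, top_n=5):
    """Group (query, elapsed_ms) records by query text and rank by total time."""
    durations = defaultdict(list)
    for query, elapsed_ms in records:
        durations[query].append(elapsed_ms)

    rows = []
    for query, times in durations.items():
        rows.append({
            "query": query,
            "calls": len(times),
            "total_ms": sum(times),
            "mean_ms": mean(times),
            "max_ms": max(times),
            # Number of executions at or above the assumed slow cutoff.
            "slow_calls": sum(1 for t in times if t >= slow_ms),
        })

    # Queries that consume the most total time come first.
    rows.sort(key=lambda r: r["total_ms"], reverse=True)
    return rows[:top_n]
```

Ranking by total time rather than by the single slowest execution surfaces queries that are individually cheap but run very often, which are frequently the better optimization targets.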
Regards,
Radiantze Inc