Special Interest Groups (SIGs)

Content Source

This content is automatically synced from SIGS.md in the llm-d/llm-d repository.

SIG Overview

Special Interest Groups (SIGs) are the primary organizational units for coordinating work across the llm-d project. Each SIG focuses on a specific area of the project's technology stack and is responsible for driving the design, implementation, and maintenance of its components.

SIGs provide a mechanism for:

Focused expertise: Bringing together contributors with specialized knowledge in specific areas
Coordinated development: Ensuring consistent architectural decisions across related components
Community building: Creating smaller, more manageable groups for collaboration and mentorship
Accountability: Establishing clear ownership and responsibility for specific project areas

SIG Structure and Governance

SIG Leadership

Each SIG has:

SIG Leads (2-3 people): Responsible for overall SIG direction, coordination, and decision-making

SIG Responsibilities

Drive technical design and implementation in their area
Maintain documentation and architectural decisions
Coordinate with other SIGs on cross-cutting concerns
Mentor new contributors and grow the community
Participate in project-wide planning and releases

SIG Meetings

Regular meetings (typically weekly) for technical discussions

Relationship to Project Governance

SIGs operate within the broader llm-d project governance framework defined in PROJECT.md:

SIGs follow the project's lazy consensus decision-making process
Major cross-SIG decisions require project maintainer approval
All SIG work follows the project's contribution guidelines

Active Special Interest Groups

SIG	Focus Area	Meeting Schedule	Documentation
SIG Inference Scheduler	Intelligent request routing, load balancing, and traffic management	Weekly Tuesdays 12:00 PM ET (Convert to your TZ)	• Meeting Recordings and Docs • llm-d-inference-scheduler Repository
SIG Benchmarking	Performance testing, benchmarking frameworks, and optimization	Weekly Thursdays 1:00 PM ET (Convert to your TZ)	• Meeting Recordings and Docs • llm-d-benchmark Repository
SIG PD-Disaggregation	Prefill/decode separation, distributed serving, and workload disaggregation	Weekly Tuesdays 1:30 PM ET (Convert to your TZ)	• Meeting Recordings and Docs
SIG KV-Disaggregation	KV caching, prefix caching, and distributed storage systems	Weekly Tuesdays 12:00 PM ET (Convert to your TZ)	• Meeting Recordings and Docs • llm-d-kv-cache-manager Repository
SIG Installation	Kubernetes integration, deployment tooling, and platform operations	Weekly Thursdays 11:00 AM ET (Convert to your TZ)	• Meeting Recordings and Docs
SIG Autoscaling	Traffic-aware autoscaling, resource management, and capacity planning	Weekly Wednesdays 2:00 PM ET (Convert to your TZ)	• Meeting Recordings and Docs
SIG Observability	Monitoring, logging, metrics, and operational visibility	Weekly Thursdays 12:30 PM ET (Convert to your TZ)	• Meeting Recordings and Docs
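All meeting times in the table are given in ET. As a convenience, the conversion to a local time zone can be sketched with Python's standard library. This is a minimal sketch, not project tooling: the function name `next_meeting` is hypothetical, and it assumes that "ET" corresponds to the IANA zone `America/New_York` and that the schedule in the table is current.

```python
from datetime import date, datetime, time, timedelta
from zoneinfo import ZoneInfo

# Assumption: "ET" in the schedule means the America/New_York zone,
# so daylight saving time is handled by the zone database.
EASTERN = ZoneInfo("America/New_York")


def next_meeting(weekday: int, hour: int, minute: int,
                 local_tz: str, today: date) -> datetime:
    """Return the next occurrence of a weekly ET meeting in local_tz.

    weekday follows Python's convention: Monday is 0, Sunday is 6.
    If today is the meeting day, today's meeting is returned even if
    its start time has already passed.
    """
    days_ahead = (weekday - today.weekday()) % 7
    meeting_day = today + timedelta(days=days_ahead)
    meeting_et = datetime.combine(meeting_day, time(hour, minute), tzinfo=EASTERN)
    return meeting_et.astimezone(ZoneInfo(local_tz))


if __name__ == "__main__":
    # Example: SIG Inference Scheduler, Tuesdays 12:00 PM ET, viewed from Berlin.
    print(next_meeting(1, 12, 0, "Europe/Berlin", date.today()))
```

The shared project calendar remains the authoritative source for meeting times; the sketch only illustrates the arithmetic behind the "Convert to your TZ" links.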

SIG Detailed Descriptions

SIG Inference Scheduler

👥 Leadership: Nili Guy, Abdullah Gharaibeh, Vita Bortnikov

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Develop and maintain intelligent request routing and load balancing systems that optimize for latency, throughput, and resource utilization across distributed inference workloads.

Key Areas:

vLLM-optimized inference scheduling algorithms
KV-cache aware routing and load balancing
Integration with Kubernetes Gateway API and Inference Gateway Extension
Flow control and traffic shaping
SLA-aware request prioritization

💬 Communication:

Slack Channel: #sig-inference-scheduler
Meeting Recordings and Docs: Public Google Drive

SIG Benchmarking

👥 Leadership: Marcio A L Silva, Ashok Chandrasekar

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Establish comprehensive performance testing and benchmarking frameworks to ensure llm-d delivers optimal performance across diverse workloads and hardware configurations.

Key Areas:

Benchmarking frameworks and methodologies
Performance regression testing
Workload simulation and synthetic data generation
Hardware-specific optimization
Performance analysis and profiling tools

💬 Communication:

Slack Channel: #sig-benchmarking
Meeting Recordings and Docs: Public Google Drive

SIG PD-Disaggregation

👥 Leadership: Robert Shaw

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Design and implement prefill/decode disaggregation patterns that enable efficient separation of inference workloads across heterogeneous hardware and scaling requirements.

Key Areas:

Prefill/decode workload separation
Disaggregated serving architecture
Cross-instance communication protocols
Heterogeneous hardware optimization
Dynamic workload balancing between Prefill and Decode instances

💬 Communication:

Slack Channel: #sig-pd-disaggregation
Meeting Recordings and Docs: Public Google Drive

SIG KV-Disaggregation

👥 Leadership: Maroon Ayoub, Danny Harnik

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Design and implement distributed KV caching solutions that improve inference performance through intelligent cache management, prefix sharing, and disaggregated storage.

Key Areas:

Distributed KV cache architecture
Prefix cache hierarchies (local, remote, shared)
Cache-aware scheduling and routing
Storage optimization for inference workloads
Integration with vLLM's KVConnector

💬 Communication:

Slack Channel: #sig-kv-disaggregation
Meeting Recordings and Docs: Public Google Drive

SIG Installation

👥 Leadership: Brent Salisbury, Greg Pereira

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Ensure llm-d integrates seamlessly with Kubernetes and provides robust deployment, scaling, and operational capabilities for production environments.

Key Areas:

Kubernetes-native deployment patterns
Helm charts and operators
Installation and configuration management
Multi-node orchestration with LeaderWorkerSet
Platform integration and operational best practices

💬 Communication:

Slack Channel: #sig-installation
Meeting Recordings and Docs: Public Google Drive

SIG Autoscaling

👥 Leadership: Tamar Eilam, Abhishek Malvankar

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Develop intelligent autoscaling solutions that automatically adjust llm-d deployments based on traffic patterns, workload characteristics, and hardware utilization.

Key Areas:

Traffic-aware autoscaling algorithms
Hardware-specific scaling policies
Workload-based capacity planning
Integration with Kubernetes HPA/VPA
Cost-optimized scaling strategies

💬 Communication:

Slack Channel: #sig-autoscaling
Meeting Recordings and Docs: Public Google Drive

SIG Observability

👥 Leadership: Sally O'Malley, Roy Nissim, Benedikt Bongartz

⭐️ North Star Design Document ↗️ (Google Docs)

Charter: Provide comprehensive monitoring, logging, and observability capabilities that enable operators to understand system behavior, diagnose issues, and optimize performance.

Key Areas:

Metrics collection and visualization
Distributed tracing and logging
Performance monitoring and alerting
Operational dashboards and reporting
Integration with monitoring ecosystems (Prometheus, Grafana, etc.)

💬 Communication:

Slack Channel: #sig-observability
Meeting Recordings and Docs: Public Google Drive

Getting Involved

Joining a SIG

Attend a meeting: Check the project calendar for SIG meeting times
Join the conversation: Participate in SIG-specific channels on Slack
Review documentation: Read the SIG's charter and current initiatives
Start contributing: Look for "good first issues" labeled with the SIG's area

SIG Communication Channels

Slack: Each SIG has dedicated channels in the llm-d Slack workspace
Google Groups: Join llm-d-contributors for comment access to SIG documents
GitHub: Issues and discussions are labeled by SIG area
Calendar: All SIG meetings are on the shared project calendar

SIG Formation and Evolution

Creating a New SIG

Identify need: Demonstrate community interest and technical necessity
Draft charter: Define scope, goals, and initial leadership
Proposal process: Submit proposal following project contribution guidelines
Community review: Present at weekly project standup and gather feedback
Approval: Obtain approval from project maintainers

SIG Lifecycle Management

Active: Regular meetings, active development, engaged community
Maintenance: Limited active development, focus on stability and bug fixes
Archived: No longer active, historical reference only

SIGs may evolve, merge, or be archived based on project needs and community engagement.

Resources

Project Calendar: llm-d Public Calendar
Slack Workspace: https://llm-d.slack.com
Google Groups: https://groups.google.com/g/llm-d-contributors
Community Governance: PROJECT.md
Contributing Guidelines: CONTRIBUTING.md

Maintenance

This document is maintained by the project maintainers and updated as SIGs evolve. For questions or suggestions about SIG structure, please reach out via:

Weekly project standup (Wednesdays 12:30 PM ET)
llm-d Slack channel
GitHub issues in the llm-d/llm-d repository
