Special Interest Groups (SIGs)
This content is automatically synced from SIGS.md in the llm-d/llm-d repository.
SIG Overview
Special Interest Groups (SIGs) are the primary organizational units for coordinating work across the llm-d project. Each SIG focuses on a specific area of the project's technology stack and is responsible for driving the design, implementation, and maintenance of its components.
SIGs provide a mechanism for:
- Focused expertise: Bringing together contributors with specialized knowledge in specific areas
- Coordinated development: Ensuring consistent architectural decisions across related components
- Community building: Creating smaller, more manageable groups for collaboration and mentorship
- Accountability: Establishing clear ownership and responsibility for specific project areas
SIG Structure and Governance
SIG Leadership
Each SIG has:
- SIG Leads (2-3 people): Responsible for overall SIG direction, coordination, and decision-making
SIG Responsibilities
- Drive technical design and implementation in their area
- Maintain documentation and architectural decisions
- Coordinate with other SIGs on cross-cutting concerns
- Mentor new contributors and grow the community
- Participate in project-wide planning and releases
SIG Meetings
- Regular meetings (typically weekly) for technical discussions
Relationship to Project Governance
SIGs operate within the broader llm-d project governance framework defined in PROJECT.md:
- SIGs follow the project's lazy consensus decision-making process
- Major cross-SIG decisions require project maintainer approval
- All SIG work follows the project's contribution guidelines
Active Special Interest Groups
| SIG | Focus Area | Meeting Schedule | Documentation |
|---|---|---|---|
| SIG Inference Scheduler | Intelligent request routing, load balancing, and traffic management | Weekly, Tuesdays 12:00 PM ET | Meeting Recordings and Docs; llm-d-inference-scheduler Repository |
| SIG Benchmarking | Performance testing, benchmarking frameworks, and optimization | Weekly, Thursdays 1:00 PM ET | Meeting Recordings and Docs; llm-d-benchmark Repository |
| SIG PD-Disaggregation | Prefill/decode separation, distributed serving, and workload disaggregation | Weekly, Tuesdays 1:30 PM ET | Meeting Recordings and Docs |
| SIG KV-Disaggregation | KV caching, prefix caching, and distributed storage systems | Weekly, Tuesdays 12:00 PM ET | Meeting Recordings and Docs; llm-d-kv-cache-manager Repository |
| SIG Installation | Kubernetes integration, deployment tooling, and platform operations | Weekly, Thursdays 11:00 AM ET | Meeting Recordings and Docs |
| SIG Autoscaling | Traffic-aware autoscaling, resource management, and capacity planning | Weekly, Wednesdays 2:00 PM ET | Meeting Recordings and Docs |
| SIG Observability | Monitoring, logging, metrics, and operational visibility | Weekly, Thursdays 12:30 PM ET | Meeting Recordings and Docs |
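All meeting times above are given in ET. For contributors in other time zones, the following is a minimal sketch of how a listed slot could be converted locally. It assumes that "ET" corresponds to the IANA zone `America/New_York`. The helper name `next_meeting_local` and its parameters are illustrative and not part of any llm-d tooling.

```python
from datetime import date, datetime, time, timedelta
from zoneinfo import ZoneInfo

# Assumption: "ET" in the table maps to the IANA zone America/New_York.
EASTERN = ZoneInfo("America/New_York")


def next_meeting_local(weekday: int, hour: int, minute: int,
                       local_tz: str, today: date) -> datetime:
    """Return the next meeting on or after `today`, expressed in `local_tz`.

    `weekday` follows the datetime convention: Monday is 0, Sunday is 6.
    `hour` and `minute` are the meeting time in ET, on a 24-hour clock.
    """
    # Number of days until the next occurrence of the meeting weekday.
    days_ahead = (weekday - today.weekday()) % 7
    meeting_day = today + timedelta(days=days_ahead)
    # Build the meeting time in ET, then convert to the requested zone.
    eastern = datetime.combine(meeting_day, time(hour, minute), tzinfo=EASTERN)
    return eastern.astimezone(ZoneInfo(local_tz))


# Example: SIG Autoscaling meets Wednesdays at 2:00 PM ET.
meeting = next_meeting_local(2, 14, 0, "Europe/Berlin", date(2025, 1, 6))
print(meeting.isoformat())  # 2025-01-08T20:00:00+01:00
```

Because the conversion goes through the time zone database, daylight saving changes on either side are handled automatically. The shared project calendar remains the authoritative source for meeting times.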
SIG Detailed Descriptions
SIG Inference Scheduler
👥 Leadership: Nili Guy, Abdullah Gharaibeh, Vita Bortnikov
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Develop and maintain intelligent request routing and load balancing systems that optimize for latency, throughput, and resource utilization across distributed inference workloads.
Key Areas:
- vLLM-optimized inference scheduling algorithms
- KV-cache aware routing and load balancing
- Integration with Kubernetes Gateway API and Inference Gateway Extension
- Flow control and traffic shaping
- SLA-aware request prioritization
💬 Communication:
- Slack Channel: #sig-inference-scheduler
- Meeting Recordings and Docs: Public Google Drive
SIG Benchmarking
👥 Leadership: Marcio A L Silva, Ashok Chandrasekar
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Establish comprehensive performance testing and benchmarking frameworks to ensure llm-d delivers optimal performance across diverse workloads and hardware configurations.
Key Areas:
- Benchmarking frameworks and methodologies
- Performance regression testing
- Workload simulation and synthetic data generation
- Hardware-specific optimization
- Performance analysis and profiling tools
💬 Communication:
- Slack Channel: #sig-benchmarking
- Meeting Recordings and Docs: Public Google Drive
SIG PD-Disaggregation
👥 Leadership: Robert Shaw
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Design and implement prefill/decode disaggregation patterns that enable efficient separation of inference workloads across heterogeneous hardware and scaling requirements.
Key Areas:
- Prefill/decode workload separation
- Disaggregated serving architecture
- Cross-instance communication protocols
- Heterogeneous hardware optimization
- Dynamic workload balancing between prefill and decode instances
💬 Communication:
- Slack Channel: #sig-pd-disaggregation
- Meeting Recordings and Docs: Public Google Drive
SIG KV-Disaggregation
👥 Leadership: Maroon Ayoub, Danny Harnik
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Design and implement distributed KV caching solutions that improve inference performance through intelligent cache management, prefix sharing, and disaggregated storage.
Key Areas:
- Distributed KV cache architecture
- Prefix cache hierarchies (local, remote, shared)
- Cache-aware scheduling and routing
- Storage optimization for inference workloads
- Integration with vLLM's KVConnector
💬 Communication:
- Slack Channel: #sig-kv-disaggregation
- Meeting Recordings and Docs: Public Google Drive
SIG Installation
👥 Leadership: Brent Salisbury, Greg Pereira
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Ensure llm-d integrates seamlessly with Kubernetes and provides robust deployment, scaling, and operational capabilities for production environments.
Key Areas:
- Kubernetes-native deployment patterns
- Helm charts and operators
- Installation and configuration management
- Multi-node orchestration with LeaderWorkerSet
- Platform integration and operational best practices
💬 Communication:
- Slack Channel: #sig-installation
- Meeting Recordings and Docs: Public Google Drive
SIG Autoscaling
👥 Leadership: Tamar Eilam, Abhishek Malvankar
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Develop intelligent autoscaling solutions that automatically adjust llm-d deployments based on traffic patterns, workload characteristics, and hardware utilization.
Key Areas:
- Traffic-aware autoscaling algorithms
- Hardware-specific scaling policies
- Workload-based capacity planning
- Integration with Kubernetes HPA/VPA
- Cost-optimized scaling strategies
💬 Communication:
- Slack Channel: #sig-autoscaling
- Meeting Recordings and Docs: Public Google Drive
SIG Observability
👥 Leadership: Sally O'Malley, Roy Nissim, Benedikt Bongartz
⭐️ North Star Design Document ↗️ (Google Docs)
Charter: Provide comprehensive monitoring, logging, and observability capabilities that enable operators to understand system behavior, diagnose issues, and optimize performance.
Key Areas:
- Metrics collection and visualization
- Distributed tracing and logging
- Performance monitoring and alerting
- Operational dashboards and reporting
- Integration with monitoring ecosystems (Prometheus, Grafana, etc.)
💬 Communication:
- Slack Channel: #sig-observability
- Meeting Recordings and Docs: Public Google Drive
Getting Involved
Joining a SIG
- Attend a meeting: Check the project calendar for SIG meeting times
- Join the conversation: Participate in SIG-specific channels on Slack
- Review documentation: Read the SIG's charter and current initiatives
- Start contributing: Look for issues labeled "good first issue" in the SIG's area
SIG Communication Channels
- Slack: Each SIG has dedicated channels in the llm-d Slack workspace
- Google Groups: Join llm-d-contributors for comment access to SIG documents
- GitHub: Issues and discussions are labeled by SIG area
- Calendar: All SIG meetings are on the shared project calendar
SIG Formation and Evolution
Creating a New SIG
- Identify need: Demonstrate community interest and technical necessity
- Draft charter: Define scope, goals, and initial leadership
- Proposal process: Submit proposal following project contribution guidelines
- Community review: Present at the weekly project standup and gather feedback
- Approval: Obtain approval from project maintainers
SIG Lifecycle Management
- Active: Regular meetings, active development, engaged community
- Maintenance: Limited active development, focus on stability and bug fixes
- Archived: No longer active, historical reference only
SIGs may evolve, merge, or be archived based on project needs and community engagement.
Resources
- Project Calendar: llm-d Public Calendar
- Slack Workspace: https://llm-d.slack.com
- Google Groups: https://groups.google.com/g/llm-d-contributors
- Community Governance: PROJECT.md
- Contributing Guidelines: CONTRIBUTING.md
Maintenance
This document is maintained by the project maintainers and updated as SIGs evolve. For questions or suggestions about SIG structure, please reach out via:
- Weekly project standup (Wednesdays 12:30 PM ET)
- llm-d Slack channel
- GitHub issues in the llm-d/llm-d repository