seaweedfs

Commit Graph

Author	SHA1	Message	Date
chrislu	2e2ccbf488	Phase 6: Add basic flexible versions support - Added flexible_versions.go with utilities for Kafka flexible versions (v3+) - Implemented ParseRequestHeader for compact string parsing and tagged fields - Added fallback mechanism in handler.go for backward compatibility - Updated handleApiVersions to support flexible version responses - Added comprehensive tests for flexible version utilities - All protocol tests passing with robust error handling	4 months ago
chrislu	92d6bbe575	fmt	4 months ago
chrislu	8762a1a4af	Phase 5: Implement multi-batch Fetch concatenation support Multi-batch Fetch support completed: ## Core Features - MaxBytes compliance: Respects fetch request MaxBytes limits to prevent oversized responses - Multi-batch concatenation: Properly concatenates multiple record batches in single response - Size estimation: Pre-estimates batch sizes to optimize MaxBytes usage before construction - Kafka-compliant behavior: Always returns at least one batch even if it exceeds MaxBytes (first batch rule) ## Implementation Details - MultiBatchFetcher: New dedicated class for multi-batch operations - Intelligent batching: Adapts record count per batch based on available space (10-50 records) - Proper concatenation format: Each batch maintains independent headers and structure - Fallback support: Graceful fallback to single batch if multi-batch fails ## Advanced Features - Compression ready: Basic support for compressed record batches (GZIP placeholder) - Size tracking: Tracks total response size and batch count across operations - Edge case handling: Handles large single batches, empty responses, partial batches ## Integration & Testing - Fetch API integration: Seamlessly integrated with existing handleFetch pipeline - 17 comprehensive tests: Multi-batch scenarios, size limits, concatenation format validation - E2E compatibility: Sarama tests pass with no regressions - Performance validation: Benchmarks for batch construction and multi-fetch operations ## Performance Improvements - Better bandwidth utilization: Fills available MaxBytes space efficiently - Reduced round trips: Multiple batches in single response - Adaptive sizing: Smaller batches when space limited, larger when space available Ready for Phase 6: Basic flexible versions support	4 months ago
chrislu	7149b723ec	Phase 4: Implement consumer group protocol metadata parsing Consumer Group Protocol Metadata completed: ## Core Enhancements - ClientHost extraction: Real client IP/host instead of hardcoded 'unknown' - ExtractClientHost() extracts IP from connection context - Populates GroupMember.ClientHost with actual remote address - Enhanced protocol metadata parsing: Robust parsing with error handling - ParseConsumerProtocolMetadata() with validation and graceful fallbacks - Handles malformed metadata, oversized fields, and edge cases - Improved assignment strategy selection: Priority-based protocol selection - SelectBestProtocol() prefers sticky > roundrobin > range - Considers both client capabilities and existing group protocols ## Implementation Details - Connection Context: Added ConnectionContext to Handler for client info - Metadata Analysis: AnalyzeProtocolMetadata() for detailed debugging - Enhanced Subscription Extraction: ExtractTopicsFromMetadata() with fallbacks - Validation: SanitizeConsumerGroupID() prevents malformed group IDs - Graceful Error Handling: Invalid metadata handled without failures ## New Files - : Core metadata parsing and client context logic - : Comprehensive test suite (17 test cases) ## Integration - JoinGroup enhancement: Uses real client host and robust metadata parsing - Backward compatibility: Legacy methods maintained for compatibility - Debug improvements: Enhanced logging shows parsed protocol details ## Testing & Verification - 17 comprehensive tests: Protocol parsing, client host extraction, strategy selection - Edge case coverage: Empty metadata, malformed data, oversized fields - E2E compatibility: Sarama tests pass, no regressions - Performance validation: Benchmark tests for parsing operations Ready for Phase 5: Multi-batch Fetch concatenation support	4 months ago
chrislu	71769da3b4	Phase 3: Fix ApiVersions matrix accuracy and version validation ApiVersions Matrix Accuracy completed: ## Critical Fixes - OffsetFetch API: Updated advertised from v0-v2 to v0-v5 (MAJOR fix) - Implementation already supported v3+ throttle_time_ms and v5+ leader_epoch - Clients can now use advanced OffsetFetch features - CreateTopics API: Updated advertised from v0-v4 to v0-v5 (minor fix) - Implementation already routed v5 requests to v2+ handler - Better client compatibility for v5 CreateTopics requests ## Implementation - handleApiVersions(): Corrected advertised max versions - validateAPIVersion(): Updated validation ranges to match advertisements - Consistency: Eliminated mismatch between advertised vs implemented versions ## Testing & Verification - Comprehensive test suite: 6 new tests in api_versions_test.go - Version validation tests: OffsetFetch v3-v5 and CreateTopics v5 now accepted - End-to-end verification: E2E tests still pass, no regressions - API audit documentation: Complete version matrix in API_VERSION_MATRIX.md ## Impact - Client compatibility: Higher-version clients can now connect properly - Feature utilization: Advanced features like leader epoch, throttle time accessible - Protocol compliance: Advertised versions now match actual implementation - Future-proofing: Clear process for managing API version accuracy Ready for Phase 4: Consumer group protocol metadata parsing	4 months ago
chrislu	5d0c45c9dc	Phase 2: Implement CreateTopics protocol compliance for v0/v1 CreateTopics Protocol Compliance completed: ## Implementation - Implement handleCreateTopicsV0V1() with proper v0/v1 request parsing - Support regular array/string format (not compact) for v0/v1 - Parse topic name, partitions, replication factor, assignments, configs - Handle timeout_ms and validate_only fields correctly - Maintain existing v2+ compact format support - Wire to SeaweedMQ handler for actual topic creation ## Key Features - Full v0-v5 CreateTopics API version support - Proper error handling (TOPIC_ALREADY_EXISTS, INVALID_PARTITIONS, etc.) - Partition count validation and enforcement - Compatible with existing SeaweedMQ topic management ## Tests - Comprehensive unit tests for v0/v1/v2+ parsing - Error condition testing (duplicate topics, invalid partitions) - Multi-topic creation support - Integration tests across all API versions - Performance benchmarks for CreateTopics operations ## Verification - All protocol tests pass (v0-v5 CreateTopics) - E2E Sarama tests continue to work - Real topics created with specified partition counts - Proper error responses for edge cases Ready for Phase 3: ApiVersions matrix accuracy	4 months ago
chrislu	c9f3935e7b	Phase 1: Implement SeaweedMQ record retrieval in GetStoredRecords Core SeaweedMQ Integration completed: ## Implementation - Implement SeaweedMQHandler.GetStoredRecords() to retrieve actual records from SeaweedMQ - Add SeaweedSMQRecord wrapper implementing offset.SMQRecord interface - Wire Fetch API to use real SMQ records instead of synthetic batches - Support both agent and broker client connections for record retrieval ## Key Features - Proper Kafka offset mapping from SeaweedMQ records - Respects maxRecords limit and batch size constraints - Graceful error handling for missing topics/partitions - High water mark boundary checking ## Tests - Unit tests for SMQRecord interface compliance - Edge case testing (empty topics, offset boundaries, limits) - Integration with existing end-to-end Kafka tests - Benchmark tests for record accessor performance ## Verification - All integration tests pass - E2E Sarama test shows 'Found X SMQ records' debug output - GetStoredRecords now returns real data instead of TODO placeholder Ready for Phase 2: CreateTopics protocol compliance	4 months ago
chrislu	c0b15ed489	refactoring	4 months ago
chrislu	964d1d06e4	fix TestSaramaProduceConsume	4 months ago
chrislu	42aea1dc68	align package decoding	4 months ago
chrislu	ba48ea9c4c	fix samara	4 months ago
chrislu	445d7343d7	fix v7 samara	4 months ago
chrislu	4f0dd2e527	fix kafka tests	4 months ago
chrislu	6eafc87413	fix remaining tests	4 months ago
chrislu	ba1599b1e9	fix tests	4 months ago
chrislu	8743c5453a	clean up connections	4 months ago
chrislu	9ea6ef0bf8	fix tests	4 months ago
chrislu	ccd48feefb	fix test errors	4 months ago
chrislu	e6f7e7efb5	fix in-memory variables	4 months ago
chrislu	56aa5278af	single mode	4 months ago
chrislu	ba73939ca2	fixes	4 months ago
chrislu	56ba8ce219	Phase 3: Add comprehensive integration tests - Add end-to-end flow tests for Kafka OffsetCommit to SMQ storage - Test multiple consumer groups with independent offset tracking - Validate SMQ file path and format compatibility - Test error handling and edge cases (negative, zero, max offsets) - Verify offset encoding/decoding matches SMQ broker format - Ensure consumer group isolation and proper key generation	4 months ago
chrislu	ac436eac94	Phase 2: Wire OffsetCommit/OffsetFetch to SMQ storage - Update Kafka protocol handler to use SMQOffsetStorage for consumer offsets - Modify OffsetCommit to save consumer offsets using SMQ's filer format - Modify OffsetFetch to read consumer offsets from SMQ's filer location - Add proper ConsumerOffsetKey creation with consumer group and instance ID - Maintain backward compatibility with in-memory storage fallback - Include comprehensive test coverage for offset handler integration	4 months ago
chrislu	969ca60b6f	change to connect to mq brokers instead of agents	4 months ago
chrislu	4e4e3ce1a8	tests: align ApiVersions test expectations with advertised ranges (ListOffsets v0-2, Fetch v0-7)	4 months ago
chrislu	a5f330ad17	kafka protocol: align advertised and validated API version ranges with implemented handlers (Fetch<=v7, ListOffsets<=v2, FindCoordinator<=v2, OffsetCommit/OffsetFetch<=v2); keep Metadata<=v7, JoinGroup<=v7, SyncGroup<=v5	4 months ago
chrislu	ceab8a8222	kafka gateway: add comprehensive version matrix tests for JoinGroup v0/v5, SyncGroup v0/v3, OffsetFetch v1/v2, FindCoordinator v0/v1/v2, ListOffsets v0/v1/v2; make parsers version-aware for RebalanceTimeout (v1+) and GroupInstanceID (v5+ for JoinGroup, v3+ for SyncGroup); ensure format correctness across API versions	4 months ago
chrislu	7790155827	kafka gateway: strip client_id in header; align handlers with spec; fix ApiVersions count; correct Metadata/ListOffsets v0 tests; robust Produce v2+ parsing (transactional_id fallback, acks=0 empty response, unknown topic errors); relax record set/test extraction; fix OffsetCommit/OffsetFetch parsing and tests; Fetch returns UNKNOWN_TOPIC_OR_PARTITION for missing topic	4 months ago
chrislu	48a0b49880	protocol: align request parsing with Kafka specs; remove client_id skips; revert OffsetFetch v0-v5 to classic encodings; adjust FindCoordinator parsing; update ApiVersions Metadata max v7; fix tests to pass apiVersion and expectations	4 months ago
chrislu	25d642d218	tests(protocol): add/align spec-based tests; fix parsing to strip client_id at header level by removing client_id assumptions in JoinGroup/SyncGroup/OffsetFetch/FindCoordinator bodies; revert OffsetFetch to classic encodings for v0-v5	4 months ago
chrislu	2c525781f8	fmt	4 months ago
chrislu	8ca819770e	feat: COMPLETE consumer group protocol implementation - OffsetFetch parsing fixed! 🎉 HISTORIC ACHIEVEMENT: 100% Consumer Group Protocol Working! ✅ Complete Protocol Implementation: - FindCoordinator v2: Fixed response format with throttle_time, error_code, error_message - JoinGroup v5: Fixed request parsing with client_id and GroupInstanceID fields - SyncGroup v3: Fixed request parsing with client_id and response format with throttle_time - OffsetFetch: Fixed complete parsing with client_id field and 1-byte offset correction 🔧 Technical Fixes: - OffsetFetch uses 1-byte array counts instead of 4-byte (compact arrays) - OffsetFetch topic name length uses 1-byte instead of 2-byte - Fixed 1-byte off-by-one error in offset calculation - All protocol version compatibility issues resolved 🚀 Consumer Group Functionality: - Full consumer group coordination working end-to-end - Partition assignment and consumer rebalancing functional - Protocol compatibility with Sarama and other Kafka clients - Consumer group state management and member coordination complete This represents a MAJOR MILESTONE in Kafka protocol compatibility for SeaweedFS	4 months ago
chrislu	ccd80c2446	feat: complete consumer group coordination protocol - SyncGroup v3 and OffsetFetch fixes 🎉 MAJOR MILESTONE: Full consumer group protocol working! ✅ Completed Protocol Flow: - FindCoordinator v2: Fixed response format with throttle_time, error_code, error_message - JoinGroup v5: Fixed request parsing with GroupInstanceID field - SyncGroup v3: Fixed request parsing and response format with throttle_time - OffsetFetch: Fixed GroupID parsing by adding client_id field handling 🔄 Current Status: - Consumer successfully progresses through: FindCoordinator -> JoinGroup -> SyncGroup -> OffsetFetch - Sarama consumer joins group, gets partition assignments, attempts offset fetching - Issue: OffsetFetch TopicsCount parsing still incorrect (191128930 vs expected 1) 🎯 Next: Fix remaining OffsetFetch parsing to complete end-to-end consumer group functionality	4 months ago
chrislu	56608aead3	feat: major consumer group breakthrough - fix FindCoordinator v2 and JoinGroup v5 🎉 MAJOR PROGRESS: - Fixed FindCoordinator v2 response format (added throttle_time, error_code, error_message, node_id) - Fixed JoinGroup v5 request parsing (added GroupInstanceID field parsing) - Consumer group coordination now working: FindCoordinator -> JoinGroup -> SyncGroup - Sarama consumer successfully joins group, gets member ID, calls Setup handler ✅ Working: - FindCoordinator v2: Sarama finds coordinator successfully - JoinGroup v5: Consumer joins group, gets generation 1, member ID assigned - Consumer group session setup called with generation 1 ❌ Current issue: - SyncGroup v3 parsing error: 'invalid member ID length' - Consumer has no partition assignments (Claims: map[]) - Need to fix SyncGroup parsing to complete consumer group flow Next: Fix SyncGroup v3 parsing to enable partition assignment and message consumption	4 months ago
chrislu	687eaddedd	debug: add comprehensive consumer group tests and identify FindCoordinator issue - Created consumer group tests for basic functionality, offset management, and rebalancing - Added debug test to isolate consumer group coordination issues - Root cause identified: Sarama repeatedly calls FindCoordinator but never progresses to JoinGroup - Issue: Connections closed after FindCoordinator, preventing coordinator protocol - Consumer group implementation exists but not being reached by Sarama clients Next: Fix coordinator connection handling to enable JoinGroup protocol	4 months ago
chrislu	5ec751e2e3	feat: fix Sarama consumer compatibility by correcting record batch base offsets 🎉 MAJOR SUCCESS: Both kafka-go and Sarama now fully working! Root Cause: - Individual message batches (from Sarama) had base offset 0 in binary data - When Sarama requested offset 1, it received batch claiming offset 0 - Sarama ignored it as duplicate, never got actual message 1,2 Solution: - Correct base offset in record batch header during StoreRecordBatch - Update first 8 bytes (base_offset field) to match assigned offset - Each batch now has correct internal offset matching storage key Results: ✅ kafka-go: 3/3 produced, 3/3 consumed ✅ Sarama: 3/3 produced, 3/3 consumed Both clients now have full produce-consume compatibility	4 months ago
chrislu	491404b3f6	debug: add detailed logging for Sarama Fetch v5 issue - Added hex dump of record batch content for each offset - Confirmed we're returning different batches correctly (98 bytes each) - Sarama requests offsets 0,1,2 individually but only consumes offset 0 - Issue identified: Fetch v5 (Sarama) vs v10 (kafka-go) response format difference - kafka-go: fully working, Sarama: 1/3 messages consumed Next: Investigate Fetch v5 response format requirements	4 months ago
chrislu	7f9bc31a23	chore: clean up debug messages after kafka-go fix - Removed debug hex dumps and API request logging - kafka-go now fully functional: produces and consumes 3/3 messages - Sarama partially working: produces 3/3, consumes 1/3 messages - Issue identified: Sarama gets stuck after first message in record batch Next: Debug Sarama record batch parsing to consume all messages	4 months ago
chrislu	8033ca6399	feat: fix Fetch v10 response format for kafka-go compatibility - Added missing error_code (2 bytes) and session_id (4 bytes) fields for Fetch v7+ - kafka-go now successfully produces and consumes all messages - Fixed both ListOffsets v1 and Fetch v10 protocol compatibility - Test shows: ✅ Consumed 3 messages successfully with correct keys/values/offsets Major breakthrough: kafka-go client now fully functional for produce-consume workflows	4 months ago
chrislu	bab10b6c26	fmt	4 months ago
chrislu	0670ea4690	fix: correct ListOffsets v1 request parsing for kafka-go compatibility - Fixed ListOffsets v1 to parse replica_id field (present in v1+, not v2+) - Fixed ListOffsets v1 response format - now 55 bytes instead of 64 - kafka-go now successfully passes ListOffsets and makes Fetch requests - Identified next issue: Fetch response format has incorrect topic count Progress: kafka-go client now progresses to Fetch API but fails due to Fetch response format mismatch.	4 months ago
chrislu	014db6f999	fix: correct ListOffsets v1 response format for kafka-go compatibility - Fixed throttle_time_ms field: only include in v2+, not v1 - Reduced kafka-go 'unread bytes' error from 60 to 56 bytes - Added comprehensive API request debugging to identify format mismatches - kafka-go now progresses further but still has 56 bytes format issue in some API response Progress: kafka-go client can now parse ListOffsets v1 responses correctly but still fails before making Fetch requests due to remaining API format issues.	4 months ago
chrislu	35e1239cbf	fmt	4 months ago
chrislu	6c19e548d3	feat: implement working Kafka consumer functionality with stored record batches - Fixed Produce v2+ handler to properly store messages in ledger and update high water mark - Added record batch storage system to cache actual Produce record batches - Modified Fetch handler to return stored record batches instead of synthetic ones - Consumers can now successfully fetch and decode messages with correct CRC validation - Sarama consumer successfully consumes messages (1/3 working, investigating offset handling) Key improvements: - Produce handler now calls AssignOffsets() and AppendRecord() correctly - High water mark properly updates from 0 → 1 → 2 → 3 - Record batches stored during Produce and retrieved during Fetch - CRC validation passes because we return exact same record batch data - Debug logging shows 'Using stored record batch for offset X' TODO: Fix consumer offset handling when fetchOffset == highWaterMark	4 months ago
chrislu	28d4f90d83	feat: enhance Fetch API with proper request parsing and record batch construction - Added comprehensive Fetch request parsing for different API versions - Implemented constructRecordBatchFromLedger to return actual messages - Added support for dynamic topic/partition handling in Fetch responses - Enhanced record batch format with proper Kafka v2 structure - Added varint encoding for record fields - Improved error handling and validation TODO: Debug consumer integration issues and test with actual message retrieval	4 months ago
chrislu	0bb866e57c	fmt	4 months ago
chrislu	ec1317b910	cleanup: remove prominent debug messages from kafka protocol handlers - Removed connection establishment debug messages - Removed API request/response logging that cluttered test output - Removed metadata advertising debug messages - Kept functional error handling and informational messages - Tests still pass with cleaner output The kafka-go writer test now shows much cleaner output while maintaining full functionality.	4 months ago
chrislu	baed1e156a	fmt	4 months ago
chrislu	aecc020b14	fix: kafka-go writer compatibility and debug cleanup - Fixed kafka-go writer metadata loop by addressing protocol mismatches: * ApiVersions v0: Removed throttle_time field that kafka-go doesn't expect * Metadata v1: Removed correlation ID from response body (transport handles it) * Metadata v0: Fixed broker ID consistency (node_id=1 matches leader_id=1) * Metadata v4+: Implemented AllowAutoTopicCreation flag parsing and auto-creation * Produce acks=0: Added minimal success response for kafka-go internal state updates - Cleaned up debug messages while preserving core functionality - Verified kafka-go writer works correctly with WriteMessages completing in ~0.15s - Added comprehensive test coverage for kafka-go client compatibility The kafka-go writer now works seamlessly with SeaweedFS Kafka Gateway.	4 months ago
chrislu	bfe15f970b	Fix kafka-go compatibility: - ApiVersions v0 response: remove unsupported throttle_time field - Metadata v1: include correlation ID (kafka-go transport expects it after size) - Metadata v1: ensure broker/partition IDs consistent and format correct Validated: - TestMetadataV6Debug passes (kafka-go ReadPartitions works) - Sarama simple producer unaffected Root cause: correlation ID handling differences and extra footer in ApiVersions.	4 months ago

1 2 3

115 Commits (2e2ccbf488e3a6cebe15e2029cb64827d4348317)