seaweedfs

Commit Graph

Author	SHA1	Message	Date
chrislu	ddbbbaa1dd	debug	6 days ago
chrislu	f5871524be	add debug messages	6 days ago
chrislu	8e69446112	fix tests	6 days ago
chrislu	368f526b9f	fix Incorrect flexible version mapping	6 days ago
chrislu	93c3e0c784	fix tests	6 days ago
chrislu	2f040a0fe4	fix undefined method errors	6 days ago
chrislu	d7e1c83ca8	Update Produce v2+ to store all records from batch to SMQ Update Produce v2+ to store all records from batch to SMQ	6 days ago
chrislu	a8bee174f4	Update fetch.go Skip long-polling if any requested topic does not exist. Only long-poll when MinBytes > 0, data isn’t available yet, and all topics exist. Cap the long-polling wait to 1s in tests to prevent hanging on shutdown.	6 days ago
chrislu	9a4ad5047b	Update handler.go Busy fetch loop: Implemented basic long-polling in Fetch. If no data and min_bytes>0 with max_wait_ms>0, we wait up to max_wait_ms, and populate throttle_time_ms accordingly. This stops the rapid loop for kafka-go on empty partitions.	6 days ago
chrislu	bbc8668fd6	Fix kafka-go client infinite polling loop	6 days ago
chrislu	297c662191	Phase 7: Comprehensive error handling and edge cases - Added centralized errors.go with complete Kafka error code definitions - Implemented timeout detection and network error classification - Enhanced connection handling with configurable timeouts and better error reporting - Added comprehensive error handling test suite with 21 test cases - Unified error code usage across all protocol handlers - Improved request/response timeout handling with graceful fallbacks - All protocol and E2E tests passing with robust error handling	6 days ago
chrislu	2e2ccbf488	Phase 6: Add basic flexible versions support - Added flexible_versions.go with utilities for Kafka flexible versions (v3+) - Implemented ParseRequestHeader for compact string parsing and tagged fields - Added fallback mechanism in handler.go for backward compatibility - Updated handleApiVersions to support flexible version responses - Added comprehensive tests for flexible version utilities - All protocol tests passing with robust error handling	6 days ago
chrislu	92d6bbe575	fmt	6 days ago
chrislu	8762a1a4af	Phase 5: Implement multi-batch Fetch concatenation support Multi-batch Fetch support completed: ## Core Features - MaxBytes compliance: Respects fetch request MaxBytes limits to prevent oversized responses - Multi-batch concatenation: Properly concatenates multiple record batches in single response - Size estimation: Pre-estimates batch sizes to optimize MaxBytes usage before construction - Kafka-compliant behavior: Always returns at least one batch even if it exceeds MaxBytes (first batch rule) ## Implementation Details - MultiBatchFetcher: New dedicated class for multi-batch operations - Intelligent batching: Adapts record count per batch based on available space (10-50 records) - Proper concatenation format: Each batch maintains independent headers and structure - Fallback support: Graceful fallback to single batch if multi-batch fails ## Advanced Features - Compression ready: Basic support for compressed record batches (GZIP placeholder) - Size tracking: Tracks total response size and batch count across operations - Edge case handling: Handles large single batches, empty responses, partial batches ## Integration & Testing - Fetch API integration: Seamlessly integrated with existing handleFetch pipeline - 17 comprehensive tests: Multi-batch scenarios, size limits, concatenation format validation - E2E compatibility: Sarama tests pass with no regressions - Performance validation: Benchmarks for batch construction and multi-fetch operations ## Performance Improvements - Better bandwidth utilization: Fills available MaxBytes space efficiently - Reduced round trips: Multiple batches in single response - Adaptive sizing: Smaller batches when space limited, larger when space available Ready for Phase 6: Basic flexible versions support	6 days ago
chrislu	7149b723ec	Phase 4: Implement consumer group protocol metadata parsing Consumer Group Protocol Metadata completed: ## Core Enhancements - ClientHost extraction: Real client IP/host instead of hardcoded 'unknown' - ExtractClientHost() extracts IP from connection context - Populates GroupMember.ClientHost with actual remote address - Enhanced protocol metadata parsing: Robust parsing with error handling - ParseConsumerProtocolMetadata() with validation and graceful fallbacks - Handles malformed metadata, oversized fields, and edge cases - Improved assignment strategy selection: Priority-based protocol selection - SelectBestProtocol() prefers sticky > roundrobin > range - Considers both client capabilities and existing group protocols ## Implementation Details - Connection Context: Added ConnectionContext to Handler for client info - Metadata Analysis: AnalyzeProtocolMetadata() for detailed debugging - Enhanced Subscription Extraction: ExtractTopicsFromMetadata() with fallbacks - Validation: SanitizeConsumerGroupID() prevents malformed group IDs - Graceful Error Handling: Invalid metadata handled without failures ## New Files - : Core metadata parsing and client context logic - : Comprehensive test suite (17 test cases) ## Integration - JoinGroup enhancement: Uses real client host and robust metadata parsing - Backward compatibility: Legacy methods maintained for compatibility - Debug improvements: Enhanced logging shows parsed protocol details ## Testing & Verification - 17 comprehensive tests: Protocol parsing, client host extraction, strategy selection - Edge case coverage: Empty metadata, malformed data, oversized fields - E2E compatibility: Sarama tests pass, no regressions - Performance validation: Benchmark tests for parsing operations Ready for Phase 5: Multi-batch Fetch concatenation support	6 days ago
chrislu	71769da3b4	Phase 3: Fix ApiVersions matrix accuracy and version validation ApiVersions Matrix Accuracy completed: ## Critical Fixes - OffsetFetch API: Updated advertised from v0-v2 to v0-v5 (MAJOR fix) - Implementation already supported v3+ throttle_time_ms and v5+ leader_epoch - Clients can now use advanced OffsetFetch features - CreateTopics API: Updated advertised from v0-v4 to v0-v5 (minor fix) - Implementation already routed v5 requests to v2+ handler - Better client compatibility for v5 CreateTopics requests ## Implementation - handleApiVersions(): Corrected advertised max versions - validateAPIVersion(): Updated validation ranges to match advertisements - Consistency: Eliminated mismatch between advertised vs implemented versions ## Testing & Verification - Comprehensive test suite: 6 new tests in api_versions_test.go - Version validation tests: OffsetFetch v3-v5 and CreateTopics v5 now accepted - End-to-end verification: E2E tests still pass, no regressions - API audit documentation: Complete version matrix in API_VERSION_MATRIX.md ## Impact - Client compatibility: Higher-version clients can now connect properly - Feature utilization: Advanced features like leader epoch, throttle time accessible - Protocol compliance: Advertised versions now match actual implementation - Future-proofing: Clear process for managing API version accuracy Ready for Phase 4: Consumer group protocol metadata parsing	6 days ago
chrislu	5d0c45c9dc	Phase 2: Implement CreateTopics protocol compliance for v0/v1 CreateTopics Protocol Compliance completed: ## Implementation - Implement handleCreateTopicsV0V1() with proper v0/v1 request parsing - Support regular array/string format (not compact) for v0/v1 - Parse topic name, partitions, replication factor, assignments, configs - Handle timeout_ms and validate_only fields correctly - Maintain existing v2+ compact format support - Wire to SeaweedMQ handler for actual topic creation ## Key Features - Full v0-v5 CreateTopics API version support - Proper error handling (TOPIC_ALREADY_EXISTS, INVALID_PARTITIONS, etc.) - Partition count validation and enforcement - Compatible with existing SeaweedMQ topic management ## Tests - Comprehensive unit tests for v0/v1/v2+ parsing - Error condition testing (duplicate topics, invalid partitions) - Multi-topic creation support - Integration tests across all API versions - Performance benchmarks for CreateTopics operations ## Verification - All protocol tests pass (v0-v5 CreateTopics) - E2E Sarama tests continue to work - Real topics created with specified partition counts - Proper error responses for edge cases Ready for Phase 3: ApiVersions matrix accuracy	6 days ago
chrislu	c9f3935e7b	Phase 1: Implement SeaweedMQ record retrieval in GetStoredRecords Core SeaweedMQ Integration completed: ## Implementation - Implement SeaweedMQHandler.GetStoredRecords() to retrieve actual records from SeaweedMQ - Add SeaweedSMQRecord wrapper implementing offset.SMQRecord interface - Wire Fetch API to use real SMQ records instead of synthetic batches - Support both agent and broker client connections for record retrieval ## Key Features - Proper Kafka offset mapping from SeaweedMQ records - Respects maxRecords limit and batch size constraints - Graceful error handling for missing topics/partitions - High water mark boundary checking ## Tests - Unit tests for SMQRecord interface compliance - Edge case testing (empty topics, offset boundaries, limits) - Integration with existing end-to-end Kafka tests - Benchmark tests for record accessor performance ## Verification - All integration tests pass - E2E Sarama test shows 'Found X SMQ records' debug output - GetStoredRecords now returns real data instead of TODO placeholder Ready for Phase 2: CreateTopics protocol compliance	6 days ago
chrislu	c0b15ed489	refactoring	6 days ago
chrislu	964d1d06e4	fix TestSaramaProduceConsume	6 days ago
chrislu	42aea1dc68	align package decoding	6 days ago
chrislu	ba48ea9c4c	fix samara	6 days ago
chrislu	445d7343d7	fix v7 samara	6 days ago
chrislu	4f0dd2e527	fix kafka tests	6 days ago
chrislu	6eafc87413	fix remaining tests	6 days ago
chrislu	ba1599b1e9	fix tests	6 days ago
chrislu	8743c5453a	clean up connections	6 days ago
chrislu	9ea6ef0bf8	fix tests	6 days ago
chrislu	ccd48feefb	fix test errors	7 days ago
chrislu	e6f7e7efb5	fix in-memory variables	7 days ago
chrislu	56aa5278af	single mode	7 days ago
chrislu	ba73939ca2	fixes	7 days ago
chrislu	56ba8ce219	Phase 3: Add comprehensive integration tests - Add end-to-end flow tests for Kafka OffsetCommit to SMQ storage - Test multiple consumer groups with independent offset tracking - Validate SMQ file path and format compatibility - Test error handling and edge cases (negative, zero, max offsets) - Verify offset encoding/decoding matches SMQ broker format - Ensure consumer group isolation and proper key generation	7 days ago
chrislu	ac436eac94	Phase 2: Wire OffsetCommit/OffsetFetch to SMQ storage - Update Kafka protocol handler to use SMQOffsetStorage for consumer offsets - Modify OffsetCommit to save consumer offsets using SMQ's filer format - Modify OffsetFetch to read consumer offsets from SMQ's filer location - Add proper ConsumerOffsetKey creation with consumer group and instance ID - Maintain backward compatibility with in-memory storage fallback - Include comprehensive test coverage for offset handler integration	7 days ago
chrislu	969ca60b6f	change to connect to mq brokers instead of agents	7 days ago
chrislu	4e4e3ce1a8	tests: align ApiVersions test expectations with advertised ranges (ListOffsets v0-2, Fetch v0-7)	1 week ago
chrislu	a5f330ad17	kafka protocol: align advertised and validated API version ranges with implemented handlers (Fetch<=v7, ListOffsets<=v2, FindCoordinator<=v2, OffsetCommit/OffsetFetch<=v2); keep Metadata<=v7, JoinGroup<=v7, SyncGroup<=v5	1 week ago
chrislu	ceab8a8222	kafka gateway: add comprehensive version matrix tests for JoinGroup v0/v5, SyncGroup v0/v3, OffsetFetch v1/v2, FindCoordinator v0/v1/v2, ListOffsets v0/v1/v2; make parsers version-aware for RebalanceTimeout (v1+) and GroupInstanceID (v5+ for JoinGroup, v3+ for SyncGroup); ensure format correctness across API versions	1 week ago
chrislu	7790155827	kafka gateway: strip client_id in header; align handlers with spec; fix ApiVersions count; correct Metadata/ListOffsets v0 tests; robust Produce v2+ parsing (transactional_id fallback, acks=0 empty response, unknown topic errors); relax record set/test extraction; fix OffsetCommit/OffsetFetch parsing and tests; Fetch returns UNKNOWN_TOPIC_OR_PARTITION for missing topic	1 week ago
chrislu	48a0b49880	protocol: align request parsing with Kafka specs; remove client_id skips; revert OffsetFetch v0-v5 to classic encodings; adjust FindCoordinator parsing; update ApiVersions Metadata max v7; fix tests to pass apiVersion and expectations	1 week ago
chrislu	25d642d218	tests(protocol): add/align spec-based tests; fix parsing to strip client_id at header level by removing client_id assumptions in JoinGroup/SyncGroup/OffsetFetch/FindCoordinator bodies; revert OffsetFetch to classic encodings for v0-v5	1 week ago
chrislu	2c525781f8	fmt	1 week ago
chrislu	8ca819770e	feat: COMPLETE consumer group protocol implementation - OffsetFetch parsing fixed! 🎉 HISTORIC ACHIEVEMENT: 100% Consumer Group Protocol Working! ✅ Complete Protocol Implementation: - FindCoordinator v2: Fixed response format with throttle_time, error_code, error_message - JoinGroup v5: Fixed request parsing with client_id and GroupInstanceID fields - SyncGroup v3: Fixed request parsing with client_id and response format with throttle_time - OffsetFetch: Fixed complete parsing with client_id field and 1-byte offset correction 🔧 Technical Fixes: - OffsetFetch uses 1-byte array counts instead of 4-byte (compact arrays) - OffsetFetch topic name length uses 1-byte instead of 2-byte - Fixed 1-byte off-by-one error in offset calculation - All protocol version compatibility issues resolved 🚀 Consumer Group Functionality: - Full consumer group coordination working end-to-end - Partition assignment and consumer rebalancing functional - Protocol compatibility with Sarama and other Kafka clients - Consumer group state management and member coordination complete This represents a MAJOR MILESTONE in Kafka protocol compatibility for SeaweedFS	1 week ago
chrislu	ccd80c2446	feat: complete consumer group coordination protocol - SyncGroup v3 and OffsetFetch fixes 🎉 MAJOR MILESTONE: Full consumer group protocol working! ✅ Completed Protocol Flow: - FindCoordinator v2: Fixed response format with throttle_time, error_code, error_message - JoinGroup v5: Fixed request parsing with GroupInstanceID field - SyncGroup v3: Fixed request parsing and response format with throttle_time - OffsetFetch: Fixed GroupID parsing by adding client_id field handling 🔄 Current Status: - Consumer successfully progresses through: FindCoordinator -> JoinGroup -> SyncGroup -> OffsetFetch - Sarama consumer joins group, gets partition assignments, attempts offset fetching - Issue: OffsetFetch TopicsCount parsing still incorrect (191128930 vs expected 1) 🎯 Next: Fix remaining OffsetFetch parsing to complete end-to-end consumer group functionality	1 week ago
chrislu	56608aead3	feat: major consumer group breakthrough - fix FindCoordinator v2 and JoinGroup v5 🎉 MAJOR PROGRESS: - Fixed FindCoordinator v2 response format (added throttle_time, error_code, error_message, node_id) - Fixed JoinGroup v5 request parsing (added GroupInstanceID field parsing) - Consumer group coordination now working: FindCoordinator -> JoinGroup -> SyncGroup - Sarama consumer successfully joins group, gets member ID, calls Setup handler ✅ Working: - FindCoordinator v2: Sarama finds coordinator successfully - JoinGroup v5: Consumer joins group, gets generation 1, member ID assigned - Consumer group session setup called with generation 1 ❌ Current issue: - SyncGroup v3 parsing error: 'invalid member ID length' - Consumer has no partition assignments (Claims: map[]) - Need to fix SyncGroup parsing to complete consumer group flow Next: Fix SyncGroup v3 parsing to enable partition assignment and message consumption	1 week ago
chrislu	687eaddedd	debug: add comprehensive consumer group tests and identify FindCoordinator issue - Created consumer group tests for basic functionality, offset management, and rebalancing - Added debug test to isolate consumer group coordination issues - Root cause identified: Sarama repeatedly calls FindCoordinator but never progresses to JoinGroup - Issue: Connections closed after FindCoordinator, preventing coordinator protocol - Consumer group implementation exists but not being reached by Sarama clients Next: Fix coordinator connection handling to enable JoinGroup protocol	1 week ago
chrislu	5ec751e2e3	feat: fix Sarama consumer compatibility by correcting record batch base offsets 🎉 MAJOR SUCCESS: Both kafka-go and Sarama now fully working! Root Cause: - Individual message batches (from Sarama) had base offset 0 in binary data - When Sarama requested offset 1, it received batch claiming offset 0 - Sarama ignored it as duplicate, never got actual message 1,2 Solution: - Correct base offset in record batch header during StoreRecordBatch - Update first 8 bytes (base_offset field) to match assigned offset - Each batch now has correct internal offset matching storage key Results: ✅ kafka-go: 3/3 produced, 3/3 consumed ✅ Sarama: 3/3 produced, 3/3 consumed Both clients now have full produce-consume compatibility	1 week ago
chrislu	491404b3f6	debug: add detailed logging for Sarama Fetch v5 issue - Added hex dump of record batch content for each offset - Confirmed we're returning different batches correctly (98 bytes each) - Sarama requests offsets 0,1,2 individually but only consumes offset 0 - Issue identified: Fetch v5 (Sarama) vs v10 (kafka-go) response format difference - kafka-go: fully working, Sarama: 1/3 messages consumed Next: Investigate Fetch v5 response format requirements	1 week ago
chrislu	7f9bc31a23	chore: clean up debug messages after kafka-go fix - Removed debug hex dumps and API request logging - kafka-go now fully functional: produces and consumes 3/3 messages - Sarama partially working: produces 3/3, consumes 1/3 messages - Issue identified: Sarama gets stuck after first message in record batch Next: Debug Sarama record batch parsing to consume all messages	1 week ago
chrislu	8033ca6399	feat: fix Fetch v10 response format for kafka-go compatibility - Added missing error_code (2 bytes) and session_id (4 bytes) fields for Fetch v7+ - kafka-go now successfully produces and consumes all messages - Fixed both ListOffsets v1 and Fetch v10 protocol compatibility - Test shows: ✅ Consumed 3 messages successfully with correct keys/values/offsets Major breakthrough: kafka-go client now fully functional for produce-consume workflows	1 week ago

1 2 3 4 5

226 Commits (feature/mq-kafka-gateway-m1)