seaweedfs

Commit Graph

Author	SHA1	Message	Date
chrislu	7e46abf052	fmt	1 month ago
chrislu	2ffdda2661	fix: commit offsets in Cleanup() before rebalancing This commit adds explicit offset commit in the ConsumerGroupHandler.Cleanup() method, which is called during consumer group rebalancing. This ensures all marked offsets are committed BEFORE partitions are reassigned to other consumers, significantly reducing duplicate message consumption during rebalancing. Problem: - Cleanup() was not committing offsets before rebalancing - When partition reassigned to another consumer, it started from last committed offset - Uncommitted messages (processed but not yet committed) were read again by new consumer - This caused ~100-200% duplicate messages during rebalancing in tests Solution: - Add session.Commit() in Cleanup() method - This runs after all ConsumeClaim goroutines have exited - Ensures all MarkMessage() calls are committed before partition release - New consumer starts from the last processed offset, not an older committed offset Benefits: - Dramatically reduces duplicate messages during rebalancing - Improves at-least-once semantics (closer to exactly-once for normal cases) - Better performance (less redundant processing) - Cleaner test results (expected duplicates only from actual failures) Kafka Rebalancing Lifecycle: 1. Rebalance triggered (consumer join/leave, timeout, etc.) 2. All ConsumeClaim goroutines cancelled 3. Cleanup() called ← WE COMMIT HERE NOW 4. Partitions reassigned to other consumers 5. New consumer starts from last committed offset ← NOW MORE UP-TO-DATE Expected Results: - Before: ~100-200% duplicates during rebalancing (2-3x reads) - After: <10% duplicates (only from uncommitted in-flight messages) This is a critical fix for production deployments where consumer churn (scaling, restarts, failures) causes frequent rebalancing.	1 month ago
chrislu	7e755c70ce	feat: add in-memory cache for disk chunk reads This commit adds an LRU cache for disk chunks to optimize repeated reads of historical data. When multiple consumers read the same historical offsets, or a single consumer refetches the same data, the cache eliminates redundant disk I/O. Cache Design: - Chunk size: 1000 messages per chunk - Max chunks: 16 (configurable, ~16K messages cached) - Eviction policy: LRU (Least Recently Used) - Thread-safe with RWMutex - Chunk-aligned offsets for efficient lookups New Components: 1. DiskChunkCache struct - manages cached chunks 2. CachedDiskChunk struct - stores chunk data with metadata 3. getCachedDiskChunk() - checks cache before disk read 4. cacheDiskChunk() - stores chunks with LRU eviction 5. extractMessagesFromCache() - extracts subset from cached chunk How It Works: 1. Read request for offset N (e.g., 2500) 2. Calculate chunk start: (2500 / 1000) * 1000 = 2000 3. Check cache for chunk starting at 2000 4. If HIT: Extract messages 2500-2999 from cached chunk 5. If MISS: Read chunk 2000-2999 from disk, cache it, extract 2500-2999 6. If cache full: Evict LRU chunk before caching new one Benefits: - Eliminates redundant disk I/O for popular historical data - Reduces latency for repeated reads (cache hit ~1ms vs disk ~100ms) - Supports multiple consumers reading same historical offsets - Automatically evicts old chunks when cache is full - Zero impact on hot path (in-memory reads unchanged) Performance Impact: - Cache HIT: ~99% faster than disk read - Cache MISS: Same as disk read (with caching overhead ~1%) - Memory: ~16MB for 16 chunks (16K messages x 1KB avg) Example Scenario (CI tests): - Producer writes offsets 0-4 - Data flushes to disk - Consumer 1 reads 0-4 (cache MISS, reads from disk, caches chunk 0-999) - Consumer 2 reads 0-4 (cache HIT, served from memory) - Consumer 1 rebalances, re-reads 0-4 (cache HIT, no disk I/O) This optimization is especially valuable in CI environments where: - Small memory buffers cause frequent flushing - Multiple consumers read the same historical data - Disk I/O is relatively slow compared to memory access	1 month ago
chrislu	0e481cf97a	fmt	1 month ago
chrislu	c5634470ed	feat: add disk I/O fallback for historical offset reads This commit implements async disk I/O fallback to handle cases where: 1. Data is flushed from memory before consumers can read it (CI issue) 2. Consumers request historical offsets not in memory 3. Small LogBuffer retention in resource-constrained environments Changes: - Add readHistoricalDataFromDisk() helper function - Update ReadMessagesAtOffset() to call ReadFromDiskFn when offset < bufferStartOffset - Properly handle maxMessages and maxBytes limits during disk reads - Return appropriate nextOffset after disk reads - Log disk read operations at V(2) and V(3) levels Benefits: - Fixes CI test failures where data is flushed before consumption - Enables consumers to catch up even if they fall behind memory retention - No blocking on hot path (disk read only for historical data) - Respects existing ReadFromDiskFn timeout handling How it works: 1. Try in-memory read first (fast path) 2. If offset too old and ReadFromDiskFn configured, read from disk 3. Return disk data with proper nextOffset 4. Consumer continues reading seamlessly This fixes the 'offset 0 too old (earliest in-memory: 5)' error in TestOffsetManagement where messages were flushed before consumer started.	1 month ago
chrislu	e1a4bff794	feat: add context timeout propagation to produce path This commit adds proper context propagation throughout the produce path, enabling client-side timeouts to be honored on the broker side. Previously, only fetch operations respected client timeouts - produce operations continued indefinitely even if the client gave up. Changes: - Add ctx parameter to ProduceRecord and ProduceRecordValue signatures - Add ctx parameter to PublishRecord and PublishRecordValue in BrokerClient - Add ctx parameter to handleProduce and related internal functions - Update all callers (protocol handlers, mocks, tests) to pass context - Add context cancellation checks in PublishRecord before operations Benefits: - Faster failure detection when client times out - No orphaned publish operations consuming broker resources - Resource efficiency improvements (no goroutine/stream/lock leaks) - Consistent timeout behavior between produce and fetch paths - Better error handling with proper cancellation signals This fixes the root cause of CI test timeouts where produce operations continued indefinitely after clients gave up, leading to cascading delays.	1 month ago
chrislu	66d87659e5	test: increase timeouts for consumer group operations in E2E tests Consumer group operations (coordinator discovery, offset fetch/commit) are slower in CI environments with limited resources. This increases timeouts to: - ProduceMessages: 10s -> 30s (for when consumer groups are active) - ConsumeWithGroup: 30s -> 60s (for offset fetch/commit operations) Fixes the TestOffsetManagement timeout failures in GitHub Actions CI.	1 month ago
chrislu	39e7bbdc6d	less logs	1 month ago
chrislu	a12f7b2ee8	fix go mod	1 month ago
chrislu	53f9124a26	fix tests	1 month ago
chrislu	1807b8093c	debug fetch offset APIs	1 month ago
chrislu	ba1a8aed64	log read stateless	1 month ago
chrislu	210fc49891	Merge branch 'master' into fix-race-condition	1 month ago
Chris Lu	3d25f206c8	S3: Signature verification should not check permissions (#7335 ) * Signature verification should not check permissions - that's done later in authRequest * test permissions during signature verfication * fix s3 test path * s3tests_boto3 => s3tests * remove extra lines	1 month ago
chrislu	3b75e50b04	removing the unnecessary restart logic and relying on the seek mechanism we already implemented	1 month ago
chrislu	6c1298b5f7	track messages with testStartTime	1 month ago
chrislu	f4a018e731	verify produced messages are consumed	1 month ago
chrislu	e7747a7572	adjust s3 tests	1 month ago
chrislu	38befd30ee	pin s3 test version	1 month ago
chrislu	0bf4ace6b1	reuse cached records	1 month ago
chrislu	7e934d6283	ack messages to broker	1 month ago
chrislu	5222ddaf2f	seekable subscribe messages	1 month ago
chrislu	60e6e63706	avoid goroutine leak	1 month ago
chrislu	f639c42472	clean up consumer protocols	1 month ago
chrislu	e344c6ce24	adjust return values on failures	1 month ago
chrislu	0cbc5e906e	purge unused	1 month ago
chrislu	fd33e03008	less logs, remove unused code	1 month ago
chrislu	bb0e613275	more time	1 month ago
chrislu	5c6b0eaa0d	Update fetch.go	1 month ago
chrislu	718113d085	adjust deadline	1 month ago
chrislu	e9101d9733	add some delays	1 month ago
chrislu	090f73dc66	less logs	1 month ago
chrislu	7c0c212d33	use client timeout wait	1 month ago
chrislu	4766534b84	increase deadline	1 month ago
chrislu	54f4a4285a	consumer group that does not join group	1 month ago
chrislu	9e78705a98	refactor dedup	1 month ago
chrislu	2a0b7604c5	avoid race condition	1 month ago
chrislu	1f128d65c5	debug	1 month ago
chrislu	9eae9e1fed	unlock	1 month ago
chrislu	98b536480d	fix locking	1 month ago
chrislu	73ebc69a82	avoid deadlock	1 month ago
chrislu	fe9e0161d5	fmt	1 month ago
chrislu	92a7e42368	atomic currentStartOffset	1 month ago
chrislu	e2c6f47cf6	Simplified GetOrCreateSubscriber to always reuse existing sessions	1 month ago
chrislu	6ef2f66198	only recreate if we need to seek backward (requested offset < current offset), not on any mismatch	1 month ago
chrislu	6947d906a8	more logs on offset resume	1 month ago
chrislu	63b3a10535	comment	1 month ago
chrislu	bc7e015a41	Inlined the session creation logic to hold the lock continuously	1 month ago
chrislu	2ff548a41d	save checkpoint every 2 seconds	1 month ago
chrislu	233ade4187	fix race condition	1 month ago

1 2 3 4 5 ...

11961 Commits (7e46abf052b029bfd703a6f73ad9b16dea8f25bc) All Branches Search

11961 Commits (7e46abf052b029bfd703a6f73ad9b16dea8f25bc)

All Branches