seaweedfs

Commit Graph

Author	SHA1	Message	Date
chrislu	0e481cf97a	fmt	2 months ago
chrislu	c5634470ed	feat: add disk I/O fallback for historical offset reads This commit implements async disk I/O fallback to handle cases where: 1. Data is flushed from memory before consumers can read it (CI issue) 2. Consumers request historical offsets not in memory 3. Small LogBuffer retention in resource-constrained environments Changes: - Add readHistoricalDataFromDisk() helper function - Update ReadMessagesAtOffset() to call ReadFromDiskFn when offset < bufferStartOffset - Properly handle maxMessages and maxBytes limits during disk reads - Return appropriate nextOffset after disk reads - Log disk read operations at V(2) and V(3) levels Benefits: - Fixes CI test failures where data is flushed before consumption - Enables consumers to catch up even if they fall behind memory retention - No blocking on hot path (disk read only for historical data) - Respects existing ReadFromDiskFn timeout handling How it works: 1. Try in-memory read first (fast path) 2. If offset too old and ReadFromDiskFn configured, read from disk 3. Return disk data with proper nextOffset 4. Consumer continues reading seamlessly This fixes the 'offset 0 too old (earliest in-memory: 5)' error in TestOffsetManagement where messages were flushed before consumer started.	2 months ago
chrislu	e1a4bff794	feat: add context timeout propagation to produce path This commit adds proper context propagation throughout the produce path, enabling client-side timeouts to be honored on the broker side. Previously, only fetch operations respected client timeouts - produce operations continued indefinitely even if the client gave up. Changes: - Add ctx parameter to ProduceRecord and ProduceRecordValue signatures - Add ctx parameter to PublishRecord and PublishRecordValue in BrokerClient - Add ctx parameter to handleProduce and related internal functions - Update all callers (protocol handlers, mocks, tests) to pass context - Add context cancellation checks in PublishRecord before operations Benefits: - Faster failure detection when client times out - No orphaned publish operations consuming broker resources - Resource efficiency improvements (no goroutine/stream/lock leaks) - Consistent timeout behavior between produce and fetch paths - Better error handling with proper cancellation signals This fixes the root cause of CI test timeouts where produce operations continued indefinitely after clients gave up, leading to cascading delays.	2 months ago
chrislu	66d87659e5	test: increase timeouts for consumer group operations in E2E tests Consumer group operations (coordinator discovery, offset fetch/commit) are slower in CI environments with limited resources. This increases timeouts to: - ProduceMessages: 10s -> 30s (for when consumer groups are active) - ConsumeWithGroup: 30s -> 60s (for offset fetch/commit operations) Fixes the TestOffsetManagement timeout failures in GitHub Actions CI.	2 months ago
chrislu	39e7bbdc6d	less logs	2 months ago
chrislu	a12f7b2ee8	fix go mod	2 months ago
chrislu	53f9124a26	fix tests	2 months ago
chrislu	1807b8093c	debug fetch offset APIs	2 months ago
chrislu	ba1a8aed64	log read stateless	2 months ago
chrislu	210fc49891	Merge branch 'master' into fix-race-condition	2 months ago
Chris Lu	3d25f206c8	S3: Signature verification should not check permissions (#7335 ) * Signature verification should not check permissions - that's done later in authRequest * test permissions during signature verfication * fix s3 test path * s3tests_boto3 => s3tests * remove extra lines	2 months ago
chrislu	3b75e50b04	removing the unnecessary restart logic and relying on the seek mechanism we already implemented	2 months ago
chrislu	6c1298b5f7	track messages with testStartTime	2 months ago
chrislu	f4a018e731	verify produced messages are consumed	2 months ago
chrislu	e7747a7572	adjust s3 tests	2 months ago
chrislu	38befd30ee	pin s3 test version	2 months ago
chrislu	0bf4ace6b1	reuse cached records	2 months ago
chrislu	7e934d6283	ack messages to broker	2 months ago
chrislu	5222ddaf2f	seekable subscribe messages	2 months ago
chrislu	60e6e63706	avoid goroutine leak	2 months ago
chrislu	f639c42472	clean up consumer protocols	2 months ago
chrislu	e344c6ce24	adjust return values on failures	2 months ago
chrislu	0cbc5e906e	purge unused	2 months ago
chrislu	fd33e03008	less logs, remove unused code	2 months ago
chrislu	bb0e613275	more time	2 months ago
chrislu	5c6b0eaa0d	Update fetch.go	2 months ago
chrislu	718113d085	adjust deadline	2 months ago
chrislu	e9101d9733	add some delays	2 months ago
chrislu	090f73dc66	less logs	2 months ago
chrislu	7c0c212d33	use client timeout wait	2 months ago
chrislu	4766534b84	increase deadline	2 months ago
chrislu	54f4a4285a	consumer group that does not join group	2 months ago
chrislu	9e78705a98	refactor dedup	2 months ago
chrislu	2a0b7604c5	avoid race condition	2 months ago
chrislu	1f128d65c5	debug	2 months ago
chrislu	9eae9e1fed	unlock	2 months ago
chrislu	98b536480d	fix locking	2 months ago
chrislu	73ebc69a82	avoid deadlock	2 months ago
chrislu	fe9e0161d5	fmt	2 months ago
chrislu	92a7e42368	atomic currentStartOffset	2 months ago
chrislu	e2c6f47cf6	Simplified GetOrCreateSubscriber to always reuse existing sessions	2 months ago
chrislu	6ef2f66198	only recreate if we need to seek backward (requested offset < current offset), not on any mismatch	2 months ago
chrislu	6947d906a8	more logs on offset resume	2 months ago
chrislu	63b3a10535	comment	2 months ago
chrislu	bc7e015a41	Inlined the session creation logic to hold the lock continuously	2 months ago
chrislu	2ff548a41d	save checkpoint every 2 seconds	2 months ago
chrislu	233ade4187	fix race condition	2 months ago
chrislu	ffc45a538d	Added bounds checking after calculating startIdx. Problem: Race condition in cache lookup logic: Thread A reads cache metadata (17+ records, endOffset = 32) Thread B modifies/truncates the cache to 17 records Thread A calculates startIdx = 19 based on old metadata Slice operation consumedRecords[19:17] panics	2 months ago
chrislu	f15eaaf8b9	nil checking	2 months ago
chrislu	fba4fc3a7d	All consumers share the same group for load balancing across partitions	2 months ago

1 2 3 4 5 ...

11958 Commits (0e481cf97acd87422c764042f45f035205d30573) All Branches Search

11958 Commits (0e481cf97acd87422c764042f45f035205d30573)

All Branches