seaweedfs

chrislu 890e5c8bd8 fix: add timeout to master volume lookup to prevent indefinite blocking	3 months ago

The masterVolumeProvider.LookupVolumeIds method was using the context directly without a timeout, which could cause it to block indefinitely if the master is slow to respond or unreachable.

Problem:

    err := pb.WithMasterClient(false, p.masterClient.GetMaster(ctx), ...)
    resp, err := client.LookupVolume(ctx, &master_pb.LookupVolumeRequest{...})

- No timeout on gRPC call to master
- Could block indefinitely if master is unresponsive
- Inconsistent with FilerClient which uses 5s timeout
- This is a fallback path (cache miss) but still needs protection

Scenarios where this could hang:
1. Master server under heavy load (slow response)
2. Network issues between client and master
3. Master server hung or deadlocked
4. Master in process of shutting down

Fix:

    timeoutCtx, cancel := context.WithTimeout(ctx, 5*time.Second)
    defer cancel()
    err := pb.WithMasterClient(false, p.masterClient.GetMaster(timeoutCtx), ...)
    resp, err := client.LookupVolume(timeoutCtx, &master_pb.LookupVolumeRequest{...})

Benefits:
- Prevents indefinite blocking on master lookup
- Consistent with FilerClient timeout pattern (5 seconds)
- Faster failure detection when master is unresponsive
- Caller's context still honored (timeout is in addition, not replacement)
- Improves overall system resilience

Note: 5 seconds is a reasonable default for volume lookups:
- Long enough for normal master response (~10-50ms)
- Short enough to fail fast on issues
- Matches FilerClient's grpcTimeout default
admin	muted texts	3 months ago
cluster	adds FilerClient to use cached volume id	3 months ago
command	backup: handle volume not found when backing up (#7465)	3 months ago
credential	Filer Store: postgres backend support pgbouncer (#7077)	6 months ago
filer	refactor: remove unnecessary KeepMasterClientConnected wrapper in filer	3 months ago
filer_client	Clean up logs and deprecated functions (#7339)	4 months ago
glog	Add Kafka Gateway (#7231)	4 months ago
iam	S3: Enforce bucket policy (#7471)	3 months ago
iamapi	fix: goroutine and connection leak in IAM server shutdown	3 months ago
images	Migrates from disintegration/imaging c2019 to cognusion/imaging c2024. (#5533)	2 years ago
kms	S3 API: Add integration with KMS providers (#7152)	6 months ago
mount	improve: address remaining code review findings	3 months ago
mq	S3: Directly read write volume servers (#7481)	3 months ago
notification	fix: dead letter message log message (#7072)	6 months ago
operation	S3: Directly read write volume servers (#7481)	3 months ago
pb	S3: Directly read write volume servers (#7481)	3 months ago
query	Fix date string parsing bug for the SQL Engine. (#7446)	3 months ago
remote_storage	Filer: Fixed critical bugs in the Azure SDK migration (PR #7310) (#7401)	3 months ago
replication	Filer: Fixed critical bugs in the Azure SDK migration (PR #7310) (#7401)	3 months ago
s3api	fix: FilerClient supports multiple filer addresses for high availability	3 months ago
security	remove spoof-able request header (#7103)	6 months ago
sequence	remove unused function	2 years ago
server	refactor: remove unnecessary KeepMasterClientConnected wrapper in filer	3 months ago
sftpd	S3 API: Advanced IAM System (#7160)	5 months ago
shell	Account Info (#7507)	3 months ago
static	Fix Broken Links (#5287)	2 years ago
stats	[volume] refactor and add metrics for flight upload and download data limit condition (#6920)	7 months ago
storage	Volume Server: avoid aggressive volume assignment (#7501)	3 months ago
telemetry	convert error fromating to %w everywhere (#6995)	7 months ago
topology	master: fix negative active volumes (#7440)	3 months ago
util	S3: Directly read write volume servers (#7481)	3 months ago
wdclient	fix: add timeout to master volume lookup to prevent indefinite blocking	3 months ago
worker	go fmt	3 months ago
Makefile	test versioning also (#7000)	7 months ago
weed.go	set exit status	11 months ago