seaweedfs

Commit Graph

Author	SHA1	Message	Date
Chris Lu	0647f66bb5	filer.sync: add exponential backoff on unexpected EOF during replication (#8557) * filer.sync: add exponential backoff on unexpected EOF during replication When the source volume server drops connections under high traffic, filer.sync retries aggressively (every 1-6s), hammering the already overloaded source. This adds a longer exponential backoff (10s to 2min) specifically for "unexpected EOF" errors, reducing pressure on the source while still retrying indefinitely until success. Also adds more logging throughout the replication path: - Log source URL and error at V(0) when ReadPart or io.ReadAll fails - Log content-length and byte counts at V(4) on success - Log backoff duration in retry messages Fixes #8542 * filer.sync: extract backoff helper and fix 2-minute cap - Extract nextEofBackoff() and isEofError() helpers to deduplicate the backoff logic between fetchAndWrite and uploadManifestChunk - Fix the cap: previously 80s would double to 160s and pass the < 2min check uncapped. Now doubles first, then clamps to 2min. * filer.sync: log source URL instead of empty upload URL on read errors UploadUrl is not populated until after the reader is consumed, so the V(0) and V(4) logs were printing an empty string. Add SourceUrl field to UploadOption and populate it from the HTTP response in fetchAndWrite. * filer.sync: guard isEofError against nil error * filer.sync: use errors.Is for EOF detection, fix log wording - Replace broad substring matching ("read input", "unexpected EOF") with errors.Is(err, io.ErrUnexpectedEOF) and errors.Is(err, io.EOF) so only actual EOF errors trigger the longer backoff - Fix awkward log phrasing: "interrupted replicate" → "interrupted while replicating" * filer.sync: remove EOF backoff from uploadManifestChunk uploadManifestChunk reads from an in-memory bytes.Reader, so any EOF errors there are from the destination side, not a broken source stream. The long source-oriented backoff is inappropriate; let RetryUntil handle destination retries at its normal cadence. --------- Co-authored-by: Copilot <copilot@github.com>	22 hours ago
Aleksey Kosov	4511c2cc1f	Changes logging function (#6919) * updated logging methods for stores * updated logging methods for stores * updated logging methods for filer * updated logging methods for uploader and http_util * updated logging methods for weed server --------- Co-authored-by: akosov <a.kosov@kryptonite.ru>	9 months ago
Aleksey Kosov	283d9e0079	Add context with request (#6824)	10 months ago
vadimartynov	86d92a42b4	Added tls for http clients (#5766) * Added global http client * Added Do func for global http client * Changed the code to use the global http client * Fix http client in volume uploader * Fixed pkg name * Fixed http util funcs * Fixed http client for bench_filer_upload * Fixed http client for stress_filer_upload * Fixed http client for filer_server_handlers_proxy * Fixed http client for command_fs_merge_volumes * Fixed http client for command_fs_merge_volumes and command_volume_fsck * Fixed http client for s3api_server * Added init global client for main funcs * Rename global_client to client * Changed: - fixed NewHttpClient; - added CheckIsHttpsClientEnabled func - updated security.toml in scaffold * Reduce the visibility of some functions in the util/http/client pkg * Added the loadSecurityConfig function * Use util.LoadSecurityConfiguration() in NewHttpClient func	2 years ago
chrislu	81fdf3651b	grpc connection to filer add sw-client-id header	3 years ago
askeipx	2e78a522ab	remove old raft servers if they don't answer to pings for too long (#3398) * remove old raft servers if they don't answer to pings for too long add ping durations as options rename ping fields fix some todos get masters through masterclient raft remove server from leader use raft servers to ping them CheckMastersAlive for hashicorp raft only * prepare blocking ping * pass waitForReady as param * pass waitForReady through all functions * waitForReady works * refactor * remove unneeded params * rollback unneeded changes * fix	4 years ago
Konstantin Lebedev	4d08393b7c	filer prefer volume server in same data center (#3405) * initial prefer same data center https://github.com/seaweedfs/seaweedfs/issues/3404 * GetDataCenter * prefer same data center for ReplicationSource * GetDataCenterId * remove glog	4 years ago
chrislu	26dbc6c905	move to https://github.com/seaweedfs/seaweedfs	4 years ago
chrislu	9f9ef1340c	use streaming mode for long poll grpc calls streaming mode would create separate grpc connections for each call. this is to ensure the long poll connections are properly closed.	4 years ago
Chris Lu	0db2517994	go fmt	5 years ago
Chris Lu	5a0f92423e	use grpc and jwt	5 years ago
Chris Lu	921e0d5008	remove verbose log	5 years ago
Chris Lu	9abb041763	filer source: support filerProxy mode	5 years ago
Chris Lu	990fa69bfe	add back AdjustedUrl() related code	5 years ago
Chris Lu	00707ec00f	mount: outsideContainerClusterMode proxy through filer Running mount outside of the cluster would not need to expose all the volume servers to outside of the cluster. The chunk read and write will go through the filer.	5 years ago
Chris Lu	6ca10725b8	Revert "mount: when outside cluster network, use filer as proxy to access volume servers" This reverts commit `096e088d7b`.	5 years ago
Chris Lu	096e088d7b	mount: when outside cluster network, use filer as proxy to access volume servers	5 years ago
Chris Lu	80b8692688	filer.sync: replicate outside of either cluster, only need to see filers	5 years ago
Chris Lu	723ae11db4	refactoring in order to adjust volume server url later	5 years ago
Chris Lu	a8624c2e4f	read from alternative replica related to https://github.com/chrislusf/seaweedfs/issues/1512	5 years ago
Chris Lu	387ab6796f	filer: cross cluster synchronization	6 years ago
Chris Lu	4fc0bd1a81	return http response directly	6 years ago
Chris Lu	ed3cf811f5	refactoring	6 years ago
Chris Lu	f90c43635d	refactoring	6 years ago
Chris Lu	892e726eb9	avoid reusing context object fix https://github.com/chrislusf/seaweedfs/issues/1182	6 years ago
Chris Lu	d335f04de6	support env variables to overwrite toml file	6 years ago
Chris Lu	72a64a5cf8	use the same context object in order to retry	6 years ago
j.laycock	6fc6322c90	Change joeslay paths to chrislusf paths	7 years ago
j.laycock	595a1beff0	Swap imports to use joeslay	7 years ago
Chris Lu	c789b496d8	use cached grpc client	7 years ago
Chris Lu	55bab1b456	add context.Context	7 years ago
Chris Lu	77b9af531d	adding grpc mutual tls	7 years ago
Chris Lu	e8ef501f02	add s3 replication sink	8 years ago
Chris Lu	db69ce89f0	go fmt	8 years ago
Chris Lu	a6cfaba018	able to sync the changes	8 years ago
Chris Lu	25fb6f9a46	fix compilation	8 years ago
Chris Lu	779641e9d4	adjust replicated entry name	8 years ago
Chris Lu	788acdf527	add WIP filer.replicate	8 years ago

36 Commits (0647f66bb54144362c053d135687416dfa0a5802)