seaweedfs

Commit Graph

Author	SHA1	Message	Date
Chris Lu	c284e51d20	fix: multipart upload ETag calculation (#8238 ) * fix multipart etag * address comments * clean up * clean up * optimization * address comments * unquoted etag * dedup * upgrade * clean * etag * return quoted tag * quoted etag * debug * s3api: unify ETag retrieval and quoting across handlers Refactor newListEntry to take S3ApiServer and use getObjectETag, and update setResponseHeaders to use the same logic. This ensures consistent ETags are returned for both listing and direct access. s3api: implement ListObjects deduplication for versioned buckets Handle duplicate entries between the main path and the .versions directory by prioritizing the latest version when bucket versioning is enabled. * s3api: cleanup stale main file entries during versioned uploads Add explicit deletion of pre-existing "main" files when creating new versions in versioned buckets. This prevents stale entries from appearing in bucket listings and ensures consistency. * s3api: fix cleanup code placement in versioned uploads Correct the placement of rm calls in completeMultipartUpload and putVersionedObject to ensure stale main files are properly deleted during versioned uploads. * s3api: improve getObjectETag fallback for empty ExtETagKey Ensure that when ExtETagKey exists but contains an empty value, the function falls through to MD5/chunk-based calculation instead of returning an empty string. * s3api: fix test files for new newListEntry signature Update test files to use the new newListEntry signature where the first parameter is S3ApiServer. Created mockS3ApiServer to properly test owner display name lookup functionality. s3api: use filer.ETag for consistent Md5 handling in getEtagFromEntry Change getEtagFromEntry fallback to use filer.ETag(entry) instead of filer.ETagChunks to ensure legacy entries with Attributes.Md5 are handled consistently with the rest of the codebase. * s3api: optimize list logic and fix conditional header logging - Hoist bucket versioning check out of per-entry callback to avoid repeated getVersioningState calls - Extract appendOrDedup helper function to eliminate duplicate dedup/append logic across multiple code paths - Change If-Match mismatch logging from glog.Errorf to glog.V(3).Infof and remove DEBUG prefix for consistency * s3api: fix test mock to properly initialize IAM accounts Fixed nil pointer dereference in TestNewListEntryOwnerDisplayName by directly initializing the IdentityAccessManagement.accounts map in the test setup. This ensures newListEntry can properly look up account display names without panicking. * cleanup * s3api: remove premature main file cleanup in versioned uploads Removed incorrect cleanup logic that was deleting main files during versioned uploads. This was causing test failures because it deleted objects that should have been preserved as null versions when versioning was first enabled. The deduplication logic in listing is sufficient to handle duplicate entries without deleting files during upload. * s3api: add empty-value guard to getEtagFromEntry Added the same empty-value guard used in getObjectETag to prevent returning quoted empty strings. When ExtETagKey exists but is empty, the function now falls through to filer.ETag calculation instead of returning "". * s3api: fix listing of directory key objects with matching prefix Revert prefix handling logic to use strings.TrimPrefix instead of checking HasPrefix with empty string result. This ensures that when a directory key object exactly matches the prefix (e.g. prefix="dir/", object="dir/"), it is correctly handled as a regular entry instead of being skipped or incorrectly processed as a common prefix. Also fixed missing variable definition. * s3api: refactor list inline dedup to use appendOrDedup helper Refactored the inline deduplication logic in listFilerEntries to use the shared appendOrDedup helper function. This ensures consistent behavior and reduces code duplication. * test: fix port allocation race in s3tables integration test Updated startMiniCluster to find all required ports simultaneously using findAvailablePorts instead of sequentially. This prevents race conditions where the OS reallocates a port that was just released, causing multiple services (e.g. Filer and Volume) to be assigned the same port and fail to start.	2 weeks ago
promalert	9012069bd7	chore: execute goimports to format the code (#7983 ) * chore: execute goimports to format the code Signed-off-by: promalert <promalert@outlook.com> * goimports -w . --------- Signed-off-by: promalert <promalert@outlook.com> Co-authored-by: Chris Lu <chris.lu@gmail.com>	1 month ago
Chris Lu	513ac58504	Filer: fix filer range read (#7078 ) * fix filer range read Only return true if we're reading the ENTIRE chunk from the beginning. // This prevents bandwidth amplification when range requests happen to align // with chunk boundaries but don't actually want the full chunk. * Update weed/filer/filechunks.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	7 months ago
Aleksey Kosov	283d9e0079	Add context with request (#6824 )	9 months ago
chrislu	75bdd4a0d1	refactor	3 years ago
chrislu	296fdc296c	mount: faster add chunks	3 years ago
Chris Lu	d4566d4aaa	more solid weed mount (#4089 ) * compare chunks by timestamp * fix slab clearing error * fix test compilation * move oldest chunk to sealed, instead of by fullness * lock on fh.entryViewCache * remove verbose logs * revert slat clearing * less logs * less logs * track write and read by timestamp * remove useless logic * add entry lock on file handle release * use mem chunk only, swap file chunk has problems * comment out code that maybe used later * add debug mode to compare data read and write * more efficient readResolvedChunks with linked list * small optimization * fix test compilation * minor fix on writer * add SeparateGarbageChunks * group chunks into sections * turn off debug mode * fix tests * fix tests * tmp enable swap file chunk * Revert "tmp enable swap file chunk" This reverts commit `985137ec47`. * simple refactoring * simple refactoring * do not re-use swap file chunk. Sealed chunks should not be re-used. * comment out debugging facilities * either mem chunk or swap file chunk is fine now * remove orderedMutex as semaphore.Weighted not found impactful optimize size calculation for changing large files * optimize performance to avoid going through the long list of chunks * still problems with swap file chunk * rename * tiny optimization * swap file chunk save only successfully read data * fix * enable both mem and swap file chunk * resolve chunks with range * rename * fix chunk interval list * also change file handle chunk group when adding chunks * pick in-active chunk with time-decayed counter * fix compilation * avoid nil with empty fh.entry * refactoring * rename * rename * refactor visible intervals to list.List refactor chunkViews to list.List add IntervalList for generic interval list * change visible interval to use IntervalList in generics * cahnge chunkViews to IntervalList[ChunkView] * use NewFileChunkSection to create * rename variables * refactor * fix renaming leftover * renaming * renaming * add insert interval * interval list adds lock * incrementally add chunks to readers Fixes: 1. set start and stop offset for the value object 2. clone the value object 3. use pointer instead of copy-by-value when passing to interval.Value 4. use insert interval since adding chunk could be out of order * fix tests compilation * fix tests compilation	3 years ago
aronneagu	77699855a7	Return ETag from remote when file doesn't exist on Filer (#4025 )	3 years ago
chrislu	70a4c98b00	refactor filer_pb.Entry and filer.Entry to use GetChunks() for later locking on reading chunks	3 years ago
chrislu	ea2637734a	refactor filer proto chunk variable from mtime to modified_ts_ns	3 years ago
chrislu	036566629a	filer.sync: fix synchronization logic in active-active mode fix https://github.com/seaweedfs/seaweedfs/issues/3328	4 years ago
chrislu	26dbc6c905	move to https://github.com/seaweedfs/seaweedfs	4 years ago
chrislu	a85ed3fe8f	minor	4 years ago
chrislu	2ea18cdcc8	remove dead code	4 years ago
geekboood	fdacd94af5	fix: invalid chunk data when failed to read manifests	4 years ago
justin	3551ca2fcf	enhancement: replace sort.Slice with slices.SortFunc to reduce reflection	4 years ago
chrislu	bb4beebce3	prevent nil	4 years ago
chrislu	4e2388e1b5	mount: set file size if it is only on remote gateway	4 years ago
chrislu	9405eaefdb	filer.sync: fix replicating partially updated file Run two servers with volumes and fillers: server -dir=Server1alpha -master.port=11000 -filer -filer.port=11001 -volume.port=11002 server -dir=Server1sigma -master.port=11006 -filer -filer.port=11007 -volume.port=11008 Run Active-Passive filler.sync: filer.sync -a localhost:11007 -b localhost:11001 -isActivePassive Upload file to 11007 port: curl -F file=@/Desktop/9.xml "http://localhost:11007/testFacebook/" If we request a file on two servers now, everything will be correct, even if we add data to the file and upload it again: curl "http://localhost:11007/testFacebook/9.xml" EQUALS curl "http://localhost:11001/testFacebook/9.xml" However, if we change the already existing data in the file (for example, we change the first line in the file, reducing its length), then this file on the second server will not be valid and will not be equivalent to the first file Снимок экрана 2022-02-07 в 14 21 11 This problem occurs on line 202 in the filer_sink.go file. In particular, this is due to incorrect mapping of chunk names in the DoMinusChunks function. The names of deletedChunks do not match the chunks of existingEntry.Chunks, since the first chunks come from another server and have a different addressing (name) compared to the addressing on the server where the file is being overwritten. Deleted chunks are not actually deleted on the server to which the file is replicated.	4 years ago
Chris Lu	8e2c9713a3	turn on new faster algorithm to translate into visible chunks	4 years ago
Chris Lu	b9a2efd69b	temporarily reverting	4 years ago
Chris Lu	8a124ef9fc	Revert "remove deprecated code" This reverts commit `de7688c539`.	4 years ago
Chris Lu	de7688c539	remove deprecated code	4 years ago
Chris Lu	7336990639	faster file read for large files	4 years ago
Chris Lu	7ab389e7ec	optimization: improve random range query for large files	5 years ago
Konstantin Lebedev	c2269123d3	fix aws style Etag for chunks	5 years ago
Chris Lu	69694a17be	reverting `7d57664c2d`	5 years ago
Chris Lu	7d57664c2d	mount: internals switch to filer.Entry instead of protobuf	5 years ago
Chris Lu	2b76854641	add "weed filer.cat" to read files directly from volume servers	5 years ago
Konstantin Lebedev	e4f2d9eb4a	We return etag using the same algorithm as aws s3 https://teppen.io/2018/06/23/aws_s3_etags/	5 years ago
Chris Lu	387ab6796f	filer: cross cluster synchronization	6 years ago
Chris Lu	eb7929a971	rename filer2 to filer	6 years ago
Chris Lu	99ecf63276	go fmt	6 years ago
Chris Lu	aee27ccbe1	multiple fixes * adjust isOpen count * move ContinuousDirtyPages lock to filehandle * fix problem with MergeIntoVisibles, avoid reusing slices * let filer delete the garbage	6 years ago
Chris Lu	d60bcbf08a	sorting chunks	6 years ago
Chris Lu	e72953dff7	logs	6 years ago
Chris Lu	2ba817afac	read randomly written data	6 years ago
Chris Lu	6111b265e7	fix compilation	6 years ago
Chris Lu	4a77f0820a	clean up logs	6 years ago
Chris Lu	8c9e6eaacd	fix tests	6 years ago
Chris Lu	1d9ea30b72	fix ViewFromVisibleIntervals	6 years ago
Chris Lu	c647deace1	file size support set file length use Attr.FileSize and TotalChunkSize to determine file size	6 years ago
Chris Lu	97d97f3528	go code can read and write chunk manifest	6 years ago
Chris Lu	97239ce6f1	rename filechunk is_gzipped to is_compressed	6 years ago
Chris Lu	f282ed444b	refactoring	6 years ago
Chris Lu	dc08e4098f	add etag only for PUT or large chunked uploads	6 years ago
Chris Lu	ec2eb8bc48	add If-None-Match and If-Modified-Since fix https://github.com/chrislusf/seaweedfs/issues/1269	6 years ago
Chris Lu	4aa82c95e6	refactor	6 years ago
Chris Lu	f06ca04451	avoid overflow	6 years ago
Chris Lu	ae2ee379c0	consistent 64bit size	6 years ago

32 Commits (eda4a000cc05e476ae13e2accb932c19df247abf)