seaweedfs

History

Chris Lu 54de32f207 Support AWS standard IAM role ARN formats (issue #7946 ) (#7948 ) * fix(iam): support both AWS standard and legacy IAM role ARN formats Fix issue #7946 where SeaweedFS only recognized legacy IAM role ARN format (arn:aws:iam::role/RoleName) but not the standard AWS format with account ID (arn:aws:iam::ACCOUNT:role/RoleName). This was breaking EKS pod identity integration which expects the standard format. Changes: - Update ExtractRoleNameFromArn() to handle both formats by searching for 'role/' marker instead of matching a fixed prefix - Update ExtractRoleNameFromPrincipal() to clearly document both STS and IAM formats it supports with or without account ID - Simplify role ARN validation in validateRoleAssumptionForWebIdentity() and validateRoleAssumptionForCredentials() to use the extraction function - Add comprehensive test coverage with 25 test cases covering both formats The fix maintains backward compatibility with legacy format while adding support for standard AWS format with account ID. Fixes: https://github.com/seaweedfs/seaweedfs/issues/7946 * docs: improve docstring coverage for ARN utility functions - Add comprehensive package-level documentation - Enhance ExtractRoleNameFromPrincipal docstring with parameter and return descriptions - Enhance ExtractRoleNameFromArn docstring with detailed format documentation - Add docstrings to test functions explaining test coverage - Update all docstrings to 80%+ coverage for code review compliance * refactor: improve ARN parsing code maintainability and error messages - Define constants for ARN prefixes and markers (stsPrefix, stsAssumedRoleMarker, iamPrefix, iamRoleMarker) - Replace hardcoded magic strings with named constants in ExtractRoleNameFromPrincipal and ExtractRoleNameFromArn - Enhance error messages in sts_service.go to show expected ARN format when validation fails - Error message now shows: 'arn:aws:iam::[ACCOUNT_ID:]role/ROLE_NAME' format - Improves code readability and maintainability - Facilitates future ARN format changes and debugging * feat: add structured ARN type for better debugging and extensibility Implements Option 2 (Structured ARN Type) from ARN handling comparison: New Features: - ARNInfo struct with Original, RoleName, AccountID, and Format fields - ARNFormat enum (Legacy, Standard, Invalid) for type-safe format tracking - ParseRoleARN() function for structured IAM role ARN parsing - ParsePrincipalARN() function for structured STS/IAM principal parsing Benefits: - Better debugging: Can see original ARN, extracted components, and format type - Extensible: Easy to add more fields (Region, Service, etc.) in future - Type-safe: Format is an enum, not a string - Backward compatible: Kept original string-based functions STS Service Updates: - Uses ParseRoleARN() for structured validation - Logs ARN components at V(4) level for debugging (role, account, format) - Better error context when validation fails Test Coverage: - 7 new tests for ParseRoleARN (legacy, standard, invalid formats) - 7 new tests for ParsePrincipalARN (STS/IAM, legacy/standard) - All 39 existing tests still pass - Total: 53 ARN-related tests Comparison with MinIO: - More flexible: Supports both AWS formats (MinIO only supports MinIO format) - Better tested: 53 tests vs MinIO's 8 tests - Structured like MinIO but more practical for AWS use cases * security: fix ARN parsing to prevent malicious ARN acceptance Fix critical security vulnerability where malicious ARNs could bypass validation: - ARNs like 'arn:aws:iam::123456789012:user/role/malicious' were incorrectly accepted - The previous implementation used strings.Index to find 'role/' anywhere in the ARN - This allowed non-role resource types to be accepted if they contained 'role/' in their path Changes: 1. Updated ExtractRoleNameFromArn() to validate resource type is exactly 'role/' 2. Updated ExtractRoleNameFromPrincipal() to validate resource type is exactly 'assumed-role/' 3. Updated ParseRoleARN() to validate structure before extracting fields 4. Updated ParsePrincipalARN() to validate structure before extracting fields 5. Added 6 security test cases to prevent regression The fix validates ARN structure by: - Splitting on ':' to separate account ID from resource type - Verifying resource type starts with exact marker ('role/' or 'assumed-role/') - Only then extracting role name, account ID, and format All 59 tests pass, including new security tests that verify malicious ARNs are rejected. Fixes: GitHub Copilot review #3624499048 * test: add test cases for empty role names and improve validation Address review feedback to improve edge case coverage: 1. Added test case for standard format with empty role name - TestExtractRoleNameFromArn: arn:aws:iam::123456789012:role/ - TestParseRoleARN: arn:aws:iam::123456789012:role/ 2. Added empty role name validation for STS ARNs in ParsePrincipalARN - Now matches ParseRoleARN behavior - Prevents ARNs like arn:aws:sts::assumed-role/ from having valid Format 3. Added test cases for empty STS role names - TestParsePrincipalARN: arn:aws:sts::assumed-role/ - TestParsePrincipalARN: arn:aws:sts::123456789012:assumed-role/ All 65 tests pass (15 for ExtractRoleNameFromArn, 10 for ExtractRoleNameFromPrincipal, 8 for ParseRoleARN, 9 for ParsePrincipalARN, 4 security user ARNs, 2 security STS, plus existing tests). * refactor: simplify ARNInfo by removing Format enum Remove ARNFormat enum (ARNFormatLegacy, ARNFormatStandard, ARNFormatInvalid) as it's not needed for backward compatibility. Simplifications: 1. Removed ARNFormat type and all format constants 2. Removed Format field from ARNInfo struct 3. Validation now checks if RoleName is empty (simpler and clearer) 4. AccountID presence already distinguishes legacy (empty) from standard (non-empty) formats 5. Updated STS service to check RoleName emptiness instead of Format field 6. Improved debug logging to explicitly show "(legacy format)" or "(standard format)" Benefits: - Simpler code with fewer concepts - AccountID field already provides format information - Validation is clearer: empty RoleName = invalid ARN - All 65 tests still pass This change maintains the same functionality while reducing code complexity. No backward compatibility concerns as the structured ARN parsing is new. * test: add comprehensive edge case tests for ARN parsing Add 4 new test functions covering: - Multiple role markers in paths (e.g., role/role/name) - Consecutive slashes in role paths (preserved as valid components) - Special characters valid in AWS role names (+=,.@-_) - Extremely long role names near AWS limits These tests verify the parser's resilience to edge cases and ensure proper handling of various valid role name formats and special characters.		2 months ago
..
admin	Refine Bucket Size Metrics: Logical and Physical Size (#7943)	2 months ago
cluster	Make lock_manager.RenewInterval configurable in LiveLock (#7830)	2 months ago
command	fix: correcting S3 nil cipher dereference in filer init (#7952)	2 months ago
credential	fix: Admin UI user creation fails before filer discovery (#7624) (#7625)	3 months ago
filer	fix: include DiskType in metadata log volume assignment (#7918)	2 months ago
filer_client	Clean up logs and deprecated functions (#7339)	4 months ago
glog	Add Kafka Gateway (#7231)	4 months ago
iam	Support AWS standard IAM role ARN formats (issue #7946) (#7948)	2 months ago
iamapi	IAM: Add Service Account Support (#7744) (#7901)	2 months ago
images	Migrates from disintegration/imaging c2019 to cognusion/imaging c2024. (#5533)	2 years ago
kms	S3 API: Add integration with KMS providers (#7152)	6 months ago
mount	classify grpc errors	2 months ago
mq	fix: comprehensive go vet error fixes and add CI enforcement (#7861)	2 months ago
notification	Fix webhook duplicate deliveries and POST to GET conversion (#7668)	3 months ago
operation	Add S3 volume encryption support with -s3.encryptVolumeData flag (#7890)	2 months ago
pb	Fix: Add -admin.grpc flag to worker for explicit gRPC port (#7926) (#7927)	2 months ago
query	fix: comprehensive go vet error fixes and add CI enforcement (#7861)	2 months ago
remote_storage	Filer: Fixed critical bugs in the Azure SDK migration (PR #7310) (#7401)	4 months ago
replication	fix(gcs): resolve credential conflict and improve backup logging (#7951)	2 months ago
s3api	Fix AWS SDK Signature V4 with STS credentials (issue #7941) (#7944)	2 months ago
security	remove spoof-able request header (#7103)	7 months ago
sequence	remove unused function	2 years ago
server	Fix reporting of EC shard sizes from nodes to masters. (#7835)	2 months ago
sftpd	SFTP: support reloading user store on HUP signal (#7651)	3 months ago
shell	Have `volume.list` account for EC shards when computing disk usage. (#7909)	2 months ago
static	Fix Broken Links (#5287)	2 years ago
stats	feat: add S3 bucket size and object count metrics (#7776)	2 months ago
storage	refactoring	2 months ago
telemetry	convert error fromating to %w everywhere (#6995)	7 months ago
topology	Fix reporting of EC shard sizes from nodes to masters. (#7835)	2 months ago
util	4.05	2 months ago
wdclient	Fix worker reconnection race condition causing context canceled errors (#7825)	2 months ago
worker	Fix: Add -admin.grpc flag to worker for explicit gRPC port (#7926) (#7927)	2 months ago
Makefile	Update README and weed/Makefile (#7571)	3 months ago
weed.go	Fix the issue where fuse command on a node cannot specify multiple configuration directory paths (#7874)	2 months ago