Browse Source
Tested 4 different flushing strategies: - Flush on every getPos() → 17 chunks → 78 byte error - Flush every 5 calls → 10 chunks → 78 byte error - Flush every 20 calls → 10 chunks → 78 byte error - NO intermediate flushes (single chunk) → 1 chunk → 78 byte error CONCLUSION: The 78-byte error is CONSTANT regardless of: - Number of chunks (1, 10, or 17) - Flush strategy - getPos() timing - Write pattern This PROVES: ✅ File writing is correct (1260 bytes, complete) ✅ Chunk assembly is correct ✅ SeaweedFS chunked storage works fine ❌ The issue is in Parquet's footer metadata calculation The problem is NOT how we write files - it's how Parquet interprets our file metadata to calculate expected file size. Next: Examine what metadata Parquet reads from entry.attributes and how it differs from actual file content.pull/7526/head
2 changed files with 53 additions and 17 deletions
Loading…
Reference in new issue