12230 Commits (80b463b7e4e21328aaa5045976dc4e80cc8fc010)
 

Author SHA1 Message Date
chrislu 80b463b7e4 test: add GetPosBufferTest to reproduce Parquet issue - ALL TESTS PASS! 1 week ago
chrislu 4faa6d55f6 docs: comprehensive issue summary - getPos() buffer flush timing issue 1 week ago
chrislu 8f33f5240d debug: confirmed root cause - Parquet tries to read 78 bytes past EOF 1 week ago
chrislu 16b8cf3e52 debug: add logging to EOF return path - FOUND ROOT CAUSE! 1 week ago
chrislu 216ae856ca docs: add comprehensive debugging analysis for EOF exception fix 1 week ago
chrislu 5c30bc8e7b debug: add detailed getPos() tracking with caller stack trace 1 week ago
chrislu e95f7061a4 fix: SeaweedInputStream returning 0 bytes for inline content reads 1 week ago
chrislu c10ae054b6 debug: add logging to SeaweedInputStream constructor to track contentLength 1 week ago
chrislu 9bb000e150 Update SeaweedOutputStream.java 1 week ago
chrislu d7d4d97098 debug: verify JARs contain latest code before running tests 1 week ago
chrislu 4936f733d1 debug: add WARN logging to SeaweedOutputStream base constructor 1 week ago
chrislu c834e30a72 debug: add logging to SeaweedFileSystemStore.createFile() 1 week ago
chrislu aed16ca9d7 fix: enable DEBUG logging for seaweed.hdfs package 1 week ago
chrislu 6fe5c372ee debug: change logs to WARN level to ensure visibility 1 week ago
chrislu c91175cb97 fix: make path variable final for anonymous inner class 1 week ago
chrislu d6f9234cea debug: add aggressive logging to FSDataOutputStream getPos() override 1 week ago
chrislu 58d4d61f89 docs: push instructions for Parquet EOF fix 1 week ago
chrislu 90aa83dbe4 docs: add detailed analysis of Parquet EOF fix 1 week ago
chrislu 9e7ed48688 fix: Override FSDataOutputStream.getPos() to use SeaweedOutputStream position 1 week ago
chrislu a8491ecd3f Update SeaweedOutputStream.java 1 week ago
chrislu 16bd118125 fix: don't split chunk ID on comma - comma is PART of the ID! 1 week ago
chrislu a1fa949221 feat: extract chunk IDs from write log and download from volume 1 week ago
chrislu c774b807e1 fix: search temporary directories for Parquet files 1 week ago
chrislu 7b9b04cd59 feat: add explicit logging when employees Parquet file is written 1 week ago
chrislu 09b0a2505c fix: poll for files to appear instead of fixed sleep 1 week ago
chrislu 64357e73bf feat: proactive download - grab files BEFORE Spark deletes them 1 week ago
chrislu 8e0635b8ba fix: search for filename in 'Encountered error' message 1 week ago
chrislu c5c29bc820 fix: search for failing file in read context (SeaweedInputStream) 1 week ago
chrislu e76107c22e fix: extract chunk ID for the EXACT file causing EOF error 1 week ago
chrislu 0afe330b4e feat: add detailed offset analysis for 78-byte discrepancy 1 week ago
chrislu 72b4bf9098 fix: extract correct chunk ID (not source_file_id) 1 week ago
chrislu 4ec6fbcdc7 fix: download Parquet data directly from volume server 1 week ago
chrislu 4224fcf4f8 chore: trigger new workflow run with real-time monitoring 1 week ago
chrislu a4af6d880d fix: download Parquet file in real-time when EOF error occurs 1 week ago
chrislu 09384e41e3 fix: add comprehensive diagnostics for file location 1 week ago
chrislu 8ea2646084 fix: keep containers running during file download 1 week ago
chrislu f2a20aec8b fix: download Parquet file immediately after test failure 1 week ago
chrislu 2548ad91f7 debug: add comprehensive volume and container diagnostics 1 week ago
chrislu 911eb60946 debug: add directory structure inspection before file download 1 week ago
chrislu 55b5f7f0aa fix: replace heredoc with echo pipe to fix YAML syntax 1 week ago
chrislu 0dc95c0669 fix: run Spark integration tests on all branches 1 week ago
chrislu ac9fbeefac refactor: remove emojis from logging and workflow messages 1 week ago
chrislu 588e29ae57 debug: improve file download with better diagnostics and fallbacks 1 week ago
chrislu fae232075f fix: restart SeaweedFS services before downloading files on test failure 1 week ago
chrislu 8c22780091 fix: restart SeaweedFS services before downloading files on test failure 1 week ago
chrislu af7ee4bfb6 docs: push summary for Parquet diagnostics 1 week ago
chrislu afce69db1e Revert "docs: comprehensive analysis of persistent 78-byte Parquet issue" 1 week ago
chrislu b767825ba0 test: add Parquet file download and inspection on failure 1 week ago
chrislu 8e5f1d60ee docs: comprehensive analysis of persistent 78-byte Parquet issue 1 week ago
chrislu 1ca6d7f441 debug parquet footer writing 1 week ago