12214 Commits (58d4d61f8969f16972b0b8268113a0fefd328e95)

Author SHA1 Message Date
chrislu 58d4d61f89 docs: push instructions for Parquet EOF fix 2 weeks ago
chrislu 90aa83dbe4 docs: add detailed analysis of Parquet EOF fix 2 weeks ago
chrislu 9e7ed48688 fix: Override FSDataOutputStream.getPos() to use SeaweedOutputStream position 2 weeks ago
chrislu a8491ecd3f Update SeaweedOutputStream.java 2 weeks ago
chrislu 16bd118125 fix: don't split chunk ID on comma - comma is PART of the ID! 2 weeks ago
chrislu a1fa949221 feat: extract chunk IDs from write log and download from volume 2 weeks ago
chrislu c774b807e1 fix: search temporary directories for Parquet files 2 weeks ago
chrislu 7b9b04cd59 feat: add explicit logging when employees Parquet file is written 2 weeks ago
chrislu 09b0a2505c fix: poll for files to appear instead of fixed sleep 2 weeks ago
chrislu 64357e73bf feat: proactive download - grab files BEFORE Spark deletes them 2 weeks ago
chrislu 8e0635b8ba fix: search for filename in 'Encountered error' message 2 weeks ago
chrislu c5c29bc820 fix: search for failing file in read context (SeaweedInputStream) 2 weeks ago
chrislu e76107c22e fix: extract chunk ID for the EXACT file causing EOF error 2 weeks ago
chrislu 0afe330b4e feat: add detailed offset analysis for 78-byte discrepancy 2 weeks ago
chrislu 72b4bf9098 fix: extract correct chunk ID (not source_file_id) 2 weeks ago
chrislu 4ec6fbcdc7 fix: download Parquet data directly from volume server 2 weeks ago
chrislu 4224fcf4f8 chore: trigger new workflow run with real-time monitoring 2 weeks ago
chrislu a4af6d880d fix: download Parquet file in real-time when EOF error occurs 2 weeks ago
chrislu 09384e41e3 fix: add comprehensive diagnostics for file location 2 weeks ago
chrislu 8ea2646084 fix: keep containers running during file download 2 weeks ago
chrislu f2a20aec8b fix: download Parquet file immediately after test failure 2 weeks ago
chrislu 2548ad91f7 debug: add comprehensive volume and container diagnostics 2 weeks ago
chrislu 911eb60946 debug: add directory structure inspection before file download 2 weeks ago
chrislu 55b5f7f0aa fix: replace heredoc with echo pipe to fix YAML syntax 2 weeks ago
chrislu 0dc95c0669 fix: run Spark integration tests on all branches 2 weeks ago
chrislu ac9fbeefac refactor: remove emojis from logging and workflow messages 2 weeks ago
chrislu 588e29ae57 debug: improve file download with better diagnostics and fallbacks 2 weeks ago
chrislu fae232075f fix: restart SeaweedFS services before downloading files on test failure 2 weeks ago
chrislu 8c22780091 fix: restart SeaweedFS services before downloading files on test failure 2 weeks ago
chrislu af7ee4bfb6 docs: push summary for Parquet diagnostics 2 weeks ago
chrislu afce69db1e Revert "docs: comprehensive analysis of persistent 78-byte Parquet issue" 2 weeks ago
chrislu b767825ba0 test: add Parquet file download and inspection on failure 2 weeks ago
chrislu 8e5f1d60ee docs: comprehensive analysis of persistent 78-byte Parquet issue 2 weeks ago
chrislu 1ca6d7f441 debug parquet footer writing 2 weeks ago
chrislu 65c3ead62f debug: enhance logging to capture footer writes and getPos calls 2 weeks ago
chrislu 9e774d8d75 docs: add Parquet 1.16.0 upgrade summary and testing guide 2 weeks ago
chrislu 12504dc1a6 feat: upgrade Apache Parquet to 1.16.0 to fix EOFException 2 weeks ago
chrislu 3dd14ad2df fmt 2 weeks ago
chrislu 885354bb19 fix: reduce write() logging verbosity, add summary stats 2 weeks ago
chrislu 221252d34e fmt 2 weeks ago
chrislu 48a2ddf6f8 debug: track ALL writes to Parquet files 2 weeks ago
chrislu 86ae3da174 fmt 2 weeks ago
chrislu 10ff54a661 Revert "docs: comprehensive analysis of 78-byte EOFException" 2 weeks ago
chrislu 94ab173eb0 docs: comprehensive analysis of 78-byte EOFException 2 weeks ago
chrislu 7b067d2e59 debug: add getPos() method to track position queries 2 weeks ago
chrislu a3cf4eb843 debug: track stream lifecycle and total bytes written 2 weeks ago
chrislu a5bccca443 debug: add critical diagnostics for EOFException (78 bytes missing) 2 weeks ago
chrislu d9ab0721b9 refactor: merge workflow jobs into single job 2 weeks ago
chrislu 81867f0dd9 debug: add detailed verification for Maven artifact upload 2 weeks ago
chrislu f52d2902b2 fix: copy Maven artifacts into workspace instead of mounting $HOME/.m2 2 weeks ago