14 Commits (2d6b5711207bffb0ba247b3ce97e81ed4bc461f8)

Author SHA1 Message Date
chrislu 80b463b7e4 test: add GetPosBufferTest to reproduce Parquet issue - ALL TESTS PASS! 3 months ago
chrislu 8f33f5240d debug: confirmed root cause - Parquet tries to read 78 bytes past EOF 3 months ago
chrislu aed16ca9d7 fix: enable DEBUG logging for seaweed.hdfs package 3 months ago
chrislu 221252d34e fmt 3 months ago
chrislu a3cf4eb843 debug: track stream lifecycle and total bytes written 3 months ago
chrislu 65d9aacceb debug: enable detailed logging for SeaweedFS client file operations 3 months ago
chrislu 94615996ed workaround: increase Spark task retries for eventual consistency 3 months ago
chrislu 780a1fd059 fix: add file sync and cache settings to prevent EOF on read 3 months ago
chrislu 342705c99e fmt 3 months ago
chrislu 150deefdc0 fix: aggressively suppress Parquet DEBUG logging 3 months ago
chrislu 707e7732a7 fix: suppress verbose Parquet DEBUG logging 3 months ago
chrislu 786f5de7bb ci: refactor Spark workflow for DRY and robustness 3 months ago
chrislu b35463c8b4 spark: fix flaky test by sorting DataFrame before first() 3 months ago
chrislu 89a6d42cee Complete Spark integration test suite 3 months ago