Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable bootstrap precombine #1

Open
wants to merge 2,284 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
2284 commits
Select commit Hold shift + click to select a range
6d37d06
[HUDI-4628] Hudi-flink support GLOBAL_BLOOM,GLOBAL_SIMPLE,BUCKET inde…
TJX2014 Sep 14, 2022
f13ec25
[HUDI-4814] Schedules new clustering plan based on latest clustering …
eric9204 Sep 14, 2022
16328a8
[HUDI-4817] Delete markers after full-record bootstrap operation (#6667)
yihua Sep 14, 2022
e16d63b
[HUDI-4691] Cleaning up duplicated classes in Spark 3.3 module (#6550)
Sep 14, 2022
95ac58e
[HUDI-4752] Add dedup support for MOR table in cli (#6608)
xicm Sep 15, 2022
0afc581
[HUDI-4837] Stop sleeping where it is not necessary after the success…
vburenin Sep 15, 2022
d614242
[HUDI-4843] Delete the useless timer in BaseRollbackActionExecutor (#…
wuwenchi Sep 15, 2022
34c6ce9
[HUDI-4780] hoodie.logfile.max.size It does not take effect, causing …
loukey-lj Sep 15, 2022
394ebb3
[HUDI-4844] Skip partition value resolving when the field does not ex…
danny0405 Sep 15, 2022
59c85a4
[MINOR] Fix the Spark job status description for metadata-only bootst…
yihua Sep 15, 2022
17a23da
[HUDI-3403] Ensure keygen props are set for bootstrap (#6645)
codope Sep 15, 2022
73d8758
[HUDI-4193] Upgrade Protobuf to 3.21.5 (#5784)
slachiewicz Sep 15, 2022
ebdb9f4
[HUDI-4785] Fix partition discovery in bootstrap operation (#6673)
codope Sep 16, 2022
ec62884
[HUDI-4706] Fix InternalSchemaChangeApplier#applyAddChange error to a…
wzx140 Sep 16, 2022
b7687d9
[HUDI-4851] Fixing CSI not handling `InSet` operator properly (#6685)
Sep 16, 2022
c758a96
[HUDI-4796] MetricsReporter stop bug (#6619)
the-other-tim-brown Sep 16, 2022
8e3c365
[HUDI-3861] update tblp 'path' when rename table (#5320)
KnightChess Sep 16, 2022
be67657
[HUDI-4853] Get field by name for OverwriteNonDefaultsWithLatestAvroP…
danny0405 Sep 16, 2022
ba77748
[HUDI-4813] Fix infer keygen not work in sparksql side issue (#6634)
TJX2014 Sep 16, 2022
f7a6637
[HUDI-4856] Missing option for HoodieCatalogFactory (#6693)
danny0405 Sep 17, 2022
702e40c
[HUDI-4864] Fix AWSDmsAvroPayload#combineAndGetUpdateValue when using…
rahil-c Sep 17, 2022
9b886d1
[HUDI-4841] Fix sort idempotency issue (#6669)
voonhous Sep 17, 2022
6572693
[HUDI-4865] Optimize HoodieAvroUtils#isMetadataField to use O(1) comp…
danny0405 Sep 17, 2022
7db20e9
[HUDI-4736] Fix inflight clean action preventing clean service to con…
yihua Sep 17, 2022
8b21bb7
[HUDI-4842] Support compaction strategy based on delta log file num (…
scxwhite Sep 17, 2022
a084dd6
[HUDI-4282] Repair IOException in CHDFS when check block corrupted in…
5herhom Sep 17, 2022
c11537b
[HUDI-4757] Create pyspark examples (#6672)
jonvex Sep 17, 2022
fd0a953
[HUDI-3959] Rename class name for spark rdd reader (#5409)
simonsssu Sep 17, 2022
3db232e
[HUDI-4828] Fix the extraction of record keys which may be cut out (#…
y0908105023 Sep 17, 2022
094ed24
[HUDI-4873] Report number of messages to be processed via metrics (#6…
vburenin Sep 17, 2022
93e94eb
[HUDI-4424] Add new compactoin trigger stratgy: NUM_COMMITS_AFTER_REQ…
fengjian428 Aug 8, 2022
fe9cbbc
Fix errors from cherry-picking
yuzhaojing Sep 23, 2022
53a3dae
Create release branch for version 0.12.1.
yuzhaojing Sep 23, 2022
e524a62
[HUDI-4870] Improve compaction config description (#6706)
yihua Sep 18, 2022
0bd1c53
[HUDI-4808] Fix HoodieSimpleBucketIndex not consider bucket num in lo…
TJX2014 Sep 19, 2022
62e1192
[HUDI-4485] Bump spring shell to 2.1.1 in CLI (#6489)
paul8263 Sep 19, 2022
5a8fab6
[minor] following 3304, some code refactoring (#6713)
danny0405 Sep 19, 2022
34ff529
[HUDI-4832] Fix drop partition meta sync (#6662)
codope Sep 19, 2022
e67b6f5
[HUDI-4810] Fix log4j imports to use bridge API (#6710)
eric9204 Sep 19, 2022
19b63fb
[HUDI-4877] Fix org.apache.hudi.index.bucket.TestHoodieSimpleBucketIn…
TJX2014 Sep 20, 2022
ffeb7eb
[HUDI-4326] add updateTableSerDeInfo for HiveSyncTool (#5920)
kk17 Sep 20, 2022
4e92eb8
[MINOR] fix indent to make build pass (#6721)
YannByron Sep 20, 2022
464d992
[HUDI-4326] Fix hive sync serde properties (#6722)
xushiyan Sep 20, 2022
4f616f7
[HUDI-4875] Fix NoSuchTableException when dropping temporary view aft…
dohongdayi Sep 20, 2022
8c7dca2
[DOCS] Improve the quick start guide for Kafka Connect Sink (#6708)
yihua Sep 20, 2022
218240e
[HUDI-4729] Fix file group pending compaction cannot be queried when …
microbearz Sep 21, 2022
4cc2692
[HUDI-3983] Fix ClassNotFoundException when using hudi-spark-bundle t…
xicm Sep 21, 2022
274325a
[HUDI-4758] Add validations to java spark examples (#6615)
jonvex Sep 21, 2022
5271f0f
[HUDI-4792] Batch clean files to delete (#6580)
parisni Sep 21, 2022
da71afd
[HUDI-4363] Support Clustering row writer to improve performance (#6046)
boneanxs Sep 22, 2022
7facad1
[HUDI-4851] Fixing handling of `UTF8String` w/in `InSet` operator (#6…
Sep 22, 2022
fad706c
[HUDI-3901] Correct the description of hoodie.index.type (#6749)
yihua Sep 23, 2022
bdaef29
[MINOR] Add .mvn directory to gitignore (#6746)
rahil-c Sep 23, 2022
e87a367
[HUDI-4901] Add avro.version to Flink profiles (#6757)
CTTY Sep 23, 2022
21a3bff
[HUDI-4559] Support hiveSync command based on Call Produce Command (#…
XuQianJin-Stars Sep 23, 2022
a91b03f
[HUDI-4883] Supporting delete savepoint for MOR (#6744)
nsivabalan Sep 23, 2022
a25c26d
[HUDI-3523] Introduce AddColumnSchemaPostProcessor to support add col…
wangxianghu Sep 23, 2022
9d31073
Revert "[HUDI-3523] Introduce AddColumnSchemaPostProcessor to support…
wangxianghu Sep 23, 2022
f144874
[HUDI-3523] Introduce AddPrimitiveColumnSchemaPostProcessor to suppor…
wangxianghu Sep 23, 2022
fcdd849
[HUDI-4903] Fix TestHoodieLogFormat`s minor typo (#6762)
wulei0302 Sep 23, 2022
57fa1e7
[MINOR] Drastically reducing concurrency level (to avoid CI flakiness…
Sep 23, 2022
e370c55
Update HoodieIndex.java
wongxingjun Sep 23, 2022
870ef0c
[HUDI-4906] Fix the local tests for hudi-flink (#6763)
danny0405 Sep 23, 2022
bead716
[HUDI-4899] Fixing compatibility w/ Spark 3.2.2 (#6755)
Sep 23, 2022
23d7058
[HUDI-4892] Fix hudi-spark3-bundle (#6735)
yihua Sep 23, 2022
7283523
[MINOR] Fix a few typos in HoodieIndex (#6784)
wongxingjun Sep 24, 2022
bd93c84
[HUDI-4412] Fix multi writer INSERT_OVERWRITE NPE bug (#6130)
liujinhui1994 Sep 24, 2022
6a647c5
[MINOR] retain avro's namespace (#6783)
YannByron Sep 24, 2022
c122ff6
[MINOR] Simple logging fix in LockManager (#6765)
scxwhite Sep 24, 2022
1eb3010
[HUDI-4433] hudi-cli repair deduplicate not working with non-partitio…
brightwon Sep 25, 2022
1a4356f
[RFC-51][HUDI-3478] Update RFC: CDC support (#6256)
xushiyan Sep 25, 2022
830e35c
[HUDI-4915] improve avro serializer/deserializer (#6788)
YannByron Sep 25, 2022
19b74ea
[HUDI-4830] Fix testNoGlobalConfFileConfigured when add hudi-defaults…
Zouxxyy Sep 26, 2022
d75f5ae
[HUDI-4760] Fixing repeated trigger of data file creations w/ cluster…
nsivabalan Sep 26, 2022
88c1ca2
[HUDI-4584] Cleaning up Spark utilities (#6351)
Aug 23, 2022
ebf4120
fix test
yuzhaojing Sep 27, 2022
77fa389
[HUDI-4914] Managed memory weight should be set when sort clustering …
SteNicholas Sep 26, 2022
472c782
[HUDI-4910] Fix unknown variable or type "Cast" (#6778)
KnightChess Sep 26, 2022
903dd3b
[HUDI-4918] Fix bugs about when trying to show the non -existing key …
Forus0322 Sep 26, 2022
f6c9b51
[HUDI-4718] Add Kerberos kinit command support. (#6719)
paul8263 Sep 26, 2022
0b819e6
[HUDI-4902] Set default partitioner for SIMPLE BUCKET index (#6759)
xicm Sep 26, 2022
69b90f6
[MINOR] Update PR template with documentation update (#6748)
yihua Sep 26, 2022
ed7e6c9
[HUDI-4907] Prevent single commit multi instant issue (#6766)
voonhous Sep 27, 2022
33666c8
[HUDI-4923] Fix flaky TestHoodieReadClient.testReadFilterExistAfterBu…
nsivabalan Sep 27, 2022
1c12e84
[HUDI-4848] Fixing repair deprecated partition tool (#6731)
nsivabalan Sep 27, 2022
31ba568
[HUDI-4913] Fix HoodieSnapshotExporter for writing to a different S3 …
yihua Sep 27, 2022
0239652
[HUDI-4453] Fix schema to include partition columns in bootstrap oper…
yihua Sep 28, 2022
5ed94cd
[HUDI-2780] Fix the issue of Mor log skipping complete blocks when re…
hj2016 Sep 28, 2022
c439855
[HUDI-4924] Auto-tune dedup parallelism (#6802)
yihua Sep 28, 2022
8295329
[HUDI-4687] Avoid setAccessible which breaks strong encapsulation (#6…
codope Sep 28, 2022
856c1f5
[MINOR] fixing validate async operations to poll completed clean inst…
nsivabalan Sep 28, 2022
ab30021
[HUDI-4734] Deltastreamer table config change validation (#6753)
jonvex Sep 28, 2022
b36c1db
[HUDI-4934] Revert batch clean files (#6813)
nsivabalan Sep 28, 2022
0ae83cb
Revert "[HUDI-4915] improve avro serializer/deserializer (#6788)"
yuzhaojing Sep 30, 2022
2e0e8ab
[HUDI-4722] Added locking metrics for Hudi (#6502)
Sep 29, 2022
0f1130d
[HUDI-4936] Fix `as.of.instant` not recognized as hoodie config (#5616)
gnailJC Sep 29, 2022
790c4a2
[HUDI-4861] Relaxing `MERGE INTO` constraints to permit limited casti…
Sep 29, 2022
26bbcec
[HUDI-4885] Adding org.apache.avro to hudi-hive-sync bundle (#6729)
nsivabalan Sep 29, 2022
c8a5283
[HUDI-4951] Fix incorrect use of Long.getLong() (#6828)
shenshengli Sep 29, 2022
eff2056
[MINOR] Use base path URI in ITTestDataStreamWrite (#6826)
codope Sep 29, 2022
d83df6f
[HUDI-4308] READ_OPTIMIZED read mode will temporary loss of data when…
Aiden-Dong Sep 29, 2022
6fee77b
[HUDI-4237] Fixing empty partition-values being sync'd to HMS (#6821)
Sep 29, 2022
c48534d
[HUDI-4925] Should Force to use ExpressionPayload in MergeIntoTableCo…
fengjian428 Sep 29, 2022
7f73dcd
[HUDI-4957] Shade JOL in bundles to fix NoClassDefFoundError:GraphLay…
codope Sep 30, 2022
290fc65
[HOTFIX] Fix source release validate script
yuzhaojing Oct 4, 2022
ee08024
[MINOR] Fix deploy script for flink 1.15
yuzhaojing Oct 5, 2022
28cb191
[MINOR] Update release version to reflect published version 0.12.1
yuzhaojing Oct 6, 2022
5110562
[HUDI-4992] Fixing invalid min/max record key stats in Parquet metada…
Oct 7, 2022
baeff43
Bumping release candidate number 2
yuzhaojing Oct 9, 2022
a5978cd
[MINOR] Update release version to reflect published version 0.12.1
yuzhaojing Oct 17, 2022
9ca38c4
[HUDI-4966] Add a partition extractor to handle partition values with…
yihua Oct 3, 2022
e7df7ab
[MINOR] Fix testUpdateRejectForClustering (#6852)
Zouxxyy Oct 3, 2022
14d2578
[HOTFIX] Fix source release validate script (#6865)
xushiyan Oct 5, 2022
039c2e4
[HUDI-4980] Calculate avg record size using commit only (#6864)
xushiyan Oct 5, 2022
76ef1da
[MINOR] Fix deploy script for flink 1.15 (#6872)
xushiyan Oct 6, 2022
2e06842
[HUDI-4972] Fixes to make unit tests work on m1 mac (#6751)
jonvex Oct 7, 2022
e63224d
[HUDI-2786] Docker demo on mac aarch64 (#6859)
jonvex Oct 7, 2022
96c8b0b
[HUDI-4971] Fix shading kryo-shaded with reusing configs (#6873)
xushiyan Oct 8, 2022
74b0f48
Relocate apache http package (#6874)
parisni Oct 10, 2022
1202fbb
[HUDI-4975] Fix datahub bundle dependency (#6896)
xushiyan Oct 10, 2022
7dbb171
[HUDI-4993] Make DataPlatform name and Dataset env configurable in Da…
pramodbiligiri Oct 11, 2022
3aec340
[HUDI-4754] Add compliance check in github actions (#6575)
jonvex Oct 11, 2022
4c4a708
[HUDI-4994] Fix bug that prevents re-ingestion of soft-deleted Datahu…
pramodbiligiri Oct 12, 2022
ba6a3ce
[HUDI-4952] Fixing reading from metadata table when there are no infl…
nsivabalan Oct 12, 2022
a2e0914
[HUDI-5006] Use the same wrapper for timestamp type metadata for parq…
danny0405 Oct 13, 2022
098d34d
[HUDI-5016] Flink clustering does not reserve commit metadata (#6929)
danny0405 Oct 13, 2022
97d2114
[HUDI-3900] Fixing hdfs setup and tear down in tests to avoid flakine…
nsivabalan Oct 13, 2022
5f2fef0
[HUDI-5002] Remove deprecated API usage in SparkHoodieHBaseIndex#gene…
slfan1989 Oct 13, 2022
4eb638b
[HUDI-5010] Fix flink hive catalog external config not work (#6923)
wxplovecc Oct 14, 2022
1f3a47e
[HUDI-5033] Fix Broken Link In MultipleSparkJobExecutionStrategy (#6951)
slfan1989 Oct 15, 2022
bd6e96b
[HUDI-5037] Upgrade org.apache.thrift:libthrift to 0.14.0 (#6941)
vvsd Oct 15, 2022
b2d5332
[HUDI-5003] Fix the type of InLineFileSystem`startOffset to long (#6916)
wulei0302 Oct 17, 2022
5e933f0
[HUDI-4855] Add missing table configs for bootstrap in Deltastreamer …
yihua Oct 17, 2022
39d9b44
[MINOR] Increase maxParameters size in scalastyle (#6987)
Zouxxyy Oct 18, 2022
3cb7fe6
[MINOR] Test case for hoodie.merge.allow.duplicate.on.inserts (#6949)
Zouxxyy Oct 19, 2022
5c11e4a
[HUDI-4982] Add validation job for spark bundles in GitHub Actions (#…
xushiyan Oct 19, 2022
c038e76
[HUDI-5041] Fix lock metric register confict error (#6968)
hbgstc123 Oct 19, 2022
bd60454
[HUDI-4998] Infer partition extractor class first from meta sync part…
xicm Oct 20, 2022
4e276ce
[HUDI-4997] Use jackson-v2 import instead of jackson-v1 (#6893)
slfan1989 Oct 20, 2022
643f906
[HUDI-4995] Relocate httpcomponents (#6906)
xushiyan Oct 20, 2022
1aafffb
[HUDI-4960] Upgrade jetty version for timeline server (#6844)
rmahindra123 Oct 21, 2022
c5a973a
[HUDI-5046] Support all the hive sync options for flink sql (#6985)
danny0405 Oct 21, 2022
2c28d80
[HUDI-5063] Enabling run time stats to be serialized with commit meta…
nsivabalan Oct 22, 2022
1cd54dc
[HUDI-5070] Adding lock provider to testCleaner tests since async cle…
nsivabalan Oct 22, 2022
103dd10
[HUDI-5070] Move flaky cleaner tests to separate class (#7034)
xushiyan Oct 22, 2022
9ad09a3
[HUDI-4971] Remove direct use of kryo from `SerDeUtils` (#7014)
xushiyan Oct 23, 2022
fbeb0dd
[HUDI-5081] Tests clean up in hudi-utilities (#7033)
the-other-tim-brown Oct 24, 2022
2b4df8e
[HUDI-5027] Replace hardcoded hbase config keys with constant variabl…
slfan1989 Oct 24, 2022
185f07e
[HUDI-5061] bulk insert operation don't throw other exception except …
liufangqi Oct 24, 2022
0abbc50
[MINOR] Skip loading last completed txn for single writer (#6660)
YuweiXiao Oct 24, 2022
aab5dcd
[HUDI-4281] Using hudi to build a large number of tables in spark on …
zhangrenhua Oct 24, 2022
487b42e
[HUDI-5042] Fix clustering schedule problem in flink when enable sche…
hbgstc123 Oct 25, 2022
b32e813
[HUDI-5038] Increase default num_instants to fetch for incremental so…
yihua Oct 25, 2022
c4d98ad
[HUDI-4959] Fixing Avro's `Utf8` serialization in Kryo (#7024)
alexeykudinkin Oct 26, 2022
debc92e
[HUDI-4982] Add Utilities and Utilities Slim + Spark Bundle testing t…
jonvex Oct 26, 2022
faf57e1
[HUDI-5085]When a flink job has multiple sink tables, the index loadi…
yangxiao0320 Oct 26, 2022
5f52597
[HUDI-5058] Fix flink catalog read spark table error : primary key co…
waywtdcc Oct 26, 2022
76fe2bb
[HUDI-5087] Fix incorrect merging sequence for Column Stats Record in…
Zhangshunyu Oct 26, 2022
610258f
[HUDI-4946] fix merge into with no preCombineField having dup row by …
KnightChess Oct 27, 2022
213215b
[HUDI-3287] Remove hudi-spark dependencies from hudi-kafka-connect-bu…
codope Oct 28, 2022
b447828
[HUDI-4716] Avoid parquet-hadoop-bundle in hudi-hadoop-mr (#6930)
xushiyan Oct 28, 2022
e3e95dc
[HUDI-5083]Fixed a bug when schema evolution (#7045)
shenshengli Oct 29, 2022
a6c153a
[HUDI-5102] source operator(monitor and reader) support user uid (#7…
1032851561 Oct 29, 2022
390f25d
[MINOR] Fix typos in Spark client related classes (#7083)
lvhu-goodluck Oct 29, 2022
d03efef
[HUDI-4741] hotfix to avoid partial failover cause restored subtask t…
fengjian428 Oct 30, 2022
d411a51
[MINOR] use default maven version since it already fix the warnings r…
fengjian428 Oct 31, 2022
da337b4
Revert "[HUDI-4741] hotfix to avoid partial failover cause restored s…
danny0405 Oct 31, 2022
fab8a4f
[MINOR] Fix doc of org.apache.hudi.sink.meta.CkpMetadata#bootstrap (#…
TJX2014 Oct 31, 2022
6bde533
[HUDI-4799] improve analyzer exception tip when cannot resolve expres…
KnightChess Oct 31, 2022
11baea6
[HUDI-5096] Upgrade jcommander to 1.78 (#7068)
xicm Oct 31, 2022
d36808c
[MINOR] Fix OverwriteWithLatestAvroPayload full class name (#7096)
albericgenius Nov 1, 2022
867043e
[HUDI-5074] Warn if table for metastore sync has capitals in it (#7077)
jonvex Nov 2, 2022
7f53b7a
[HUDI-5124] Fix HoodieInternalRowFileWriter#canWrite error return tag…
slfan1989 Nov 2, 2022
a42888d
[MINOR] update commons-codec:commons-codec 1.4 to 1.13 (#6959)
lxxawfl Nov 2, 2022
a5e67b2
[HUDI-5065] Call close on SparkRDDWriteClient in HoodieCleaner (#7101)
jonvex Nov 2, 2022
0a693b3
[HUDI-4624] Implement Closable for S3EventsSource (#7086)
jonvex Nov 2, 2022
5f41a97
[HUDI-5045] Adding support to configure index type with integ tests (…
nsivabalan Nov 2, 2022
f8bad72
[HUDI-5076] Fixing non serializable path used in engineContext with m…
nsivabalan Nov 3, 2022
9524fa8
[HUDI-4880] Fix corrupted parquet file issue left over by cancelled c…
TengHuo Nov 3, 2022
3d22a19
[HUDI-5147] Flink data skipping doesn't work when HepPlanner calls co…
trushev Nov 3, 2022
a51d52c
[MINOR] Fixing broken test (#7123)
nsivabalan Nov 3, 2022
6ea28ee
[HUDI-5126] Delete duplicate configuration items PAYLOAD_CLASS_NAME (…
Nov 3, 2022
924e150
[HUDI-4989] Fixing deltastreamer init failures (#6862)
nsivabalan Nov 4, 2022
ac2d7d3
[MINOR] Fix flaky test in ITTestHoodieDataSource (#7134)
trushev Nov 4, 2022
06ac819
[HUDI-5088]Fix bug:Failed to synchronize the hive metadata of the Fli…
waywtdcc Nov 7, 2022
80d67f7
[HUDI-5066] Support flink hoodie source metaclient cache (#7017)
chenshzh Nov 7, 2022
15b37e5
[HUDI-5132] Add hadoop-mr bundle validation (#7157)
xushiyan Nov 7, 2022
c40fc8d
[HUDI-2673] Add kafka connect bundle to validation test (#7131)
xushiyan Nov 7, 2022
6e398ae
[HUDI-5025] Rollback failed with log file not found when rollOver in …
wxplovecc Nov 9, 2022
4cd1722
[HUDI-4526] Improve spillableMapBasePath when disk directory is full …
XuQianJin-Stars Nov 9, 2022
09119fb
[HUDI-5111] Improve integration test coverage (#7092)
nsivabalan Nov 9, 2022
99c5a27
[MINOR] Performance improvement of flink ITs with reused miniCluster …
trushev Nov 10, 2022
4d9f20e
[HUDI-5176] Fix incremental source to consider inflight commits befor…
yihua Nov 10, 2022
aa9265c
[MINOR] Balance CI jobs (#6838)
xushiyan Nov 11, 2022
411a768
[HUDI-5185] Fix CLI run compaction failing with --hoodieConfigs (#7168)
xicm Nov 11, 2022
bbdb77e
[HUDI-5191] Fix compatibility with avro 1.10 (#7175)
Zouxxyy Nov 12, 2022
bb73020
[HUDI-5206] RowColumnReader should not return null value for certain …
SteNicholas Nov 15, 2022
a55f70f
[HUDI-5227] Bump Javalin to 4.6.7 and Jetty to 9.4.48 (#7211)
rahil-c Nov 16, 2022
edb30dd
Use as.of.instant for IncrementalRelation (#6921)
boneanxs Nov 16, 2022
a8ed77a
[HUDI-5203] Handle null fields in debezium avro payloads (#7193)
xushiyan Nov 16, 2022
735e061
[HUDI-5233] Fix bug when InternalSchemaUtils.collectTypeChangedCols r…
trushev Nov 18, 2022
330a027
[HUDI-5070] Move flaky cleaner tests to separate class (#7251)
xushiyan Nov 21, 2022
01f043a
[HUDI-5247] Clean up java client tests (#7250)
xushiyan Nov 21, 2022
b000072
[HUDI-5244] Fix bugs in schema evolution client with lost operation f…
trushev Nov 21, 2022
8eebd66
[MINOR] Fix `TestSchemaEvolutionClient` compilation (#7256)
xushiyan Nov 21, 2022
f7927ed
[HUDI-5157] Support dropping all meta fields from source hudi table w…
nsivabalan Nov 22, 2022
d4ec501
[HUDI-5260] Fix insert into sql command with strict sql insert mode (…
kazdy Nov 24, 2022
5230a11
[HUDI-5277] Close HoodieWriteClient before exiting RunClusteringProce…
stream2000 Nov 25, 2022
a78cb09
[HUDI-5242] Do not fail Meta sync in Deltastreamer when inline table …
jonvex Nov 29, 2022
4ccee72
[HUDI-5253] HoodieMergeOnReadTableInputFormat could have duplicate re…
boneanxs Nov 29, 2022
64a359b
[HUDI-5151] Fix bug with broken flink data skipping caused by ClassNo…
trushev Nov 29, 2022
ab80838
[HUDI-5344] Fix CVE - upgrade protobuf-java to 3.18.2 (#6957)
bitmasks Dec 7, 2022
e3c9562
[HUDI-5163] Fix failure handling with spark datasource write (#7140)
nsivabalan Dec 7, 2022
8b294b0
[HUDI-5344] Fix CVE - upgrade protobuf-java (#6960)
lxxawfl Dec 8, 2022
ee60a6a
[HUDI-5345] Avoid fs.exists calls for metadata table in HFileBootstra…
yihua Dec 8, 2022
6fa7ff9
[HUDI-5291] Fixing NPE in MOR column stats accounting (#7349)
alexeykudinkin Dec 8, 2022
0e17e18
[HUDI-5346][HUDI-5320] Fixing Create Table as Select (CTAS) performan…
alexeykudinkin Dec 9, 2022
1a2ffdd
[HUDI-5350] Fix oom cause compaction event lost problem (#7408)
hbgstc123 Dec 9, 2022
5d4f59c
[HUDI-5358] Fix flaky tests in TestCleanerInsertAndCleanByCommits (#7…
yihua Dec 9, 2022
fb552ae
[HUDI-5342] Add new bulk insert sort modes repartitioning data by par…
yihua Dec 10, 2022
bbde3a9
[HUDI-5338] Adjust coalesce behavior within NONE sort mode for bulk i…
yihua Dec 10, 2022
01ba650
[HUDI-5336] Fixing parsing of log files while building file groups (#…
nsivabalan Dec 12, 2022
5e31223
[HUDI-5372] Fix NPE caused by alter table add column. (#7236)
scxwhite Dec 12, 2022
8f5723b
[MINOR] Fix Out of Bounds Exception for DayBasedCompactionStrategy (#…
vamshigv Dec 12, 2022
106d216
[HUDI-5353] Close file readers (#7412)
codope Dec 12, 2022
1743d05
[HUDI-5078] Fixing isTableService for replace commits (#7037)
nsivabalan Dec 12, 2022
8bc9fa5
[HUDI-5296] Allow disable schema on read after enabling (#7421)
nsivabalan Dec 12, 2022
4fd25ca
[HUDI-5348] Cache file slices in HoodieBackedTableMetadata (#7436)
yihua Dec 13, 2022
292630b
Avoiding costly lookups into the schema cache in `SqlTypedRecord`
Dec 14, 2022
ee8c9df
Fixing schemas used for bootstrap reader
Dec 14, 2022
738f673
[HUDI-5375] Fixing reusing file readers with Metadata reader within F…
nsivabalan Dec 15, 2022
d9a4d21
[HUDI-5187] Remove the preCondition check of BucketAssigner assign st…
danny0405 Nov 10, 2022
6d5ea00
[HUDI-5221] Make the decision for flink sql bucket index case-insensi…
danny0405 Nov 16, 2022
0542f56
[HUDI-5223] Partial failover for flink (#7208)
danny0405 Nov 16, 2022
6fa192a
[HUDI-4871][HUDI-4411] Upgrade to spark 3.3.1 & 3.2.2 (#7155)
xushiyan Dec 15, 2022
492f7d7
[HUDI-5097] Fix partition reading without partition fields table conf…
xushiyan Dec 15, 2022
fe86a83
Cherry pick for HUDI-5007 and HUDI-5228 (#7471)
danny0405 Dec 16, 2022
1f0b2dd
[HUDI-5290] Remove the lock in HoodieFlinkWriteClient#writeTableMetad…
nsivabalan Dec 16, 2022
b50dc32
[HUDI-5366] Closing metadata writer from within writeClient (0.12.2) …
nsivabalan Dec 16, 2022
1061de2
[HUDI-3661] Flink async compaction is not thread safe when use waterm…
danny0405 Dec 16, 2022
d971f3e
[HUDI-5104] Add feature flag to disable HoodieFileIndex and fall back…
yihua Dec 16, 2022
3fb6d33
[HUDI-5251] Split GitHub actions CI by spark and flink (#7265)
xushiyan Dec 15, 2022
0a53080
[HUDI-5357] Optimize deployment of release artifacts (#7419)
xushiyan Dec 16, 2022
31959f9
[HUDI-5403] Turn off metadata-table-based file listing in BaseHoodieT…
yihua Dec 17, 2022
3a4ffaf
[HUDI-5409] Avoid file index and use fs view cache in COW input forma…
codope Dec 17, 2022
de51917
Fixing build failures
nsivabalan Dec 17, 2022
94db72e
Bumping mvn version to 0.12.2-1
satishkotha Dec 17, 2022
975eb91
[HUDI-5357] Fix release build commands (#7501)
xushiyan Dec 18, 2022
aea5bb6
[MINOR] Update release version to reflect published version 0.12.2
satishkotha Dec 24, 2022
205996f
Disable precombine field in bootstrap
a49a Feb 16, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
10 changes: 10 additions & 0 deletions .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -29,3 +29,13 @@ github:
- data-integration
- apachespark
- apacheflink
features:
wiki: true
issues: true
projects: true
discussions: true
notifications:
commits: [email protected]
issues: [email protected]
pullrequests: [email protected]
jira_options: link label
16 changes: 8 additions & 8 deletions .codecov.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,6 +23,13 @@ coverage:
precision: 2
round: down
range: "50...100"
status:
project: # settings affecting project coverage
enabled: no

# do not run coverage on patch nor changes
patch: no
changes: no

# Ignoring Paths
# --------------
Expand All @@ -47,14 +54,7 @@ ignore:
- "hudi-hadoop-mr/src/main/java/com/uber/hoodie/hadoop/HoodieInputFormat.java"
- "hudi-hadoop-mr/src/main/java/com/uber/hoodie/hadoop/realtime/HoodieRealtimeInputFormat.java"

comment:
layout: "reach, diff, flags, files"
behavior: default
require_changes: false # if true: only post the comment if coverage changes
require_base: no # [yes :: must have a base report to post]
require_head: no # [yes :: must have a head report to post]
branches: # https://docs.codecov.io/docs/pull-request-comments#branches
- "master"
comment: false

flags:
hudicli:
Expand Down
2 changes: 1 addition & 1 deletion .github/ISSUE_TEMPLATE/SUPPORT_REQUEST.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ labels: question

**_Tips before filing an issue_**

- Have you gone through our [FAQs](https://cwiki.apache.org/confluence/display/HUDI/FAQ)?
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?

- Join the mailing list to engage in conversations and get faster support at [email protected].

Expand Down
51 changes: 17 additions & 34 deletions .github/PULL_REQUEST_TEMPLATE.md
Original file line number Diff line number Diff line change
@@ -1,44 +1,27 @@
## *Tips*
- *Thank you very much for contributing to Apache Hudi.*
- *Please review https://hudi.apache.org/contributing.html before opening a pull request.*
### Change Logs

## What is the purpose of the pull request
_Describe context and summary for this change. Highlight if any code was copied._

*(For example: This pull request adds quick-start document.)*
### Impact

## Brief change log
_Describe any public API or user-facing feature change or any performance impact._

*(for example:)*
- *Modify AnnotationLocation checkstyle rule in checkstyle.xml*
**Risk level: none | low | medium | high**

## Verify this pull request
_Choose one. If medium or high, explain what verification was done to mitigate the risks._

*(Please pick either of the following options)*
### Documentation Update

This pull request is a trivial rework / code cleanup without any test coverage.
_Describe any necessary documentation update if there is any new feature, config, or user-facing change_

*(or)*
- _The config description must be updated if new configs are added or the default value of the configs are changed_
- _Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the
ticket number here and follow the [instruction](https://hudi.apache.org/contribute/developer-setup#website) to make
changes to the website._

This pull request is already covered by existing tests, such as *(please describe tests)*.
### Contributor's checklist

(or)

This change added tests and can be verified as follows:

*(example:)*

- *Added integration tests for end-to-end.*
- *Added HoodieClientWriteTest to verify the change.*
- *Manually verified the change by running a job locally.*

## Committer checklist

- [ ] Has a corresponding JIRA in PR title & commit

- [ ] Commit message is descriptive of the change

- [ ] CI is green

- [ ] Necessary doc changes done or have another open PR

- [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.
- [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute)
- [ ] Change Logs and Impact were stated clearly
- [ ] Adequate tests were added if applicable
- [ ] CI passed
10 changes: 10 additions & 0 deletions .github/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
## How to update the Pull Request Template

When updating the pr template, you must consider if updates need to be made to scripts/pr_compliance.py

## What are the files in workflows?
- bot.yml: runs the hudi unit tests with various versions of scala, spark, and flink
- pr_compliance.yml: checks pr titles and main comment to make sure that everything is filled out and formatted properly
- update_pr_compliance: runs the pr_compliance tests when scripts/pr_compliance.py is updated


36 changes: 0 additions & 36 deletions .github/actions/bot/package.json

This file was deleted.

100 changes: 0 additions & 100 deletions .github/actions/bot/src/action.js

This file was deleted.

124 changes: 89 additions & 35 deletions .github/workflows/bot.yml
Original file line number Diff line number Diff line change
@@ -1,43 +1,97 @@
#
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
#

name: CI BOT
name: Java CI

on:
issue_comment:
types: [created]
push:
branches:
- master
- 'release-*'
pull_request:
branches:
- master
- 'release-*'
env:
MVN_ARGS: -ntp -B -V -Pwarn-log -Dorg.slf4j.simpleLogger.log.org.apache.maven.plugins.shade=warn -Dorg.slf4j.simpleLogger.log.org.apache.maven.plugins.dependency=warn

jobs:
bot:
test-spark:
runs-on: ubuntu-latest
steps:
- name: clone repository
uses: actions/checkout@v2
strategy:
matrix:
include:
- scalaProfile: "scala-2.11"
sparkProfile: "spark2.4"

- scalaProfile: "scala-2.12"
sparkProfile: "spark2.4"

- scalaProfile: "scala-2.12"
sparkProfile: "spark3.1"

- scalaProfile: "scala-2.12"
sparkProfile: "spark3.2"

- name: bot actions
uses: actions/github-script@v1
- scalaProfile: "scala-2.12"
sparkProfile: "spark3.3"

steps:
- uses: actions/checkout@v2
- name: Set up JDK 8
uses: actions/setup-java@v2
with:
java-version: '8'
distribution: 'adopt'
architecture: x64
- name: Build Project
env:
SCALA_PROFILE: ${{ matrix.scalaProfile }}
SPARK_PROFILE: ${{ matrix.sparkProfile }}
run:
mvn clean install -Pintegration-tests -D"$SCALA_PROFILE" -D"$SPARK_PROFILE" -DskipTests=true $MVN_ARGS
- name: Quickstart Test
env:
PROVIDER : 'apache'
REPOSITORY: 'hudi'
HUDI_TRAVIS_ORG_TOKEN: ${{ secrets.HUDI_TRAVIS_ORG_TOKEN }}
SCALA_PROFILE: ${{ matrix.scalaProfile }}
SPARK_PROFILE: ${{ matrix.sparkProfile }}
run:
mvn test -Punit-tests -D"$SCALA_PROFILE" -D"$SPARK_PROFILE" -DfailIfNoTests=false -pl hudi-examples/hudi-examples-java,hudi-examples/hudi-examples-spark $MVN_ARGS
- name: IT - Bundle Validation
env:
SCALA_PROFILE: ${{ matrix.scalaProfile }}
SPARK_PROFILE: ${{ matrix.sparkProfile }}
if: ${{ !endsWith(env.SPARK_PROFILE, '2.4') }} # skip test spark 2.4 as it's covered by Azure CI
run: |
HUDI_VERSION=$(mvn help:evaluate -Dexpression=project.version -q -DforceStdout)
./packaging/bundle-validation/ci_run.sh $HUDI_VERSION
- name: UT - Common & Spark
env:
SCALA_PROFILE: ${{ matrix.scalaProfile }}
SPARK_PROFILE: ${{ matrix.sparkProfile }}
if: ${{ !endsWith(env.SPARK_PROFILE, '2.4') }} # skip test spark 2.4 as it's covered by Azure CI
run:
mvn test -Punit-tests -D"$SCALA_PROFILE" -D"$SPARK_PROFILE" -pl hudi-common,hudi-spark-datasource/hudi-spark $MVN_ARGS

test-flink:
runs-on: ubuntu-latest
strategy:
matrix:
include:
- flinkProfile: "flink1.13"
- flinkProfile: "flink1.14"
- flinkProfile: "flink1.15"
steps:
- uses: actions/checkout@v2
- name: Set up JDK 8
uses: actions/setup-java@v2
with:
script: |
const path = require('path')
const scriptPath = path.resolve('.github/actions/bot/src/action.js')
require(scriptPath)({core}, {context}, {github})
java-version: '8'
distribution: 'adopt'
architecture: x64
- name: Build Project
env:
FLINK_PROFILE: ${{ matrix.flinkProfile }}
run:
mvn clean install -Pintegration-tests -Dscala-2.12 -D"$FLINK_PROFILE" -Davro.version=1.10.0 -DskipTests=true $MVN_ARGS
- name: Quickstart Test
env:
FLINK_PROFILE: ${{ matrix.flinkProfile }}
run:
mvn test -Punit-tests -Dscala-2.12 -D"$FLINK_PROFILE" -DfailIfNoTests=false -pl hudi-examples/hudi-examples-flink $MVN_ARGS
Loading