forked from apache/giraph
-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathCHANGELOG
1225 lines (732 loc) · 40.3 KB
/
CHANGELOG
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
Giraph Change Log
Release 1.1.0 - unreleased
GIRAPH-727: Support for Watts Strogatz VertexInputFormat (claudio)
GIRAPH-746: Track and log versions of dependencies (nitay)
GIRAPH-751: Build error: convertEdgeToLine in two different classes have the
same erasure, yet neither overrides the other (aarmax0 via claudio)
GIRAPH-732: EdgeOutputFormat (aarmax00 via claudio)
GIRAPH-512: JavaDoc warnings (tdn120 via nitay)
GIRAPH-736: Bring back FindBugs (nitay)
GIRAPH-735: DiskBackedPartitionStore throws NPE due to uninitialized OutEdges
(claudio)
GIRAPH-734: DiskBackedPartitionStore attempting to release a lock it doesn't own
(cmuchinsky via claudio)
GIRAPH-739: Discrepancy among numeric constants corresponding to frequency of
writing in TextAggregatorWriter (korlando via claudio)
GIRAPH-740: ArrayListWritable object is not cleared in readFields()
(korlando via claudio)
GIRAPH-692: Add Bracha Toueg Deadlock Detection in the examples
(aarmax00 via claudio)
GIRAPH-717: HiveJythonRunner with support for pure Jython value types (nitay)
GIRAPH-722: ProgressableUtils.waitForever is not calling progress (majakabiljo)
GIRAPH-549: Tinkerpop/Blueprints/Rexter InputFormat (armax00 via claudio)
GIRAPH-701: Communication improvement using one-to-all message
sending (Bingjing via aching)
GIRAPH-721: Don't call progress on each edge/vertex loaded (majakabiljo)
GIRAPH-720: Provide a way to change job name (majakabiljo)
GIRAPH-718: Output all per-superstep metrics per superstep (majakabiljo)
GIRAPH-716: Stop modifying Configuration since it's not thread-safe (majakabiljo)
GIRAPH-715: Fix MessageValueFactory performance regression (nitay)
GIRAPH-709: More flexible Jython script loading (nitay)
GIRAPH-708: Factories for creation of all IVEM types (nitay)
GIRAPH-710: Define zookeeper version in a property to allow build time
override (apurtell via nitay)
GIRAPH-705: Type converters and giraph-hive cleanup (nitay)
GIRAPH-704: Specialized message stores (majakabiljo)
GIRAPH-703: create an appropriate way to generate the options.html page using maven.
(armax00 via claudio)
GIRAPH-702: Fix multithreaded output (majakabiljo)
GIRAPH-676: A short tutorial on getting started with Giraph (boshmaf via claudio)
GIRAPH-698: Expose Computation to a user (aching)
GIRAPH-311: Master halting in superstep 0 is ignored by workers (majakabiljo)
GIRAPH-688: Make sure Giraph builds against all compatible YARN-enabled Hadoop versions,
warns if none set, works w/new 1.1.0 line (ereisman)
GIRAPH-693: Giraph-Hive check user code as soon as possible (nitay)
GIRAPH-697: Clean up message stores (majakabiljo)
GIRAPH-696: Should be able to spill giraph metrics to a specified
directory on HDFS (claudio)
GIRAPH-695: Add getCombiner and getComputation methods for
MasterCompute (aching)
GIRAPH-623: Website Documentation: table of parameters (armax00 via nitay)
GIRAPH-694: Setting configuration in GiraphConfiguration causes
non thread safe copies (aching)
GIRAPH-683: Jython for Computation (nitay)
GIRAPH-673: Input superstep should support aggregators like any
other superstep (Bingjing via aching)
GIRAPH-686: DiskBackedPartitionStore does not saveVertex after edges
are loaded (claudio)
GIRAPH-681: Graphviz Output Format (nitay)
GIRAPH-468: Make Vertex an Interface (nitay)
GIRAPH-667: all workers suspended at 'saveVertices' when use
out of core (claudio)
GIRAPH-670: [easy] Example Max Computation (nitay)
GIRAPH-675: Mutable edge iterator gets corrupted by calling
vertex.getNumEdges() during iteration (apresta)
GIRAPH-512: JavaDoc warnings (emre.aladag via nitay)
GIRAPH-620: Website Documentation: How to use Hive I/O with Giraph (nitay)
GIRAPH-667: Decouple Vertex data and Computation, make Computation
and Combiner classes switchable (majakabiljo)
GIRAPH-608: Spelling error in Combiner.java (Michael Aro via aching)
GIRAPH-666: Netty execs threads and metrics threads don't get
cleaned up properly (aching)
GIRAPH-665: Reduce ZooKeeper output in tests by changing log level from
INFO to ERROR (aching)
GIRAPH-664: Bump HiveIO dep - fixes tests with hadoop facebook
(nitayj via majakabiljo)
GIRAPH-622: Website Documentation: Out-of-core (claudio)
GIRAPH-659: giraph-hive tests all fail (majakabiljo)
GIRAPH-663: Fix HiveIO metastore host setting (nitay)
GIRAPH-661: Munge symbols helper (nitay)
GIRAPH-662: Incubator release should point to archived (aching)
GIRAPH-655: Fix Hive Build for Hadoop 2.0.0 (nitay)
GIRAPH-660: Move the snapshot to 1.1.0 and also add a releases page for
the 1.0 release (aching)
GIRAPH-560: Input filtering (nitay)
GIRAPH-621: Website Documentation: Basic Design Document (aching)
GIRAPH-658: Remove final modifier from SimpleHiveToEdge.initializeRecords
(majakabiljo)
GIRAPH-656: Input from multiple tables doesn't work with multithreading
(majakabiljo)
GIRAPH-657: Remove unused reuseIncomingEdgeObjects option (apresta)
GIRAPH-592: YourKit profiler (nitay)
GIRAPH-618: Website Documentation: Aggregators (and sharded aggregators)
(majakabiljo)
GIRAPH-651: giraph-hive tests (nitay)
GIRAPH-639: Add support for multiple Vertex/Edge inputs (majakabiljo)
GIRAPH-653: Hadoop_non_secure broken (majakabiljo)
GIRAPH-650: Exception in GiraphConfiguration initialization (majakabiljo)
GIRAPH-648: Allow IO formats to add parameters to Configuration (majakabiljo)
GIRAPH-635: Website Documentation: Missing presentations (ssc)
GIRAPH-647: Update HiveIO to 0.8 (nitay)
GIRAPH-624: ByteArrayPartition reports 0 aggregate edges when used with
DiskBackedPartitionStore (claudio)
GIRAPH-636: Initialize compute OutEdges directly from input OutEdges
(majakabiljo)
GIRAPH-645: Specify maven deploy plugin version (nitay)
GIRAPH-643: Maven Release settings more friendly to Git (nitay)
Release 1.0.0 - 2013-04-15
GIRAPH-633: Rename VertexEdges to OutEdges (apresta)
GIRAPH-634: InternalGiraphRunner with TestGraph is broken (majakabiljo)
GIRAPH-617: Website Documentation: Vertex/Edge Input/Output (apresta)
GIRAPH-634: InternalGiraphRunner with TestGraph is broken (majakabiljo)
GIRAPH-630: Convergence detection broken in
o.a.g.examples.PageRankVertex. (ssc via aching)
GIRAPH-627: YARN build profile is broken. (rvs via aching)
GIRAPH-628: Can't build Giraph without git due to mavanagaiata not
configured right (aching).
GIRAPH-595: InternalVertexRunner.run() should take GiraphConfiguration, not GiraphClasses
(nitay via apresta)
GIRAPH-364: Clean up directories created by test suite (majakabiljo)
GIRAPH-614: SplitMasterWorker=false is broken (majakabiljo)
GIRAPH-616: Decouple vertices and edges in DiskBackedPartitionStore and avoid writing
back edges when the algorithm does not change topology. (claudio)
GIRAPH-613: Remove Writable from the interfaces implemented by Vertex (claudio)
GIRAPH-543: Fix PageRankBenchmark and make WeightedPageRankBenchmark (majakabiljo)
GIRAPH-615: Add support for multithreaded output (majakabiljo)
GIRAPH-612: Improve website for upcoming release (aching)
GIRAPH-527: readVertexInputSplit is always reporting 0 vertices and 0 edges (nitay)
GIRAPH-611: Vertex/EdgeReaderWrapper should configure inner reader (majakabiljo)
GIRAPH-609: More information on runtime exceptions for Callables (aching)
GIRAPH-607: Hive IO bump (nitay)
GIRAPH-564: Input/output formats and readers/writers should implement
ImmutableClassesGiraphConfigurable (aching)
GIRAPH-604: Clean up benchmarks (majakabiljo)
GIRAPH-605: Worker crashes if its vertices have no edges when using edge input (majakabiljo)
GIRAPH-603: AbstractVertexToHive doesn't need message type (majakabiljo)
GIRAPH-589: Remove unnecessary generics from input formats (majakabiljo)
GIRAPH-536: Clean up configuration options (apresta)
GIRAPH-551: Fix log message at the end of edge input (apresta)
GIRAPH-555: Fix log message on generated input splits (apresta)
GIRAPH-602: HiveGiraphRunner should allow disabling/choosing of input
formats at runtime (aching)
GIRAPH-13: Port Giraph to YARN (ereisman)
GIRAPH-600: Create an option to do output during computation (majakabiljo)
GIRAPH-599: Hive IO dependency issues with some Hadoop profiles (nitay via majakabiljo)
GIRAPH-577: Create a testing framework that doesn't require I/O formats (ves via apresta)
GIRAPH-593: Update Hive IO performance improvements (nitay)
GIRAPH-594: auto set reusing objects (nitay)
GIRAPH-597: Don't reuse vertex by default in SimpleHiveToVertex (majakabiljo)
GIRAPH-588: More flexible Hive input (majakabiljo)
GIRAPH-587: Refactor configuration options (nitay)
GIRAPH-581: More flexible Hive output (majakabiljo)
GIRAPH-579: Make it possible to use different out-edges data structures
for input and computation (apresta)
GIRAPH-582: Create a generic option for determining the number of
supersteps that a job runs for (aching)
GIRAPH-586: Customizable default vertex value (apresta)
GIRAPH-580: NPE in HiveGiraphRunner when the vertex output format is
not defined (aching)
GIRAPH-510: Remove HBase Cruft (kelarini via nitay)
GIRAPH-566: Make option for aggregators to be configurable (majakabiljo)
GIRAPH-575: update hive-io (nitay)
GIRAPH-576: BspServiceMaster.failureCleanup() shouldn't pass null in
observers' applicationFailed() method (jgarms via nitay)
GIRAPH-547: Allow in-place modification of edges (apresta)
GIRAPH-537: Fix log messages produced by aggregators (majakabiljo)
GIRAPH-480: Add convergence detection to org.apache.giraph.examples.RandomWalkVertex (ssc)
GIRAPH-565: Make an easy way to gather some logs from workers on master (majakabiljo)
GIRAPH-559: use hive-io-experimental release (nitay)
GIRAPH-562: Implement getConf/setConf in AbstractHive classes (majakabiljo)
GIRAPH-561: Only print vertex/edge input classes if not null (aching)
GIRAPH-554: Set PartitionContext in InternalVertexRunner (majakabiljo)
GIRAPH-559: Giraph build breaks (nitay)
GIRAPH-550: HiveVertexReader should call initialize, not users (nitay)
GIRAPH-523: MasterObserver should pass in superstep number (gaurav.menghani via nitay)
GIRAPH-556: Race condition in EdgeStore (apresta)
GIRAPH-528: Decouple vertex implementation from edge storage (apresta)
GIRAPH-553: Cleanup HCatalogVertexOutputFormat (majakabiljo)
GIRAPH-545: Improve Facebook Hadoop dependency (nitay)
GIRAPH-541: Log before observers (nitay)
GIRAPH-535: Range-partitioning and edge locality benchmark (apresta)
GIRAPH-539: When having open requests log which workers are they sent to (majakabiljo)
GIRAPH-530: GiraphInputFormat#getSplits() should be aware of multithreaded input (apresta)
GIRAPH-532: Give an explanation when trying to use unregistered aggregators (majakabiljo)
GIRAPH-453: Pure Hive I/O (nitay)
GIRAPH-526: HiveGiraphRunner - bug with setting database name (majakabiljo)
GIRAPH-518: Support Hadoop-2.0.3-alpha release on Giraph (ereisman)
GIRAPH-525: Add PartitionClass to the ConfigurationUtils (claudio)
GIRAPH-514: DiskBackedMessageStores should take advantage of machines with multiple disks (claudio)
GIRAPH-520: ReverseEdgeDuplicator (nitay)
GIRAPH-522: JMap Dumper (nitay)
GIRAPH-517: Use stable hcatalog 0.5.0-incubating (nitay)
GIRAPH-503: Refactor platform-independent CLI argument parsing in GiraphRunner into a separate class (ereisman)
GIRAPH-519: EdgeFactory (nitay)
GIRAPH-515: More efficient and flexible edge-based input (apresta)
GIRAPH-516: out-of-core messages dies for ArrayIndexOutOfBoundsException when
running out-of-core messages in UnsafeByteArrayOutputStream (majakabiljo)
GIRAPH-513: OnDiskPartitionStore should take advantage of multiple disks (claudio)
GIRAPH-511: DNS interface and nameserver for multi-interface machines (claudio)
GIRAPH-470 (tavoaqp via nitay)
GIRAPH-504: Create PartitionContext (majakabiljo)
GIRAPH-499: Giraph should not reserve minimum reduce slot memory 1024 since we never use it (ereisman)
GIRAPH-508: Increase the limit on the number of partitions (majakabiljo)
GIRAPH-509: Factor out AggregatorUsage (majakabiljo)
GIRAPH-505: Metrics Updates (nitay)
GIRAPH-506: Concurrency issue - response can arrive before request is added to the outstanding map (majakabiljo)
GIRAPH-501: WorkerObserver (nitay)
GIRAPH-502: In PageRankBenchmark, remove unneeded handling of -t 2 (ekoontz)
GIRAPH-500: Refactor job launch code out of graph package and into job package (ereisman)
GIRAPH-493: Remove EdgeWithSource (nitay)
GIRAPH-429: Number of input split threads set to 1 less than necessary (majakabiljo)
GIRAPH-498: We should check input splits status from zookeeeper once per worker,
not once per split thread (majakabiljo)
GIRAPH-497: Limiting number of open requests doesn't work with multithreading (majakabiljo via ereisman)
GIRAPH-461: Convert static assignment of in-memory partitions with LRU cache (claudio)
GIRAPH-494: Make Edge an interface (nitay)
GIRAPH-492: Saving vertices has no status report, making it hard to
find DFS issues (aching)
GIRAPH-312: Giraph needs an admin script (ereisman)
GIRAPH-469: Refactor GraphMapper (ereisman)
GIRAPH-491: Observer for job lifecycle (nitay)
GIRAPH-490: Constants for GiraphStats / GiraphTimers (nitay)
GIRAPH-488: ArrayOutOfBoundsException in org.apache.giraph.worker.InputSplitPathOrganizer (ereisman)
GIRAPH-418: Create maven profile for CDH 4.1.2 (ekoontz)
GIRAPH-487: VertexInputPath in GiraphRunner refers to EdgeInputPath (taguan via apresta)
GIRAPH-481: mvn compile error when building jar file - Error with Accumulo (tavoaqp via nitay)
GIRAPH-472: Refactor MapFunctions enum to be more general (ereisman)
GIRAPH-254: Constant Variable name misspelled in HashMasterPartitioner.java (Gustavo Salazar Torres via ereisman)
GIRAPH-431: Support edge and vertex value input formats in GiraphRunner (apresta)
GIRAPH-477: Fetching locality info in InputSplitPathOrganizer causes jobs to hang (apresta via ereisman)
GIRAPH-459: Group Vertex Mutations by Partition ID (claudio)
GIRAPH-473: InputSplitPathOrganizer should be aware of multiple threads (apresta via ereisman)
GIRAPH-478: Bring back jar-with-deps for giraph-hcatalog (nitay)
GIRAPH-474: Add an oprtion not to use direct byte buffers. (majakabiljo via ereisman)
GIRAPH-476: SequenceFileVertexOutputFormat (nitay)
GIRAPH-409: Refactor / cleanups (nitay)
GIRAPH-465: MapFunctions cleanup (nitay)
GIRAPH-464: MasterObserver#applicationFailed callback (nitay)
GIRAPH-458: split formats module into accumulo,hbase,hcatalog (nitay)
GIRAPH-463: Create VertexResolver only once (apresta)
GIRAPH-457: update module names (nitay)
GIRAPH-455: Add meta information to built jars (nitay)
GIRAPH-456: Log where master is on every host (nitay)
GIRAPH-141: Multigraph support in Giraph (apresta)
GIRAPH-452: Fix hcatalog jar (nitay)
GIRAPH-450: post-review not working after switch to git (nitay)
GIRAPH-449: License header missing from for-each-profile.sh (apresta)
GIRAPH-447: Clean up GiraphConfiguration related things (nitay)
GIRAPH-445: Max message request size in bytes, initialize buffers to
expected size (majakabiljo)
GIRAPH-444: Cleanup VertexResolver (nitay)
GIRAPH-446: Add a proper timeout for waiting for workers to join a
superstep. (aching)
GIRAPH-443: Properly size netty buffers when encoding requests (majakabiljo)
GIRAPH-395: No need to make HashWorkerPartitioner thread-safe. (aching)
GIRAPH-441: Keep track of connected channels in NettyServer (majakabiljo)
GIRAPH-440: ProgressableUtils - TimeoutException from future.get shouldn't
be rethrown (majakabiljo)
GIRAPH-438: When checkpointing is disable, fast fail (aching)
GIRAPH-437: Missing progress calls when stopping Netty server (majakabiljo)
GIRAPH-439: Fix naming of input superstep counter (apresta)
GIRAPH-424: Fix hashCode modulo computation (majakabiljo)
GIRAPH-396: HcatalogVertexInputFormat outputs a bit too often. (aching)
GIRAPH-435: Serialize server messages for memory and less GC. (aching)
GIRAPH-420: build formats in profiles where it works. (nitay)
GIRAPH-421: Aggregate metrics up to master. (nitay)
GIRAPH-434: Remove unused InputSplitPathOrganizer from BspServiceWorker (apresta via ereisman)
GIRAPH-405: Edge-based input from HCatalog (apresta)
GIRAPH-433: Vertex#initialize() shouldn't be called with null edges (apresta)
GIRAPH-416: MasterObserver for user post-application customization (nitay)
GIRAPH-427: Add committer information for Nitay Joffe to pom.xml (nitay)
GIRAPH-417: Serialize the graph/message cache into byte[] for
improving memory usage and compute speed. (aching)
GIRAPH-386: ClassCastException when giraph.SplitMasterWorker=false
(majakabiljo)
GIRAPH-423: Allow overriding addEdge (apresta)
GIRAPH-422: Setting the log level of the root logger to the same level
can cause issues in Hadoop. (aching)
GIRAPH-415: Refactor / cleanup Hadoop Counters (nitay via majakabiljo)
GIRAPH-413: Make building Facebook profile easier. (nitay via aching)
GIRAPH-407: Metrics Update (nitay via apresta)
GIRAPH-404: More SendMessageCache improvements (majakabiljo)
GIRAPH-412: Checkstyle error from Giraph-403 (majakabiljo)
GIRAPH-403: GraphMapper.notiftySentMessages need to be thread-safe
(nitay via majakabiljo)
GIRAPH-397: We should have copies of aggregators per thread
to avoid synchronizing on aggregate() (majakabiljo)
GIRAPH-406: Enforce partition ids in [0, n-1] (majakabiljo)
GIRAPH-402: slf4j dependency bug (nitay via apresta)
GIRAPH-401: Metrics should be disabled by default (apresta)
GIRAPH-400: Refactoring broke
TestTextDoubleDoubleAdjacencyListVertexInputFormat (apresta)
GIRAPH-155: Allow creation of graph by adding edges that span
multiple workers (apresta)
GIRAPH-398: Missing a dependency (nitay via majakabiljo)
GIRAPH-394: mapreduce.job.user.classpath.first hadoop option typo
(nitay via aching)
GIRAPH-232: Add metrics system into Giraph. (nitay via aching)
GIRAPH-393: Number of input split threads should always be >= 1
(aching)
GIRAPH-388: Improve the way we keep outgoing messages (majakabiljo
via aching).
GIRAPH-389: Multithreading should intelligently allocate the thread
pools. (aching via ereisman)
GIRAPH-273: Aggregators shouldn't use Zookeeper (majakabiljo)
GIRAPH-376: Facebook Hadoop profile broken. (nitay via aching)
GIRAPH-387: GiraphRunner's better handling of configuration property
arguments. (netj via aching)
GIRAPH-385: Script for running on all profiles. (nitay via aching)
GIRAPH-384: Remove duplicate BspCase in giraph-formats-contrib (and
fix GIRAPH-382 breaking the build). (aching)
GIRAPH-382: ZooKeeperExt should handle ConnectionLossException by
retrying. (aching)
GIRAPH-381: Ensure we get the original exception from
GraphMapper#run(). (aching)
GIRAPH-379: HiveGiraphRunner should have a skipOutput option for
testing (aching)
GIRAPH-380: Hadoop_non_secure is broken (majakabiljo)
GIRAPH-372: Write worker addresses to Zookeeper;
move addresses and resolution to NettyClient (majakabiljo)
GIRAPH-373: RandomMessageBenchmark is broken (majakabiljo).
GIRAPH-374: Multithreading in input split loading and compute (aching).
GIRAPH-375: Cleaner MutableVertex API (apresta)
GIRAPH-371: Replace BspUtils in giraph-formats-contrib for
speed. (aching)
GIRAPH-369: bin/giraph broken (Nitay Joffe via ereisman)
GIRAPH-368: HBase Vertex I/O formats handle setConf() internally
(bfem via ereisman)
GIRAPH-367: Expose WorkerInfo to clients (Nitay Joffe via ereisman)
GIRAPH-370: AccumuloVertexOutputFormat public visibility for
TABLE_NAME. (bfem via aching)
GIRAPH-366: TestGraphPartitioner should use getTempPath() everywhere
GIRAPH-346: Top Level POM. (nitay via aching)
GIRAPH-200: Remove hadoop RPC and keep just netty. (apresta)
GIRAPH-363: Fix hadoop_0.23 profile broken by GIRAPH-211 (ekoontz)
GIRAPH-211: Add secure authentication to Netty IPC (ekoontz)
GIRAPH-361: Hive output partition parsing is broken (nitay via apresta)
GIRAPH-360: Keep track of the task id in ChannelRotater to send
requests without knowing the worker id upfront (aching via
ekoontz)
GIRAPH-307: InputSplit list can be long with many workers
(and locality info) and should not be re-created every time a
worker calls reserveInputSplit() (ereisman via majakabiljo)
GIRAPH-358: Rename package format->io in giraph-formats-contrib for
consistency with main package. (apresta via aching)
GIRAPH-350: HBaseVertex i/o formats are not being injected with
Configuration via Configurable interface. (bfem via aching)
GIRAPH-356: Improve ZooKeeper issues. (aching)
GIRAPH-342: Recursive ZooKeeper calls should call progress, dynamic
ZooKeeper can skip delete (aching via majakabiljo)
GIRAPH-351: Fail job early when there is no input (aching via ereisman)
GIRAPH-212: Security is busted since GIRAPH-168. (ekoontz via
aching)
GIRAPH-315: giraph-site.xml isn't read on time. (majakabiljo via
aching)
GIRAPH-325: One more progress call. (majakabiljo via aching)
GIRAPH-328: Outgoing messages from current superstep should be
grouped at the sender by owning worker, not by partition. (Eli
Reisman via aching)
GIRAPH-293: Should aggregators be checkpointed? (majakabiljo via
aching)
GIRAPH-355: Partition.readFields crashes. (maja via aching)
GIRAPH-354: Giraph Formats should use hcatalog-core. (nitayj via
aching)
GIRAPH-353: Received metrics are not thread-safe (aching via ereisman)
GIRAPH-326: Writing input splits to ZooKeeper in parallel (maja)
GIRAPH-335: Add committer information for Maja Kabiljo to pom.xml
(maja)
GIRAPH-341: Improved log messages (timing) and upgraded junit to 4.8
for better tests. (aching)
GIRAPH-352: Loaded vertices don't have their configuration set.
(aching)
GIRAPH-343: Use published hcatalog jars. (nitayj via aching)
GIRAPH-338: More Rat Ignores (Nitay Joffe via ereisman)
GIRAPH-347: GiraphConfiguration broke hcatalog build
(Nitay Joffe via ereisman)
GIRAPH-340: Added client/server ExecutionHandlers to Netty to avoid
and added WrappedAdaptiveReceiveBufferSizePredictorFactory to
debug/predict the size of the incoming messages. (aching)
GIRAPH-274: Jobs still failing due to tasks timeout during
INPUT_SUPERSTEP. (nitayj via aching)
GIRAPH-337: Make a specific Giraph configuration for Class caching
and specific Giraph configuration. (aching)
GIRAPH-334: Bugfix HCatalog Hive profile. (nitayj via aching)
GIRAPH-93: Hive input / output format. (nitayj via aching)
GIRAPH-277: Text Vertex Input/Output Format base classes overhaul.
(nitayj via aching)
GIRAPH-331: ReviewBoard post-review config. (nitayj via aching)
GIRAPH-332: Duplicate unnecessary info in giraph-formats-contrib
compile.xml. (nitay via aching)
GIRAPH-330: Ignores file for Git. (nitay via aching)
GIRAPH-327: Timesout values in BspServiceMaster.barrierOnWorkerList
(majakabiljo via ereisman)
GIRAPH-323: Check if requests are done before calling wait (majakabiljo
via ereisman)
GIRAPH-298: Reduce timeout for TestAutoCheckpoint. (majakabiljo via
aching)
GIRAPH-324: Add option to use combiner in benchmarks. (apresta via
aching)
GIRAPH-191: Random walks on graphs (Gianmarco De Francisci Morales
via ereisman)
GIRAPH-320: Provide a runtime configuration for choosing the
log level (aching via ereisman)
GIRAPH-321: Divide by 0 exception. (ereisman via aching)
GIRAPH-316: Add test-patch.sh for precommit test using Jenkins.
(hyunsik via ereisman)
GIRAPH-319: Receiving two responses for a request causes an
exception. (apresta via aching)
GIRAPH-291: PredicateLock should have a constructor to take in a
custom waiting time and additional testing (aching via ereisman)
GIRAPH-318: New Iterator in LocalityInfoSorter is not working.
(Eli Reisman via apresta)
GIRAPH-317: Add subpackages to comm (Maja Kabiljo via ereisman)
GIRAPH-301: InputSplit Reservations are clumping, leaving many workers
asleep while other process too many splits and get overloaded.
(Eli Reisman via apresta)
GIRAPH-313: Open Netty client and server on master. (majakabiljo via
aching)
GIRAPH-249: Move part of the graph out-of-core when memory is low
(apresta via aching).
GIRAPH-306: Netty requests should be reliable and implement exactly
once semantics. (aching)
GIRAPH-309: Message count is wrong. (aching via apresta)
GIRAPH-246: Periodic worker calls to context.progress() will prevent
timeout on some Hadoop clusters during barrier waits. (Eli Reisman
via aching)
GIRAPH-295: Additional Example Algorithm to compute Outdegree and
Indegree. (Sean Choi via aching)
GIRAPH-305: Adding an argument to GiraphRunner for Master Compute
classes. (Sean Choi via aching)
GIRAPH-302: Thread safety issue with sending partitions around.
(aching via apresta)
GIRAPH-303: Regression: cleanup phase happens earlier than it
should. (majakabiljo via apresta)
GIRAPH-278: Website still tries to load incubator logo (ekoontz)
GIRAPH-300) Improve netty reliability with retrying failed
connections, tracking requests, thread-safe hash partitioning
(aching via apresta).
GIRAPH-296: TotalNumVertices and TotalNumEdges are not saved in
checkpoint. (majakabiljo via apresta)
GIRAPH-297: Checkpointing on master is done one superstep later
(majakabiljo via aching).
GIRAPH-275: Restore data locality to workers reading InputSplits
where possible without querying NameNode, ZooKeeper. (Eli Reisman
via jghoman)
GIRAPH-258: Check type compatibility before submitting job.
(Eli Reisman via jghoman)
GIRAPH-218: Consolidate all I/O Format classes under one roof in
lib/ directory. (Eli Reisman via jghoman)
GIRAPH-259: TestBspBasic.testBspPageRank is broken (majakabiljo via
apresta)
GIRAPH-256: Partitioning outgoing graph data during INPUT_SUPERSTEP
by # of vertices results in wide variance in RPC message sizes. (Eli
Reisman via jghoman)
GIRAPH-290: Add committer information for Alessandro Presta to pom.xml
(apresta)
GIRAPH-286. Remove DISCLAIMER from source tree. (jghoman)
GIRAPH-287: Add option to limit the number of open requests.
(Maja Kabiljo via jghoman)
GIRAPH-262: Netty optimization to handle requests locally whenever
possible. (aching)
GIRAPH-288: Bandwidth tracking - subset of GIRAPH-262. (aching)
GIRAPH-289: Add thread and channel pooling to NettyClient and
NettyServer. (ekoontz via aching)
GIRAPH-276: Fix broken tests in pseudo-distributed mode.
(Alessandro Presta via jghoman)
GIRAPH-281: Add options to control Netty's per-channel receive and
send buffer sizes (ekoontz via aching).
GIRAPH-228: SimpleTriangleClosingVertex should not use ArrayWritable
for a vertex value. (Eli Reisman via jghoman)
GIRAPH-209: Include munge version in artifact name.
(Eli Reisman via jghoman)
GIRAPH-280: Add IntelliJ-generated *.iml and *.ipr files to Apache
Rat's <exclude> list. (ekoontz via aching).
GIRAPH-45: Improve the way to keep outgoing messages (majakabiljo
via aching).
GIRAPH-271: Regression in imports in CommunicationsInterface (netj
via aching).
GIRAPH-267: Jobs can get killed for not reporting status during
INPUT SUPERSTEP (netj via aching).
GIRAPH-266: Average aggregators don't calculate real average
(majakabiljo via aching).
GIRAPH-244: Vertex API redesign (apresta via aching).
GIRAPH-236: Add FindBugs to maven build (Jan van der Lugt via
aching).
GIRAPH-224: Netty server-side combiner (apresta via aching).
GIRAPH-251: Allow to access the distributed cache from Vertexes and
WorkerContext (Gianmarco De Francisci Morales via aching).
GIRAPH-261: Rename isQuiet variable. (Gianmarco De Francisci Morales
via jghoman).
GIRAPH-248: Generic IdentityVertex for IO testing (Sean Choi via
aching).
GIRAPH-222: GIRAPH-222 giraph-formats-contrib needs a README (bfem
via aching).
GIRAPH-257: TestBspBasic.testBspMasterCompute is broken (majakabiljo
via aching).
GIRAPH-81: Create annotations on provided algorithms for cli
(majakabiljo via aching).
GIRAPH-242: HashMapVertex stores neighbor ids twice.
(Alessandro Presta via hyunsik)
GIRAPH-241: Small typos in var names in NettyWorkerClient.java
(Eli Reisman via hyunsik)
GIRAPH-239: IntIntNullIntVertex doesn't save halted state (apresta
via aching)
GIRAPH-238: BasicVertex should have default Writable implementation
(apresta via aching)
GIRAPH-233: Small errors found by FindBugs (Jan van der Lugt via hyunsik)
GIRAPH-216: NullWritable as VertexData, EdgeData or MessageData
should be allowed. (Jan van der Lugt via jghoman)
GIRAPH-221: Make iteration over edges more explicit (apresta via aching).
GIRAPH-225: Guava version in POM.XML is really old. Updated to version 12.0.
(Eli Reisman via hyunsik)
GIRAPH-223: Need to put Giraph jar on classpath, post-GIRAPH-205.
(Eli Reisman via jghoman)
GIRAPH-213: NettyClient.stop() could deadlock according to netty.io docs.
(Eli Reisman via jghoman)
GIRAPH-127: Extending the API with a master.compute() function.
(Jan van der Lugt via jghoman)
GIRAPH-220: Default implementation of BasicVertex#sendMsgToAllEdges().
(Alessandro Presta via jghoman)
GIRAPH-217: Add SimpleTriangleClosingVertex to Giraph examples.
(Eli Reisman via jghoman)
GIRAPH-219: pom in giraph-formats-contrib should have groupId
'org.apache.giraph'. (Brian Femiano via jghoman)
GIRAPH-215: Update site to use Giraph logo and remove 'incubator'
text (ekoontz)
GIRAPH-205: Move Giraph jar to root level of tar.gz.
(Roman Shaposhnik via jghoman)
GIRAPH-206: Break out SimpleShortestPathVertex. (Eli Reisman via jghoman)
GIRAPH-210: Hadoop 1.0 profile has no activation. (jghoman)
GIRAPH-192: Move aggregators to a separate sub-package.
(Jan van der Lugt via jghoman)
GIRAPH-208: LocalTestMode's zookeeper directory is not being cleaned up
after job runs (ekoontz)
GIRAPH-194: Fix up URLs in the pom. (omalley)
GIRAPH-153: HBase/Accumulo Input and Output formats. (bfem via aching)
GIRAPH-187: SequenceFileVertexInputFormat has WritableComparable<I>
as a bounded type for I. (roman4asf via aching)
GIRAPH-20: Move temporary test files from the project directory. (ssc)
GIRAPH-37: Implement Netty-backed IPC. (aching)
GIRAPH-184: Upgrade to junit4. (Devaraj K via jghoman)
GIRAPH-176: BasicRPCCommunications has unnecessary cast of Vertex.
(Devaraj K via jghoman)
GIRAPH-175: Replace manual array copy to utility method call.
(Devaraj K via jghoman)
GIRAPH-181: Add Hadoop 1.0 profile to pom.xml. (ekoontz via aching)
GIRAPH-183: Add Claudio's FOSDEM presentation (slides and video)
to the site. (claudio)
GIRAPH-179: BspServiceMaster's PathFilter can be simplified.
(Devaraj K via jghoman)
GIRAPH-177: SimplePageRankVertex has two redundant casts.
(Devaraj K via jghoman)
GIRAPH-168: Simplify munge directive usage with new munge flag
HADOOP_SECURE (rather than HADOOP_FACEBOOK) and remove usage of
HADOOP (ekoontz via aching).
GIRAPH-85: Simplify return expression in
RPCCommunications::getRPCProxy (Eli Reisman via jghoman)
GIRAPH-171: Total time in MasterThread.run() is calculated
incorrectly (ekoontz via aching).
GIRAPH-144: GiraphJob should not extend Job (users should not be
able to call Job methods like waitForCompletion or setMapper..etc)
(aching).
GIRAPH-159: Case insensitive file/directory name matching will
produce errors on M/R jar unpack (bfem via aching).
GIRAPH-166: add '*.patch' to list of files that Apache Rat ignores
(ekoontz via aching).
GIRAPH-167: mvn -Phadoop_non_secure clean verify fails (ekoontz via
aching).
GIRAPH-163: bin/giraph script overwrites CLASSPATH if "dev
environment" detected (this also removes USER_JAR from CLASSPATH)
(metaman via aching).
GIRAPH-164: fix 5 "Line is longer than 80 characters" style errors
in GiraphRunner (ekoontz via aching).
GIRAPH-162: BspCase.setup() should catch FileNotFoundException
thrown from org.apache.hadoop.fs.FileSystem.listStatus() (ekoontz
via aching).
GIRAPH-161: Handling null messages and edges when initializing
IntIntNullIntVertex (dlogothetis via aching).
GIRAPH-156: Users should be able to set simple 'custom arguments'
via org.apache.giraph.GiraphRunner (ssc)
GIRAPH-154: Worker ports are not synched properly with its peers
(Zhiwei Gu via aching).
GIRAPH-87: Simplify boolean expression in
BspService::checkpointFrequencyMet (Eli Reisman via aching).
GIRAPH-150: PageRankBenchmark accesses wrong conf after GiraphJob is
created (aching).
GIRAPH-40: Added checkstyle for enforcement of code conventions.
All Giraph source files now pass checkstyle. (aching)
GIRAPH-148: giraph-site.xml needs Apache header. (jghoman)
GIRAPH-139: Change PageRankBenchmark to be accessible via bin/giraph.
(jghoman)
GIRAPH-143: Add support for giraph to have a conf file. (jghoman)
GIRAPH-142: _hadoopBsp should be prefixable via configuration. (jghoman)