You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In mallet branch, HDPGibbsSampler line 71-73
simply adding the length of data to totalNumberOfWords should be faster than traversing an index and adding one each time, especially for very large documents
might look like this:
THX - U R right.
Will refactor these lines anyway ;)
However; adding data to the sampler is not the place where a huge proportion of computation is spent.
In mallet branch, HDPGibbsSampler line 71-73
simply adding the length of data to totalNumberOfWords should be faster than traversing an index and adding one each time, especially for very large documents
might look like this:
totalNumberOfWords += ((FeatureSequence) corpus.get(d).getData()).getLength();
The text was updated successfully, but these errors were encountered: