-
Notifications
You must be signed in to change notification settings - Fork 386
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
perf(gnovm): cache PkgIDFromPkgPath for higher performance #3424
perf(gnovm): cache PkgIDFromPkgPath for higher performance #3424
Conversation
🛠 PR Checks SummaryAll Automated Checks passed. ✅ Manual Checks (for Reviewers):
Read More🤖 This bot helps streamline PR reviews by verifying automated checks and providing guidance for contributors and reviewers. ✅ Automated Checks (for Contributors):🟢 Maintainers must be able to edit this pull request (more info) ☑️ Contributor Actions:
☑️ Reviewer Actions:
📚 Resources:Debug
|
Codecov ReportAll modified and coverable lines are covered by tests ✅ 📢 Thoughts on this report? Let us know! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice one, looks good for now IMO
The cache might use a little bit too much memory in the far future but we can revisit then
Please pass CI :)
@n0izn0iz I considered using a small slice say pre-allocated 100 items as the cache then linear looking up but that wasn't much different. -var pkgIDFromPkgPathCache = make(map[string]*PkgID)
+type pkgCached struct {
+ path string
+ id *PkgID
+}
+
+var pkgIDFromPkgPathCache = make([]*pkgCached, 0, 100)
func PkgIDFromPkgPath(path string) PkgID {
pkgIDMu.Lock()
defer pkgIDMu.Unlock()
- pkgID, ok := pkgIDFromPkgPathCache[path]
- if !ok {
- pkgID = new(PkgID)
- *pkgID = PkgID{HashBytes([]byte(path))}
- pkgIDFromPkgPathCache[path] = pkgID
+ // Find the entry if possible, using a slice as
+ // we don't expect too many unique packages and
+ // when small, looking up with a slice is much
+ // cheaper than using a map.
+ for _, cached := range pkgIDFromPkgPathCache {
+ if cached.path == path {
+ return *cached.id
+ }
+ }
+
+ // Now cache the value.
+ cached := &pkgCached{
+ id: PkgID{HashBytes([]byte(path))},
+ path: path,
}
- return *pkgID
+ pkgIDFromPkgPathCache = append(pkgIDFromPkgPathCache, cached)
+ return *cached.id
} A map suffices TBH even if memory increases, it is unlikely that 5,000 packages will be imported and thus we can safely assume memory bloat won't be a problem. The merits of the change to me and in experience massively outweight any minuite RAM increases: being able to even start with 8-10 seconds less allows every contributor to work much faster, and even for Gno to preserve compute resources :-) |
13b78e9
to
b88c690
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks reasonable. However, an LRU cache would prevent it from causing a DoS. I'm fine with merging now since this DoS is unlikely to occur before other issues arise, but please add a TODO comment.
It also needs a review from @thehowl, @ltzmaxwell, and @petar-dambovaliev.
Done, thanks @moul I've added the TODO about using an LRU for the future when the trade-off is needed. |
This change comes from noticing that PkgIDFromPkgPath is very heavily invoked within the VM yet its results with the same inputs produced deterministic results aka SHA256(path)[:20] Previously just spinning up the VM would take 80 seconds, with this change that's shaved by ~8-10 seconds down and with repeatable and visible results exhibited through: - Benchmark: ```shell $ benchstat before.txt after.txt name old time/op new time/op delta PkgIDFromPkgPath-8 1.96µs ± 2% 0.35µs ± 1% -82.40% (p=0.000 n=20+18) name old alloc/op new alloc/op delta PkgIDFromPkgPath-8 296B ± 0% 168B ± 0% -43.24% (p=0.000 n=20+20) name old allocs/op new allocs/op delta PkgIDFromPkgPath-8 9.00 ± 0% 7.00 ± 0% -22.22% (p=0.000 n=20+20) ``` - Profiles: * Before ```shell (pprof) list PkgIDFromPkgPath Total: 100.94s ROUTINE ======================== github.com/gnolang/gno/gnovm/pkg/gnolang.PkgIDFromPkgPath in $PATH 220ms 9.26s (flat, cum) 9.17% of Total . . 74:func PkgIDFromPkgPath(path string) PkgID { 220ms 9.26s 75: return PkgID{HashBytes([]byte(path))} . . 76:} . . 77: . . 78:// Returns the ObjectID of the PackageValue associated with path. . . 79:func ObjectIDFromPkgPath(path string) ObjectID { . . 80: pkgID := PkgIDFromPkgPath(path) ``` * After ```shell (pprof) list PkgIDFromPkgPath Total: 93.22s ROUTINE ======================== github.com/gnolang/gno/gnovm/pkg/gnolang.PkgIDFromPkgPath in $PATH 210ms 1.55s (flat, cum) 1.66% of Total 50ms 50ms 78:func PkgIDFromPkgPath(path string) PkgID { . 490ms 79: pkgIDMu.Lock() 10ms 10ms 80: defer pkgIDMu.Unlock() . . 81: 10ms 730ms 82: pkgID, ok := pkgIDFromPkgPathCache[path] . . 83: if !ok { . . 84: pkgID = new(PkgID) . . 85: *pkgID = PkgID{HashBytes([]byte(path))} . . 86: pkgIDFromPkgPathCache[path] = pkgID . . 87: } 140ms 270ms 88: return *pkgID . . 89:} . . 90: . . 91:// Returns the ObjectID of the PackageValue associated with path. . . 92:func ObjectIDFromPkgPath(path string) ObjectID { . . 93: pkgID := PkgIDFromPkgPath(path) ``` Fixes gnolang#3423
8883d82
to
91176e4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed this over a video call with @odeke-em . Great work.
I agree with @moul , we need @thehowl , @ltzmaxwell and @petar-dambovaliev to bless this as well.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice results:
goos: linux
goarch: amd64
pkg: github.com/gnolang/gno/gnovm/pkg/gnolang
cpu: AMD Ryzen 7 7840U w/ Radeon 780M Graphics
│ master.txt │ 3424.txt │
│ sec/op │ sec/op vs base │
Benchdata/fib.gno_param:4-16 7.601µ ± 1% 7.112µ ± 1% -6.43% (p=0.000 n=10)
Benchdata/fib.gno_param:8-16 56.21µ ± 2% 52.32µ ± 1% -6.91% (p=0.000 n=10)
Benchdata/fib.gno_param:16-16 2.673m ± 1% 2.499m ± 1% -6.51% (p=0.000 n=10)
geomean 104.5µ 97.60µ -6.62%
Good work.
I'm unsure about using a simple mutex; maybe a sync.Map
would be more optimized for the use-case, but let's take this improvement and maybe change into a LRU or Sieve cache at a later time.
This change comes from noticing that PkgIDFromPkgPath is very heavily invoked within the VM yet its results with the same inputs produced deterministic results aka SHA256(path)[:20] Previously just spinning up the VM would take 80 seconds, with this change that's shaved by ~8-10 seconds down and with repeatable and visible results exhibited through
Benchmark:
Profiles:
Fixes #3423