garble

Commit Graph

Author	SHA1	Message	Date
Daniel Martí	5fddfe1e61	rename func and update docs on handleDirectives We've been obfuscating all linknamed names for a while now, so the part in the docs about "recording" is no longer true. All it does is transform the directives to use obfuscated names. Give it a better name and rewrite the docs.	3 years ago
Daniel Martí	d5cbf2edca	separate and rename prefillObjectMaps The name and docs on that func were wildly out of date, since it no longer has anything to do with reflection at all. We only use the linkerVariableStrings map with -literals, so we can avoid the call entirely if the flag isn't set.	3 years ago
Daniel Martí	f85492a728	split typecheck and loadPkgCache from transformer Neither of them has anything to do with transforming Go code; they simply load or compute the information necessary for doing so. Split typecheck into two functions as well. The new typecheck function only does typechecking and nothing else. A new comptueFieldToStruct func fills the fieldToStruct map, which depends on typecheck, but is not needed when computing pkgCache. This isolation also forces us to separate the code that fills pkgCache from the code that fills the in-memory-only maps in transformer, removing the need for the NOTE that we had left around as a reminder.	3 years ago
Daniel Martí	4b0b2acf6f	isolate reflect.go from updating globals directly That is, stop reusing "transformer" as the receiver on methods, and stop writing the results to the global curPkgCache struct. Soon we will need to support computing pkgCache for any dependency, not just the current package, to make the caching properly robust. This allows us to fill reflectInspector with different values. The explicit isolation also helps prevent bugs. For instance, we were calling recursivelyRecordAsNotObfuscated from transformCompile, which happens after we have loaded or saved pkgCache. Meaning, the current package sees a larger pkgCache than its dependents. In this particular case it wasn't causing any bugs, since the two reflect types in question only had unexported fields, but it's still good to treat pkgCache as read-only in transformCompile.	3 years ago
Daniel Martí	d108f21846	apply TODO to rename "cannot obfuscate" APIs They have been exclusively about reflect for over a year now. Make that clearer, and update the docs as well.	3 years ago
Daniel Martí	e079c0af43	declare a type for cachedOutput To properly make our cache robust, we'll need to be able to compute cache entries for dependencies as needed if they are missing. So we'll need to create more of these struct values in the code. Rename cachedOutput to curPkgCache, to clarify that it relates to the current package. While here, remove the "known" prefix on all pkgCache fields. All of the names still make perfect sense without it.	3 years ago
Daniel Martí	da5ddfa45d	avoid go:linkname warnings when building on tip Packages like os and sync have started using go:linknames pointing to packages outside their dependency tree, much like runtime already did. This started causing warnings to be printed while obfuscsating std: > exec garble build -o=out_rebuild ./stdimporter [stderr] # sync //go:linkname refers to syscall.hasWaitingReaders - add `import _ "syscall"` for garble to find the package # os //go:linkname refers to net.newUnixFile - add `import _ "net"` for garble to find the package > bincmp out_rebuild out PASS Relax the restriction in listPackage so that any package in std is now allowed to list packages in runtimeLinknamed, which makes the warnings and any potential problems go away. Also make these std test cases check that no warnings are printed, since I only happened to notice this problem by chance.	3 years ago
Daniel Martí	0f2b59d794	merge the two "known cannot obfuscate" maps Per the TODOs that I left myself in the last commit. As expected, this change allows tidying up the code a bit, makes our use of caching a bit more consistent, and also allows us to load the current package from the cache.	3 years ago
Daniel Martí	7d1bd13778	replace our caching inside GOCACHE with GARBLE_CACHE For each Go package we obfuscate, we need to store information about how we obfuscated it, which is needed when obfuscating its dependents. For example, if A depends on B to use the type B.Foo, A needs to know whether or not B.Foo was obfuscated; it depends on B's use of reflect. We record this information in a gob file, which is cached on disk. To avoid rolling our own custom cache, and since garble is so closely connected with cmd/go already, we piggybacked off of Go's GOCACHE. In particular, for each build cache entry per `go list`'s Export field, we would store a "garble" sibling file with that gob content. However, this was brittle for two reasons: 1) We were doing this without cmd/go's permission or knowledge. We were careful to use filename suffixes similar to Export files, meaning that `go clean` and other commands would treat them the same. However, this could confuse cmd/go at any point in the future. 2) cmd/go trims cache entries in GOCACHE regularly, to keep the size of the build and test caches under control. Right now, this means that every 24h, any file not accessed in the last five days is deleted. However, that trimming heuristic is done per-file. If the trimming removed Garble's sibling file but not the original Export file, this could cause errors such as "cannot load garble export file" which users already ran into. Instead, start using github.com/rogpeppe/go-internal/cache, an exported copy of cmd/go's own cache implementation for GOCACHE. Since we need an entirely separate directory, we introduce GARBLE_CACHE, defaulting to the "garble" directory inside the user's cache directory. For example, on Linux this would be ~/.cache/garble. Inside GARBLE_CACHE, our gob file cache will be under "build", which helps clarify that this cache is used when obfuscating Go builds, and allows placing other kinds of caches inside GARBLE_CACHE. For example, we already have a need for storing linker binaries, which for now still use their own caching mechanism. This commit does not make our cache properly resistant to removed files. The proof is that our seed.txtar testscript still fails the second case. However, we do rewrite all of our caching logic away from Export files, which in itself is a considerable refactor, and we add a few TODOs. One notable change is how we load gob files from dependencies when building the cache entry for the current package. We used to load the gob files from all packages in the Deps field. However, that is the list of all _transitive_ dependencies. Since these gob files are already flat, meaning they contain information about all of their transitive dependencies as well, we need only load the gob files from the direct dependencies, the Imports field. Performance is largely unchanged, since the behavior is similar. However, the change from Deps to Imports saves us some work, which can be seen in the reduced mallocs per obfuscated build. It's unclear why the binary size isn't stable. When reverting the Deps to Imports change, it then settles at 5.386Mi, which is almost exactly in between the two measurements below. I'm not sure why, but that metric appears to be slightly unstable. goos: linux goarch: amd64 pkg: mvdan.cc/garble cpu: AMD Ryzen 7 PRO 5850U with Radeon Graphics │ old │ new │ │ sec/op │ sec/op vs base │ Build-8 11.09 ± 1% 11.08 ± 1% ~ (p=0.796 n=10) │ old │ new │ │ bin-B │ bin-B vs base │ Build-8 5.390Mi ± 0% 5.382Mi ± 0% -0.14% (p=0.000 n=10) │ old │ new │ │ cached-sec/op │ cached-sec/op vs base │ Build-8 415.5m ± 4% 421.6m ± 1% ~ (p=0.190 n=10) │ old │ new │ │ mallocs/op │ mallocs/op vs base │ Build-8 35.43M ± 0% 34.05M ± 0% -3.89% (p=0.000 n=10) │ old │ new │ │ sys-sec/op │ sys-sec/op vs base │ Build-8 5.662 ± 1% 5.701 ± 2% ~ (p=0.280 n=10)	3 years ago
Daniel Martí	cee53a7868	make GarbleActionID a full sha256 hash This is in preparation for the switch to Go's cache package, whose ActionID type is also a full sha256 hash with 32 bytes. We were using "short" hashes as shown by `go tool buildid`, since that was consistent and 15 bytes was generally enough.	3 years ago
Daniel Martí	7872177381	CI: try macos-latest again And bump go-internal to its latest version, to include its fix for those pesky "signal: killed" failures on macos. While here, run the tests with -short on GOARCH=386, and make our use of actions/setup-go a bit more consistent.	3 years ago
Daniel Martí	b4fa94e45b	use go:build in script/imports.txtar This form has been preferred over +build since Go 1.17.	3 years ago
Daniel Martí	414e3b7f70	tidy our build ID hash code a bit First, rename "component" to "hash", since it's shorter and more useful. A full build ID is two or four hashes joined with slashes. Second, add sanity checks that buildIDHashLength is being followed. Otherwise the use of []byte could lead to human error. Third, move all the hash encoding and decoding logic together.	3 years ago
Daniel Martí	0c9a59127a	rename cache global to sharedCache Since we will start importing github.com/rogpeppe/go-internal/cache, and I don't want to have to rename it or leave confusion around.	3 years ago
Daniel Martí	bdcb80ee63	adapt to tip's error message change from "GOROOT" to "std"	3 years ago
Daniel Martí	9f50e1a8a5	tweak when we read and write cachedOutput files We first called the typecheck method, which starts filling cachedOutput with information from the current package, and later we would load the gob files for all dependencies via loadCachedOutputs. This was a bit confusing; instead, load the cached gob files first, and then do all the operations which fill information for curPkg. Similarly, we were waiting until the very end of transformCompile to write curPkg's cachedOutput gob file to the disk cache. We can write the file at an earlier point, before we have obfuscated and re-printed all Go files for the current package. We can also write the file before other work like processImportCfg. None of these changes should affect garble's behavior, but they will make the cache redesign for #708 easier.	3 years ago
Daniel Martí	744e9a375a	suggest a command when asking the user to rebuild garble See a user's apparent confusion in #738.	3 years ago
Daniel Martí	83ee4d0509	internal/literals: add fuzzer To inevstigate #721, I wrote this fuzzer to see if any particular combination of string literals and literal obfuscators would result in a broken program. I didn't find anything, but I reckon this fuzzer can still be useful.	3 years ago
Daniel Martí	b129987b46	CI: upgrade go tip The previous commits fix support for the latest master versions.	3 years ago
Daniel Martí	6933b42920	internal/linker: add README with docs Copying these examples in each of the PRs doesn't feel necessary.	3 years ago
Daniel Martí	5adb4ac800	internal/linker: generate 1.20 patches again From https://github.com/burrowers/go-patches/pull/3.	3 years ago
Daniel Martí	6e1e750755	internal/linker: add working patches for Go 1.21 (master) From https://github.com/burrowers/go-patches/pull/4.	3 years ago
Daniel Martí	14daadbf85	internal/linker: patch each major Go version separately Go 1.21 already breaks one of the three patches. The complexity of the patches will likely increase, and we only ever need to actively maintain the highest set of patches, so splitting by major version feels natural.	3 years ago
Daniel Martí	d6fd552245	internal/linker: show `git apply` stderr on error Otherwise we don't see why a patch failed to apply.	3 years ago
lu4p	1526ce7fd2	rework reflection detection with ssa (#732 ) This is significantly more robust, than the ast based detection and can record very complex cases of indirect parameter reflection. Fixes #554	3 years ago
Daniel Martí	b0ff2fb133	fix encoding/asn1 marshaling of pkix types See the added comment and test, which failed before the fix when the encoded certificate was decoded. It took a while to find the culprit in asn1, but in hindsight it's easy: case reflect.Slice: if t.Elem().Kind() == reflect.Uint8 { return false, TagOctetString, false, true } if strings.HasSuffix(t.Name(), "SET") { return false, TagSet, true, true } return false, TagSequence, true, true Fixes #711.	3 years ago
Daniel Martí	9044b1d31c	add a test to reproduce our caching bugs The test passes, because it expects garble to fail the same way that users are seeing in practice. For #708.	3 years ago
Daniel Martí	950ddaf503	CI: skip regular `go test` on linux See the added comment. An alternative would be to split up the work into more jobs, but that doesn't feel necessary, and we still have a global limit on how many free runners we can run jobs on at once. While here, update the comments, and remove unnecessary step names.	3 years ago
Daniel Martí	23f9f54102	scripts: no need for this directory to be a Go package It's perfectly fine for bench_literals.go to be a main package itself, but the scripts directory doesn't need to be a Go package This was bothering me at times; for example, "go list ./..." would show the scripts directory, or "go install mvdan.cc/garble/..." would build and install a "scripts" binary as well. The file can still be built or run directly, like either of: go build bench_literals.go go run bench_literals.go	3 years ago
Daniel Martí	4a13321052	go.mod: bump deps	3 years ago
Daniel Martí	e6fc593f68	set testscript's RequireExplicitExec and RequireUniqueNames The first makes our test scripts more consistent, as all external program executions happen via "exec" and are not as easily confused with custom builtin commands like our "generate-literals". The second catches mistakes if any of our txtar files have duplicate files, where all but one of the contents would be ignored before.	3 years ago
Daniel Martí	7bb040e635	update to Go tip from April 28th A couple of new packages in runtimeAndDeps, and go list's Package.DepsErrors may now include package build errors which we want to ignore as we would print those as duplicates.	3 years ago
Dominic Breuker	b587d8c01a	use the "simple" obfuscator for large literals Changes literal obfuscation such that literals of any size will be obfuscated, but beyond `maxSize` we only use the `simple` obfuscator. This one seems to apply AND, OR, or XOR operators byte-wise and should be safe to use, unlike some of the other obfuscators which are quadratic on the literal size or worse. The test for literals is changed a bit to verify that obfuscation is applied. The code written to the `extra_literals.go` file by the test helper now ensures that Go does not optimize the literals away when we build the binary. We also append a unique string to all literals so that we can test that an unobfuscated build contains this string while an obfuscated build does not.	3 years ago
Daniel Martí	4d7546703a	update gotip in CI and fix -tiny on the latest tip printOneCgoTraceback now returns a boolean rather than an int. Since we need to have different logic based on the Go version, and toolchainVersionSemver was only set for the main process, move the string to the shared cache global. This is a nice thing to do anyway, to reduce the number of globals. While here, update actions/setup-go to v4, which starts caching GOMODCACHE and GOCACHE by default now. Disable it, because it still doesn't help in our case, and GitHub's Actions caching is still really inefficient. And update staticcheck too.	3 years ago
Daniel Martí	cd003eade0	support `garble run` It's the third time I've wanted it to quickly test out standalone reproducers when people submit bugs. Fixes #661.	3 years ago
Daniel Martí	d1da066120	print chosen seed when building with -seed=random The seedFlag.random field had never worked, as my refactor in December 2021 never set it to true. Even if the boolean was working, we only printed the random seed when we failed. It's still useful to see it when a build succeeds, for example when wanting to reproduce the same binary or when wanting to reverse a panic from the produced binary. Add a test this time. Fixes #696.	3 years ago
Daniel Martí	a186419d3d	avoid rebuilding garble in the main benchmark Similar to what testscript does, we can reuse the test binary by telling TestMain to run the main function rather than the Go tests. This saves a few hundred milliseconds out of each benchmark run.	3 years ago
Daniel Martí	1f39d0af72	update gotip and adapt to upstream changes https://go.dev/cl/466095 lightly refactored the runtime in a way that broke extractNameOff. In particular, the code func cfuncname(f funcInfo) *byte { if !f.valid() \|\| f.nameOff == 0 { return nil } return &f.datap.funcnametab[f.nameOff] } func funcname(f funcInfo) string { return gostringnocopy(cfuncname(f)) } is now simply func funcname(f funcInfo) string { if !f.valid() { return "" } return f.datap.funcName(f.nameOff) } Since extractNameOff looked for the func named cfuncname, and looked for the nameOff selector inside an index expression, all of that code no longer worked properly. It all existed to find the name of the field, nameOff, so that we would automatically adapt if upstream renames it. Unsurprisingly, the code using the field got refactored first. It doesn't seem like the extra code on our part is helping us, and assuming the name of the field works for all Go versions, so do that instead. If upstream does rename the field in the future, the obfuscated Go builds will start failing in an obvious way. If or when that comes to pass, we can change our constant string.	3 years ago
Daniel Martí	9d04637009	ensure the alignment of sync/atomic types works Added in Go 1.19, types like sync/atomic.Uint64 are handy, because they ensure proper alignment even on 32-bit GOOSes. However, this was done via a magic `type align64 struct{}`, which the compiler spotted by name. To keep that magic working, do not obfuscate the name. Neither package path was being obfuscated, as both packages contain compiler intrinsics already. Fixes #686.	3 years ago
Daniel Martí	30927da637	avoid panic when a package's imports cannot be loaded I mistakenly understood that, when the DepsErrors field has errors, the Error field would contain an error as well. That is not always the case; for example, the imports_missing package in the added test script had DepsErrors set but Error empty, causing a nil dereference panic. Make the code more robust, and report both kinds of load errors. Fixes #694.	3 years ago
Daniel Martí	cf49f7f3b1	use the host's GOEXE when building the linker We're building the linker binary for the host GOOS, not the target GOOS that we happen to be building for. I noticed that, after running `go test`, my garble cache would contain both link and link.exe, which made no sense as I run linux and not windows. `go env` has GOHOSTOS to mirror GOOS, but there is no GOHOSTEXE to mirror GOEXE, so we reconstruct it from runtime.GOOS, which is equivalent to GOHOSTOS. Add a regression test as well.	3 years ago
pagran	b87830eb97	use git in english when matching its output See `db58416d7f`. Fixes #698.	3 years ago
Daniel Martí	059c1d68e2	use fewer build flags when building std or cmd When we use `go list` on the standard library, we need to be careful about what flags are passed from the top-level build command, because some flags are not going to be appropriate. In particular, GOFLAGS=-modfile=... resulted in a failure, reproduced via the GOFLAGS variable added to linker.txtar: go: inconsistent vendoring in /home/mvdan/tip/src: golang.org/x/crypto@v0.5.1-0.20230203195927-310bfa40f1e4: is marked as explicit in vendor/modules.txt, but not explicitly required in go.mod golang.org/x/net@v0.7.0: is marked as explicit in vendor/modules.txt, but not explicitly required in go.mod golang.org/x/sys@v0.5.1-0.20230208141308-4fee21c92339: is marked as explicit in vendor/modules.txt, but not explicitly required in go.mod golang.org/x/text@v0.7.1-0.20230207171107-30dadde3188b: is marked as explicit in vendor/modules.txt, but not explicitly required in go.mod To ignore the vendor directory, use -mod=readonly or -mod=mod. To sync the vendor directory, run: go mod vendor To work around this problem, reset the -mod and -modfile flags when calling "go list" on the standard library, as those are the only two flags which alter how we load the main module in a build. The code which builds a modified cmd/link has a similar problem; it already reset GOOS and GOARCH, but it could similarly run into problems if other env vars like GOFLAGS were set. To be on the safe side, we also disable GOENV and GOEXPERIMENT, which we borrow from Go's bootstrapping commands.	3 years ago
pagran	6a8dda9b8e	Add pagran to FUNDING.yml	3 years ago
Daniel Martí	9598e439ed	CI: test against a newer Go tip While here, update staticcheck.	3 years ago
Daniel Martí	6c4274c326	scripts: remove TODO about building third party programs `go build ./...` does indeed compile and link main packages, it just does not move the resulting binaries anywhere permanent like `go install` does. As such, the TODO isn't relevant; the fact that we build all packages inside each module means we are already linking any binaries matched via `./...` from the module root. We don't run any of the binaries, which would catch panics at run-time, but we already have a note at the top about using `garble test`.	3 years ago
Daniel Martí	d0c8c8d844	scripts: start checking google.golang.org/protobuf The current garble release is able to obfuscate it with Go 1.20. While here, re-generate all files to use "go 1.20" directives, and add a TODO about also testing binary builds for each project. See #600.	3 years ago
Daniel Martí	bb69facbd8	cut test time with coverage by half Per the inline comment, we want to build every package in the test suite at least once, otherwise we won't get proper code coverage information. For example, some code only triggers when obfuscating the runtime, and if I run "go test" twice in a row, the second might reuse the first's obfuscation of the runtime and skip the code entirely. However, per the TODO, we used to solve this in a rather wasteful way, by making each test use its own separate GOCACHE directory. Instead, use a new GOCACHE directory, but share it between tests. This is still enough, as we still build each package at least once. Running the command below twice in a row with Go 1.20: go test -cover took about 1m55s on my laptop before, and now takes about 1m10s. Not great, but a noticeable improvement. Both report a total coverage of 88.7%. While here, do the same for GARBLE_CACHE_DIR, and use testscript.Env.Setenv for simplicity.	3 years ago
pagran	86b7e334ba	implement funcInfo.entryoff encryption At linker stage, we now encrypt funcInfo.entryoff value with a simple algorithm (1 xor + 1 mul). This makes it harder to relate function metadata (e.g. name) to function itself in binary, almost without affecting performance.	3 years ago
Daniel Martí	89b27fa7f9	tweaks thanks to code coverage info The test package hack still appears to be needed as of Go 1.20. Interestingly, the cgo filename check does not trigger at all anymore. Presumably this means that it's not needed at all, and obfuscating code generated by cgo appears to be fine. Go with that for now.	3 years ago

1 2 3 4 5 ...

590 Commits (5fddfe1e61cd7df1eac5ea34ba42c06ebb8ef577) All Branches Search

590 Commits (5fddfe1e61cd7df1eac5ea34ba42c06ebb8ef577)

All Branches