garble

Commit Graph

Author	SHA1	Message	Date
Daniel Martí	05d35350cf	record types into ignoreObjects more reliably Our previous logic only took care of fairly simple types, such as a simple struct or a pointer to a struct. If we had a struct embedding another struct, we'd fail to record the objects for the fields in the inner struct, and that would lead to miscompilation: > garble build [stderr] # test/main LZmt64Nm.go:7: outer.InnerField undefined (type *CcUt1wkQ.EmbeddingOuter has no field or method InnerField) To fix this issue, make the function that records all objects under a types.Type smarter. Since it now does more than just dealing with structs, it's also renamed. Since the function now walks types properly, we get to remove the extra ast.Inspect in recordReflectArgs, which is nice. We also make it a method, to avoid the map parameter. A boolean parameter is also added, since we need this feature to only look at the current package when looking at reflect calls. Finally, we add a test case, a simplified version of the original bug report. Fixes #315.	4 years ago
Daniel Martí	2fad0e1583	wrap types.Importer to canonicalize import paths The docs for go/importer.ForCompiler say: The lookup function is called each time the resulting importer needs to resolve an import path. In this mode the importer can only be invoked with canonical import paths (not relative or absolute ones); it is assumed that the translation to canonical import paths is being done by the client of the importer. We use a lookup func for two reasons: first, to support modules, and second, to be able to use our information from "go list -json -export". However, go/types does not canonicalize import paths before calling ImportFrom. This is somewhat understandable; it doesn't know whether an importer was created with a lookup func, and ImportFrom only requires the input path to be canonicalized in that scenario. When the lookup func is nil, the importer canonicalizes by itself via go/build.Import. Before this change, the added crossbuild test would fail: > garble build net/http [stderr] # vendor/golang.org/x/crypto/chacha20 typecheck error: /usr/lib/go/src/vendor/golang.org/x/crypto/chacha20/chacha_generic.go:10:2: could not import crypto/cipher (can't find import: "crypto/cipher") # vendor/golang.org/x/text/secure/bidirule typecheck error: /usr/lib/go/src/vendor/golang.org/x/text/secure/bidirule/bidirule.go:12:2: could not import errors (can't find import: "errors") # vendor/golang.org/x/crypto/cryptobyte typecheck error: /usr/lib/go/src/vendor/golang.org/x/crypto/cryptobyte/asn1.go:8:16: could not import encoding/asn1 (can't find import: "encoding/asn1") # vendor/golang.org/x/text/unicode/norm typecheck error: /usr/lib/go/src/vendor/golang.org/x/text/unicode/norm/composition.go:7:8: could not import unicode/utf8 (can't find import: "unicode/utf8") This is because we'd fall back to importer.Default, which only knows how to find packages in $GOROOT/pkg. Those are missing for cross-builds, unsurprisingly, as those built archives end up in the build cache. 
After this change, we properly support importing std-vendored packages, so we can get rid of the importer.Default workaround. And, by extension, cross-builds now work as well. Note that, in the added test script, the full build of the binary fails, as there seems to be some sort of linker problem: > garble build [stderr] # test/main d9rqJyxo.uoqIiDs5: relocation target runtime.os9A16A3 not defined We leave that as a TODO for now, as this change is subtle enough as it is.	4 years ago
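A minimal sketch of the wrapping idea described in this commit, not garble's actual code: an importer that maps each import path to its canonical form before delegating. The names `canonicalImporter`, `recordingImporter`, `resolvedPath`, and the sample import map are assumptions made for illustration.

```go
package main

import (
	"fmt"
	"go/types"
)

// canonicalImporter (hypothetical name) wraps another importer and rewrites
// import paths through an import map before delegating to it.
type canonicalImporter struct {
	inner     types.ImporterFrom
	importMap map[string]string // source import path -> canonical import path
}

// Import satisfies types.Importer by delegating to ImportFrom.
func (c canonicalImporter) Import(path string) (*types.Package, error) {
	return c.ImportFrom(path, "", 0)
}

// ImportFrom canonicalizes the path, then calls the wrapped importer.
func (c canonicalImporter) ImportFrom(path, dir string, mode types.ImportMode) (*types.Package, error) {
	if canonical, ok := c.importMap[path]; ok {
		path = canonical
	}
	return c.inner.ImportFrom(path, dir, mode)
}

// recordingImporter is a stand-in for a real importer; it only remembers
// which path it was asked to import.
type recordingImporter struct{ got *string }

func (r recordingImporter) Import(path string) (*types.Package, error) {
	return r.ImportFrom(path, "", 0)
}

func (r recordingImporter) ImportFrom(path, dir string, mode types.ImportMode) (*types.Package, error) {
	*r.got = path
	return types.NewPackage(path, "pkg"), nil
}

// resolvedPath reports which path the wrapped importer ends up seeing.
func resolvedPath(importMap map[string]string, path string) string {
	var got string
	imp := canonicalImporter{inner: recordingImporter{&got}, importMap: importMap}
	if _, err := imp.ImportFrom(path, "", 0); err != nil {
		panic(err)
	}
	return got
}

func main() {
	// Sample data, assumed for illustration only.
	importMap := map[string]string{
		"golang.org/x/crypto/chacha20": "vendor/golang.org/x/crypto/chacha20",
	}
	fmt.Println(resolvedPath(importMap, "golang.org/x/crypto/chacha20"))
	fmt.Println(resolvedPath(importMap, "crypto/cipher"))
}
```

Paths missing from the map pass through unchanged, so an importer wrapped this way behaves like the original for already-canonical paths.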
lu4p	1a9fdb4e8e	Fix calls to linkname functions (#314)	4 years ago
Daniel Martí	3afc993266	use "go env -json" to collect env info all at once In the worst case scenario, when GOPRIVATE isn't set at all, we would run these three commands: * "go env GOPRIVATE", to fetch GOPRIVATE itself * "go list -m", for GOPRIVATE's fallback * "go version", to check the version of Go being used Now that we support Go 1.16 and later, all these three can be obtained via "go env -json": $ go env -json GOPRIVATE GOMOD GOVERSION { "GOMOD": "/home/mvdan/src/garble/go.mod", "GOPRIVATE": "", "GOVERSION": "go1.16.3" } Note that we don't get the module path directly, but we can use the x/mod/modfile Go API to parse it from the GOMOD file cheaply. Notably, this also simplifies our Go version checking logic, as now we get just the version string without the "go version" prefix and "GOOS/GOARCH" suffix we don't care about. This makes our code a bit more maintainable and robust. When running a short incremental build, we can also see a small speed-up, as saving two "go" invocations can save a few milliseconds: name old time/op new time/op delta Build/Cache-8 168ms ± 0% 166ms ± 1% -1.26% (p=0.009 n=6+6) name old bin-B new bin-B delta Build/Cache-8 6.36M ± 0% 6.36M ± 0% +0.12% (p=0.002 n=6+6) name old sys-time/op new sys-time/op delta Build/Cache-8 222ms ± 2% 219ms ± 3% ~ (p=0.589 n=6+6) name old user-time/op new user-time/op delta Build/Cache-8 857ms ± 1% 846ms ± 1% -1.31% (p=0.041 n=6+6)	4 years ago
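As a rough illustration of the approach in this commit, the JSON printed by "go env -json GOPRIVATE GOMOD GOVERSION" can be decoded into a small struct in one step. The struct and function names below are hypothetical, and the sample input is the output quoted in the commit message rather than a live invocation of the go tool.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// goEnv (hypothetical name) holds the three variables mentioned in the commit.
type goEnv struct {
	GOPRIVATE string
	GOMOD     string
	GOVERSION string
}

// parseGoEnv decodes the JSON object printed by "go env -json".
func parseGoEnv(data []byte) (goEnv, error) {
	var env goEnv
	err := json.Unmarshal(data, &env)
	return env, err
}

func main() {
	// Sample output copied from the commit message above.
	sample := []byte(`{
		"GOMOD": "/home/mvdan/src/garble/go.mod",
		"GOPRIVATE": "",
		"GOVERSION": "go1.16.3"
	}`)
	env, err := parseGoEnv(sample)
	if err != nil {
		panic(err)
	}
	fmt.Println(env.GOVERSION, env.GOMOD)
}
```

In a real tool the bytes would come from running the go command, for example via os/exec, instead of a fixed sample.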
Daniel Martí	24d5ff362c	fix a regression involving imported linkname funcs In `ce2c45440a`, we simplified the code a bit and removed one call to obfuscatedTypesPackage. Unfortunately, we introduced a regression; if an exported function is linknamed to another symbol name, and it's called from an importer package, we would have a build failure now: > garble build [stderr] # test/main ZiOACuw7.go:1: undefined: ODC0xN52.BaDqbhkj This is because the imported package would not hash the original name, via its ignoreObjects logic. And, since the importer package has no access to that knowledge, it would hash the same name, and fail to find it in the final build. The regression happened because we used to have a types.Scope Lookup that saved us in this scenario. Add the test, and re-add the Lookup, this time only for this particular scenario with function names. Thanks to Andrew LeFevre for reporting and describing the test case. While at it, replace more uses of "garbled" to "obfuscated".	4 years ago
Daniel Martí	5de519694a	CI: pin a commit when testing against Go tip Since it changes rapidly, especially during merge cycles, and we don't want CI to surprisingly blow up in our faces from one day to another. Pin this to a commit from yesterday which works, since some changes merged today moved where the Go build version is recorded and broke garble. While at it, replace "git clone" with a wget of a source archive. This is much, much faster, mainly because a tarball is significantly smaller. We now download about 20MiB instead of over 350MiB. One downside is that, without git, make.bash can't construct a devel version on its own. For that reason, add a pretty basic manual version via the VERSION file. This means that we must not reject custom devel version strings. This is a good thing anyway, because custom devel strings are already common when building Go in custom ways. Those people tend to be advanced users, such as CI, so fall back to assuming they know what they are doing and don't error. Plus, starting last week, devel versions in Go master now contain the major Go version like in build tags, such as "go1.17-commit...", so we will soon start relying on that instead of parsing dates: $ go version go version devel go1.17-a7e16abb22 Thu Apr 8 07:33:58 2021 +0000 linux/amd64	4 years ago
Daniel Martí	6b1a062c6f	make -literals succeed on all of std Two bugs were remaining which made the build with -literals of std fail. First, we were ignoring too many objects in constant expressions, including type names. This resulted in type names declared in dependencies which were incorrectly not obfuscated in the current package: # go/constant O1ku7TCe.go:1: undefined: alzLJ5Fd.Word b0ieEGVQ.go:1: undefined: alzLJ5Fd.Word LEpgYKdb.go:4: undefined: alzLJ5Fd.Word FkhHJCfm.go:1: undefined: alzLJ5Fd.Word This edge case is easy to reproduce, so a test case is added to literals.txt. The second issue is trickier; in some packages like os/user, we would get syntax errors because of comments printed out of place: ../tip/os/user/getgrouplist_unix.go:35:130: syntax error: unexpected newline, expecting comma or ) This is a similar kind of error that we tried to fix with `e2f06cce94`. In particular, it's fixed by also setting CallExpr.Rparen in withPos. We also add many other missing Pos fields for good measure, even though we're not sure they help just yet. Unfortunately, all my attempts to minimize this into a reproducible failure have failed. We can't just copy the failing file from os/user, as it only builds on some OSs. It seems like it was the perfect mix of cgo (which adds line directive comments) plus unlucky positioning of literals. For that last reason, as well as for ensuring that -literals works well with a wide variety of software, we add a build of all of std with -literals when not testing with -short. This is akin to what we do in goprivate.txt, but with the -literals flag. This does make "go test" more expensive, but also more thorough. Fixes #285, hopefully for good this time.	4 years ago
Daniel Martí	e2f06cce94	set positions when using cursor.Replace The regular obfuscation process simply modifies some simple nodes, such as identifiers and strings. In those cases, we modify the nodes in-place, meaning that their positions remain the same. This hasn't caused any problems. Literal obfuscation is trickier. Since we replace one expression with an entirely different one, we use cursor.Replace. The new expression is entirely made up on the spot, so it lacks position information. This was causing problems. For example, in the added test input: > garble -literals build [stderr] # test/main dgcm4t6w.go:3: misplaced compiler directive dgcm4t6w.go:4: misplaced compiler directive dgcm4t6w.go:3: misplaced compiler directive dgcm4t6w.go:6: misplaced compiler directive dgcm4t6w.go:7: misplaced compiler directive dgcm4t6w.go:3: misplaced compiler directive dgcm4t6w.go:9: misplaced compiler directive dgcm4t6w.go:3: misplaced compiler directive dgcm4t6w.go:3: too many errors The build errors are because we'd move the compiler directives, which makes the compiler unhappy as they must be directly followed by a function declaration. The root cause there seems to be that, since the replacement nodes lack position information, go/printer would try to estimate its printing position by adding to the last known position. Since -literals adds code, this would result in the printer position increasing rapidly, and potentially printing directive comments earlier than needed. For now, making the replacement nodes have the same position as the original node seems to stop go/printer from making this mistake. It's possible that this workaround won't be bulletproof forever, but it works well for now, and I don't see a simpler workaround right now. It would be possible to use fancier mechanisms like go/ast.CommentMap or dave/dst, but those are a significant amount of added complexity as well. Fixes #285.	4 years ago
Daniel Martí	7294469a7f	testdata: reduce the cost of short tests Reduces "go test -short" with a warm build cache from ~9s to ~4s. The main offender was the use of "-a" in the "garble test" call; I think I added that flag to rebuild all packages when debugging an error, and forgot to remove it.	4 years ago
Daniel Martí	e94b8e3750	make "help" refuse arguments for now Since they are otherwise silently ignored.	4 years ago
Daniel Martí	1662fc93b4	make reverse error if no changes were made Just the exit status code is used. Printing an error would be confusing, since we're printing the input too. Plus, the error is binary anyway - either we changed something, or we did not. One could make a case for only printing if we changed anything, and showing an error otherwise. But that would mean buffering the entire input, which could be problematic or slow when e.g. reversing large log files.	4 years ago
Daniel Martí	fe095ef132	handle unknown flags in reverse (#290) While at it, expand the tests for build and test too.	4 years ago
Daniel Martí	1a8e32227f	improve "reverse" even further (#289) Fix up a few TODOs, and simplify the way we handle comments. We now add whitespace around inline /*line*/ directives, to ensure we don't break programs. A test case is added too. We now add line directives to call sites, not function declarations, since those are what actually shows up in stack traces. It's unclear if we care about any other lines inside functions at all. This also fixes reversing with -literals, since that feature adds a significant amount of code which shuffles line numbers around. Finally, we extend the tests with types, methods, and anonymous functions, and we make all of them work well. Updates #5.	4 years ago
Daniel Martí	10ec00b37a	make flags like -literals and GOPRIVATE affect hashing (#288 ) In `6898d61637`, we switched from using action IDs from "go list -toolexec=garble" to those from the original "go list". We still wanted the obfuscation and hashing to change if the version of garble changes, so we hashed that "original action ID" with garble's own content ID, and called the new hash "garble action ID". While working on a different patch, I noticed something weird: with the new mechanism, adding or removing flags like -literals did not alter those hashes, unlike the old method. This is because the old method used ownContentID, which includes such bits of information, but the new method does not. Change that, and add a test that locks in the behavior we want. In seed.txt, we check that a single function name gets hashed in particular ways in different scenarios. Note that we use a mix of "cmp" and "! bincmp", since the former has no negated form. While at it, the seed.txt test is revamped a bit. Now, we only run with -literals once, as this test is mainly about -seed. We also declare seed strings once, as environment variables, which makes it easier to track what each step is doing.	4 years ago
Daniel Martí	a8c5d534d1	support reversing stack trace positions (#287) In particular, the positions within function declarations, including the positions of call sites to other functions. Note that this isn't well tested just yet, particularly not with other features like -literals. We can extend the tests and code over time. This gets us the core basics. The issue will be closed once the feature is documented for users, in a follow-up PR. Updates #5.	4 years ago
Daniel Martí	ea19e39aa4	use hashWith for obfuscation position information Position information was obfuscated with math/rand manually, which meant that the resulting positions were pretty small like "x.go:34", but they were also very hard to reverse due to their short length and difficulty to reproduce. We now hash them with hashWith and the package's GarbleActionID: "main.go:203" hashed with 933ad1c700755b7c3a9913c55cade1 to "mwu1xuNz.go" The input to the hash is the base filename and the byte offset of the declaration within the file, meaning that it's unique within a package. The output filename is long enough to allow easy reversal. The line number is always 1, since the information needed for reversing is contained entirely within the filename. It doesn't really matter if we encode data in the filename or line number, but it's easier for us to use a string. For #5.	4 years ago
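The idea of deriving an obfuscated filename from a salt plus "filename:offset" could look roughly like the sketch below. This is not garble's hashWith: the function name `hashPosition`, the choice of SHA-256 with base64, and the eight-character length are all assumptions made for illustration.

```go
package main

import (
	"crypto/sha256"
	"encoding/base64"
	"fmt"
)

// hashPosition (hypothetical) hashes a salt together with the base filename
// and the byte offset of a declaration, and returns a short ".go" filename.
func hashPosition(salt []byte, filename string, offset int) string {
	h := sha256.New()
	h.Write(salt)
	fmt.Fprintf(h, "%s:%d", filename, offset)
	sum := h.Sum(nil)
	// Keep a short prefix of the encoded hash as the new base name.
	return base64.RawURLEncoding.EncodeToString(sum)[:8] + ".go"
}

func main() {
	salt := []byte("example-action-id") // assumed stand-in for a per-package ID
	fmt.Println(hashPosition(salt, "main.go", 203))
}
```

Because the same salt and input always give the same name, a reversing tool that knows the salt can recompute the hash for every declaration and match it against a stack trace.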
Daniel Martí	748c6a0538	obfuscate asm function names as well (#273) Historically, it was impossible to rename those funcs as the implementation was in assembly files, and we only transformed Go code. Now that transformAsm exists, it's only about fifty lines to do some very basic parsing and rewriting of assembly files. This fixes the obfuscated builds of multiple std packages, including a few dependencies of net/http, since they included assembly funcs which called pure Go functions. Those pure Go functions had their names obfuscated, breaking the call sites in assembly. Fixes #258. Fixes #261.	4 years ago
lu4p	810bdf448e	don't obfuscate the "embed" import path Updates #263	4 years ago
Daniel Martí	def9351b25	fix and re-enable "garble test" (#268 ) With the many refactors building up to v0.1.0, we broke "garble test" as we no longer dealt with test packages well. Luckily, now that we can depend on TOOLEXEC_IMPORTPATH, we can support the test command again, as we can always figure out what package we're currently compiling, without having to track a "main" package. Note that one major pitfall there is test packages, where TOOLEXEC_IMPORTPATH does not agree with ImportPath from "go list -json". However, we can still work around that with a bit of glue code, which is also copiously documented. The second change necessary is to consider test packages private depending on whether their non-test package is private or not. This can be done via the ForTest field in "go list -json". The third change is to obfuscate "_testmain.go" files, which are the code-generated main functions which actually run tests. We used to not need to obfuscate them, since test function names are never obfuscated and we used to not obfuscate import paths at compilation time. Now we do rewrite import paths, so we must do that for "_testmain.go" too. The fourth change is to re-enable test.txt, and expand it with more sanity checks and edge cases. Finally, document "garble test" again. Fixes #241.	4 years ago
Daniel Martí	b71ff42877	testdata: remove some unnecessary execs (#267) Our use of go-internal's gotooltest.Setup means that "go" is set up as a top-level command in the scripts, so we can simply "go build" instead of having to "exec go build". The result is practically the same, but the scripts are simpler. While at it, I had left an "exec cat" behind; remove it.	4 years ago
Daniel Martí	4e9ee17ec8	refactor "current package" with TOOLEXEC_IMPORTPATH (#266 ) Now that we've dropped support for Go 1.15.x, we can finally rely on this environment variable for toolexec calls, present in Go 1.16. Before, we had hacky ways of trying to figure out the current package's import path, mostly from the -p flag. The biggest rough edge there was that, for main packages, that was simply the package name, and not its full import path. To work around that, we had a restriction on a single main package, so we could work around that issue. That restriction is now gone. The new code is simpler, especially because we can set curPkg in a single place for all toolexec transform funcs. Since we can always rely on curPkg not being nil now, we can also start reusing listedPackage.Private and avoid the majority of repeated calls to isPrivate. The function is cheap, but still not free. isPrivate itself can also get simpler. We no longer have to worry about the "main" edge case. Plus, the sanity check for invalid package paths is now unnecessary; we only got malformed paths from goobj2, and we now require exact matches with the ImportPath field from "go list -json". Another effect of clearing up the "main" edge case is that -debugdir now uses the right directory for main packages. We also start using consistent debugdir paths in the tests, for the sake of being easier to read and maintain. Finally, note that commandReverse did not need the extra call to "go list -toolexec", as the "shared" call stored in the cache is enough. We still call toolexecCmd to get said cache, which should probably be simplified in a future PR. While at it, replace the use of the "-std" compiler flag with the Standard field from "go list -json".	4 years ago
Daniel Martí	ff0bea73b5	all: drop support for Go 1.15.x (#265) This mainly cleans up the few bits of code where we explicitly kept support for Go 1.15.x. With v0.1.0 released, we can drop support now, since the next v0.2.0 release will only support Go 1.16.x. Also updates all modules, including test ones, to 'go 1.16'. Note that the TOOLEXEC_IMPORTPATH refactor is not done here, despite all the TODOs about doing so when we drop 1.15 support. This is because that refactor needs to be done carefully and might have side effects, so it's best to keep it to a separate commit. Finally, update the deps.	4 years ago
Daniel Martí	1267e2eced	fix link errors when importing crypto/ecdsa (#262) First, we had some link errors such as: cannot find package J6OzO8GN (using -importcfg) This was caused by the code that writes an updated importcfg, which did not handle import maps well. That code is now fixed, and we also add an obfuscatedImportPath method for clarity. Once fixed, we ran into other link errors: Pw3g97ww.addVW: relocation target Pw3g97ww.addVWlarge not defined After some digging, the cause of those is assembly code that we do not yet support obfuscating. #261 tracks that. Meanwhile, to fix "GOPRIVATE=* garble build" and to be able to have a test for the original import path bug, we add the packages which use that form of assembly code to runtimeRelated - math/big and crypto/sha512. There might be more, but these were the ones found by trying to link crypto/tls, a fairly common dependency. Fixes #256.	4 years ago
Daniel Martí	99887c13c2	ignore -ldflags=-X flags mentioning unknown packages That would panic, since the *listedPackage would be nil for a package path we aren't aware of: panic: runtime error: invalid memory address or nil pointer dereference [signal SIGSEGV: segmentation violation code=0x1 addr=0x88 pc=0x126b57d] goroutine 1 [running]: main.transformLink.func1(0x7ffeefbff28b, 0x5d) mvdan.cc/garble@v0.0.0-20210302140807-b03cd08c0946/main.go:1260 +0x17d main.flagValueIter(0xc0000a8e20, 0x2f, 0x2f, 0x12e278e, 0x2, 0xc000129e28) mvdan.cc/garble@v0.0.0-20210302140807-b03cd08c0946/main.go:1410 +0x1e9 main.transformLink(0xc0000a8e20, 0x30, 0x36, 0x4, 0xc000114648, 0x23, 0x12dfd60, 0x0) mvdan.cc/garble@v0.0.0-20210302140807-b03cd08c0946/main.go:1241 +0x1b9 main.mainErr(0xc0000a8e10, 0x31, 0x37, 0x37, 0x0) mvdan.cc/garble@v0.0.0-20210302140807-b03cd08c0946/main.go:287 +0x389 main.main1(0xc000096058) mvdan.cc/garble@v0.0.0-20210302140807-b03cd08c0946/main.go:150 +0xe7 main.main() mvdan.cc/garble@v0.0.0-20210302140807-b03cd08c0946/main.go:83 +0x25 The linker ignores such unknown references, so we should too. Fixes #259.	4 years ago
lu4p	a397a8e94e	Literals: Skip constants with inferred values. Obfuscating literals broke constants with values inferred via iota before, because it would be moved to a variable declaration instead.	4 years ago
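For context, the Go snippet below shows the language feature involved: inside a const group, a spec without an expression repeats the previous one, and iota advances per spec. It only illustrates why such constants have no literal of their own to rewrite; it is not taken from garble's tests.

```go
package main

import "fmt"

const (
	first  = iota // explicit expression, value 0
	second        // no expression: repeats "iota", value 1
	third         // value 2
)

func main() {
	fmt.Println(first, second, third)
}
```

A var declaration has no equivalent of this implicit repetition, which is presumably why moving such specs into a var block broke them.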
Daniel Martí	c0c5a75454	testdata: fix flakiness of tiny.txt on Windows (#252) Sometimes it prints relatively low addresses, like: (0xbc200,0xdd658) We required 6 to 8 hex digits, and sometimes it printed 5. Make the regular expression more conservative. Fixes #184.	4 years ago
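To make the flakiness concrete, here is a small sketch with two hypothetical patterns: one requiring 6 to 8 hex digits and one accepting any number of them. Neither pattern is copied from tiny.txt; only the sample address pair comes from the commit message.

```go
package main

import (
	"fmt"
	"regexp"
)

// strictAddrs requires 6 to 8 hex digits per address (assumed old behavior).
var strictAddrs = regexp.MustCompile(`\(0x[0-9a-f]{6,8},0x[0-9a-f]{6,8}\)`)

// looseAddrs accepts one or more hex digits per address (assumed fix).
var looseAddrs = regexp.MustCompile(`\(0x[0-9a-f]+,0x[0-9a-f]+\)`)

func main() {
	line := "(0xbc200,0xdd658)" // five-digit addresses from the commit message
	fmt.Println(strictAddrs.MatchString(line), looseAddrs.MatchString(line))
}
```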
Daniel Martí	09e244986e	formally add support for Go 1.16 This was pretty much just fixing the README and closing the issue. The only other noteworthy user-facing change is that, if the Go version is detected to be too old, we now suggest 1.16.x instead of 1.15.x. While at it, refactor goversion.txt a bit. I wanted it to print a clearer "mocking the go build" error if another command was used like "go build", but I didn't want to learn BAT. So, instead use a simple Go program and build it, which will work on all platforms. The added "go build" step barely takes 100ms on my machine, given how simple the program is. The [short] line also doesn't seem necessary to me. The entire script runs in under 200ms for me, so it's well within the realm of "short", at least compared to many of the other test scripts. Fixes #124.	4 years ago
Andrew LeFevre	081cba38d7	strip a few more unneeded runtime functions Strip a few more traceback printing related functions in the runtime, new for Go 1.16. Also test that GODEBUG=inittrace=1 does not print anything when -tiny is passed.	4 years ago
Daniel Martí	2ee6604408	replace the -p path when assembling (#247) The asm tool runs twice for a package with assembly. The second time it does, the path given to the -p flag matters, just like in the compiler, as we generate an object file. We don't have a -buildid flag in the asm tool, so obtaining the action ID to obfuscate the package path with is a bit tricky. We store it from transformCompile, and read it from transformAsm. See the detailed docs for more. This was the last "skip" line in the tests due to Go 1.16. After all PRs are merged, one last PR documenting that 1.16 is supported will be sent, closing the issue for good. It's unclear why this wasn't an issue in Go 1.15. My best guess is that the ABI changes only happened in Go 1.16, and this causes exported asm funcs to start showing up in object files with their package paths. Updates #124.	4 years ago
Daniel Martí	5e3ba2fc09	update the list of runtime-related packages for 1.16 (#246) With a few extra lines, we can keep Go 1.15 support in the table too. Re-enables the goprivate.txt test for Go 1.16. While at it, make the script's use of grep a bit simpler with -E, which also uses the same syntax as Go's regexp. Its skip logic was also buggy, resulting in the macos results always being empty. Updates #124.	4 years ago
Daniel Martí	89dbdb69a1	start working on Go 1.16 support (#244) There are three minor bugs breaking Go 1.16 with the current version of garble, after the import path obfuscation refactor: 1) Stripping the runtime results in an unused import error. This PR fixes that. 2) The asm.txt test seems to be broken; something to do with the export data not being right for the exported assembly func. 3) The obfuscated build of std fails, since our runtimeRelated table was generated for Go 1.15, not 1.16. This PR fixes the first issue, adds conditional skip lines for 1.16 for the other two issues, and enables 1.16 on CI. Note that 1.16 support is not here just yet, because of the other two issues. As such, no doc changes. Updates #124.	4 years ago
Daniel Martí	180d64236a	properly fix obfuscated imports with their package names (#245) The TODO I left there didn't take long to surface as a bug. If the package path ends with a word containing a hyphen, that's not a valid identifier, so we end up with invalid Go syntax. Add that test case, as well as one where an import was already named. To fix the issue, we just need to use the package name we got from 'go list -json'. Fixes #243.	4 years ago
Daniel Martí	05d0dd1801	reimplement import path obfuscation without goobj2 (#242 ) We used to rely on a parallel implementation of an object file parser and writer to be able to obfuscate import paths. After compiling each package, we would parse the object file, replace the import paths, and write the updated object file in-place. That worked well, in most cases. Unfortunately, it had some flaws: * Complexity. Even when most of the code is maintained in a separate module, the import_obfuscation.go file was still close to a thousand lines of code. * Go compatibility. The object file format changes between Go releases, so we were supporting Go 1.15, but not 1.16. Fixing the object file package to work with 1.16 would probably break 1.15 support. * Bugs. For example, we recently had to add a workaround for #224, since import paths containing dots after the domain would end up escaped. Another example is #190, which seems to be caused by the object file parser or writer corrupting the compiled code and causing segfaults in some rare edge cases. Instead, let's drop that method entirely, and force the compiler and linker to do the work for us. The steps necessary when compiling a package to obfuscate are: 1) Replace its "package foo" lines with the obfuscated package path. No need to separate the package path and name, since the obfuscated path does not contain slashes. 2) Replace the "-p pkg/foo" flag with the obfuscated path. 3) Replace the "import" spec lines with the obfuscated package paths, for those dependencies which were obfuscated. 4) Replace the "-importcfg [...]" file with a version that uses the obfuscated paths instead. The linker also needs that last step, since it also uses an importcfg file to find object files. There are three noteworthy drawbacks to this new method: 1) Since we no longer write object files, we can't use them to store data to be cached. As such, the -debugdir flag goes back to using the "-a" build flag to always rebuild all packages. 
On the plus side, that caching didn't work very well; see #176. 2) The package name "main" remains in all declarations under it, not just "func main", since we can only rename entire packages. This seems fine, as it gives little information to the end user. 3) The -tiny mode no longer sets all lines to 0, since it did that by modifying object files. As a temporary measure, we instead set all top-level declarations to be on line 1. A TODO is added to hopefully improve this again in the near future. The upside is that we get rid of all the issues mentioned before. Plus, garble now nearly works with Go 1.16, with the exception of two very minor bugs that look fixable. A follow-up PR will take care of that and start testing on 1.16. Fixes #176. Fixes #190.	4 years ago
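Step 4 above, rewriting the importcfg file, might be sketched as follows. This is a simplified illustration that only handles "packagefile path=file" lines; the function name `rewriteImportcfg` and the sample paths are assumptions, and it is not garble's implementation.

```go
package main

import (
	"fmt"
	"strings"
)

// rewriteImportcfg (hypothetical) replaces the import path in each
// "packagefile path=file" line when an obfuscated path is known for it.
// All other lines are kept as they are.
func rewriteImportcfg(cfg string, obfuscated map[string]string) string {
	const prefix = "packagefile "
	var lines []string
	for _, line := range strings.Split(cfg, "\n") {
		if strings.HasPrefix(line, prefix) {
			rest := line[len(prefix):]
			if i := strings.Index(rest, "="); i >= 0 {
				if newPath, ok := obfuscated[rest[:i]]; ok {
					line = prefix + newPath + rest[i:]
				}
			}
		}
		lines = append(lines, line)
	}
	return strings.Join(lines, "\n")
}

func main() {
	// Sample importcfg content with made-up file locations.
	cfg := "# import config\npackagefile test/main/lib=/tmp/b002/_pkg_.a\npackagefile fmt=/tmp/b003/_pkg_.a"
	fmt.Println(rewriteImportcfg(cfg, map[string]string{"test/main/lib": "J6OzO8GN"}))
}
```

Packages that were not obfuscated, such as fmt in the sample, keep their original paths, so the compiler and linker can still find them.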
Daniel Martí	e2a32634a6	simplify, improve, and test line obfuscation (#239) First, remove the shuffling of the declarations list within each file. This is what we used at the very start to shuffle positions. Ever since we started obfuscating positions via //line comments, that has been entirely unnecessary. Second, add a proper test that will fail if we don't obfuscate line numbers well enough. Filenames were already decently covered by other tests. Third, simplify the line obfuscation code. It does not require astutil.Apply, and ranging over file.Decls is easier. Finally, also obfuscate the position of top-level vars, since we only used to do it for top-level funcs. Without that fix, the test would fail as varLines was unexpectedly sorted.	4 years ago
Daniel Martí	63c42c3cc7	support typechecking all of std (#236) There was one bug keeping the command below from working: GOPRIVATE='' garble build std The bug is rather obscure; I'm still working on a minimal reproducer that I can submit upstream, and I'm not yet convinced about where the bug lives and how it can be fixed. In short, the command would fail with: typecheck error: /go/src/crypto/ecdsa/ecdsa.go:122:12: cannot use asn1.SEQUENCE (constant 48 of type asn1.Tag) as asn1.Tag value in argument to b.AddASN1 Note that the error is ambiguous; there are two asn1 packages, but they are actually mismatching. We can see that by manually adding debug prints to go/types: constant: asn1.SEQUENCE (constant 48 of type golang.org/x/crypto/cryptobyte/asn1.Tag) argument type: vendor/golang.org/x/crypto/cryptobyte/asn1.Tag It's clear that, for some reason, go/types ends up confused and loading a vendored and non-vendored version of asn1. There also seems to be no way to work around this with our lookup function, as it just receives an import path as a parameter, and returns an object file reader. For now, work around the issue by not using a custom lookup function in this rare edge case involving vendored dependencies in std packages. The added code has a lengthy comment explaining the reasoning. I still intend to investigate this further, but there's no reason to keep garble failing if we can work around the bug. Fixes #223.	4 years ago
Daniel Martí	2fa5697189	avoid panic on funcs that almost look like tests (#235) The added test case used to crash garble: panic: runtime error: invalid memory address or nil pointer dereference [recovered] panic: runtime error: invalid memory address or nil pointer dereference [signal SIGSEGV: segmentation violation code=0x1 addr=0x8 pc=0x8fe71b] goroutine 1 [running]: golang.org/x/tools/go/ast/astutil.Apply.func1(0xc0001e8880, 0xc000221570) /go/pkg/mod/golang.org/x/tools@v0.0.0-20210115202250-e0d201561e39/go/ast/astutil/rewrite.go:47 +0x97 panic(0x975bc0, 0xd6c610) /sdk/go1.15.8/src/runtime/panic.go:969 +0x1b9 go/types.(*Named).Obj(...) /sdk/go1.15.8/src/go/types/type.go:473 mvdan.cc/garble.isTestSignature(0xc0001e7080, 0xa02e84) /src/garble/main.go:1170 +0x7b mvdan.cc/garble.(*transformer).transformGo.func2(0xc000122df0, 0xaac301) /src/garble/main.go:1028 +0xff1 We were assuming that the first parameter was a named type, but that might not be the case. This crash was found out in the wild, from which a minimal repro was written. We add two variants of it to the test data, just in case.	4 years ago
Daniel Martí	e33179d480	reverse: support unexported names and package paths (#233 ) Unexported names are a bit tricky, since they are not listed in the export data file. Perhaps unsurprisingly, it's only meant to expose exported objects. One option would be to go back to adding an extra header to the export data file, containing the unexported methods in a map[string]T or []string. However, we have an easier route: just parse the Go files and look up the names directly. This does mean that we parse the Go files every time "reverse" runs, even if the build cache is warm, but that should not be an issue. Parsing Go files without any typechecking is very cheap compared to everything else we do. Plus, we save having to load go/types information from the build cache, or having to load extra headers from export files. It should be noted that the obfuscation process does need type information, mainly to be careful about which names can be obfuscated and how they should be obfuscated. Neither is a worry here; all names belong to a single package, and it doesn't matter if some aren't actually obfuscated, since the string replacements would simply never trigger in practice. The test includes an unexported func, to test the new feature. We also start reversing the obfuscation of import paths. Now, the test's reverse output is as follows: goroutine 1 [running]: runtime/debug.Stack(0x??, 0x??, 0x??) runtime/debug/stack.go:24 +0x?? test/main/lib.ExportedLibFunc(0x??, 0x??, 0x??, 0x??) p.go:6 +0x?? main.unexportedMainFunc(...) C.go:2 main.main() z.go:3 +0x?? The only major missing feature is positions and filenames. A follow-up PR will take care of those. Updates #5.	4 years ago
Daniel Martí	a499a6bcd7	testdata: simplify goversion test (#232 ) As per the TODO, just set up the files in the right places.	4 years ago
Daniel Martí	af517a20f8	make the handling of import paths more robust First, make isPrivate panic on malformed import paths, since that should never happen. This catches the errors that some users had run into with packages like gopkg.in/yaml.v2 and github.com/satori/go.uuid: panic: malformed import path "gopkg.in/garbletest%2ev2": invalid char '%' This seems to trigger when a module path contains a dot after the first element, and that module is fetched via the proxy. This results in the toolchain URL-encoding the second dot, and garble ends up seeing that encoded path. We reproduce this behavior with a fake gopkg.in module added to the test module proxy. Using yaml.v2 directly would have been easier, but it's pretty large. Note that we tried a replace directive, but that does not trigger the URL-encoding bug. Also note that we do not obfuscate the gopkg.in package; that's fine, as the isPrivate path validity check catches the bug either way. For now, make initImport use url.PathUnescape to work around this issue. The underlying bug is likely in either the goobj2 fork, or in the upstream Go toolchain itself. hashImport also gives a better error if it cannot find a package now, rather than just an "empty seed" panic. Finally, the sanity check in isPrivate unearthed the fact that we do not support garbling test packages at all, since they were invalid paths which never matched GOPRIVATE. Add an explicit check and TODO about that. Fixes #224. Fixes #228.	4 years ago
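The url.PathUnescape workaround mentioned in this commit can be seen in isolation below. This is a minimal sketch: the wrapper name unescapeImportPath is hypothetical and does not reflect how initImport is actually structured; only the net/url call and the example path come from the message above.

```go
package main

import (
	"fmt"
	"net/url"
)

// unescapeImportPath (hypothetical name) undoes URL-encoding in an import
// path, turning an encoded dot such as "%2e" back into ".".
func unescapeImportPath(path string) (string, error) {
	return url.PathUnescape(path)
}

func main() {
	got, err := unescapeImportPath("gopkg.in/garbletest%2ev2")
	if err != nil {
		panic(err)
	}
	fmt.Println(got) // gopkg.in/garbletest.v2
}
```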
Daniel Martí	79c775e218	obfuscate unexported names like exported ones (#227 ) In `90fa325da7`, the obfuscation logic was changed to use hashes for exported names, but incremental names starting at just one letter for unexported names. Presumably, this was done for the sake of binary size. I argue that this is not a good idea for the default mode for a number of reasons: 1) It makes reversing of stack traces nearly impossible for unexported names, since replacing an obfuscated name "c" with "originalName" would trigger too many false positives by matching single characters. 2) Exported and unexported names aren't different. We need to know how names were obfuscated at a later time in both cases, thanks to use cases like -ldflags=-X. Using short names for one but not the other doesn't make a lot of sense, and makes the logic inconsistent. 3) Shaving off three bytes for unexported names doesn't seem like a huge deal for the default mode, when we already have -tiny to optimize for size. This saves us a bit of work, but most importantly, simplifies the obfuscation state as we no longer need to carry privateNameMap between the compile and link stages. name old time/op new time/op delta Build-8 153ms ± 2% 150ms ± 2% ~ (p=0.065 n=6+6) name old bin-B new bin-B delta Build-8 7.09M ± 0% 7.08M ± 0% -0.24% (p=0.002 n=6+6) name old sys-time/op new sys-time/op delta Build-8 296ms ± 5% 277ms ± 6% -6.50% (p=0.026 n=6+6) name old user-time/op new user-time/op delta Build-8 562ms ± 1% 558ms ± 3% ~ (p=0.329 n=5+6) Note that I do not oppose using short names for both exported and unexported names in the future for -tiny, since reversing of stack traces will by design not work there. The code can be resurrected from the git history if we want to improve -tiny that way in the future, as we'd need to store state in header files again. Another major cleanup we can do here is to no longer use the garbledImports map. 
From a look at obfuscateImports, we hash a package's import path with its action ID, much like exported names, so we can simply re-do that hashing for the linker's -X flag. garbledImports does have some logic to handle duplicate package names, but it's worth noting that should not affect package paths, as they are always unique. That area of code could probably do with some simplification in the future, too. While at it, make hashWith panic if either parameter is empty. obfuscateImports was hashing the main package path without a salt due to a bug, so we want to catch those in the future. Finally, make some tiny spacing and typo tweaks to the README.	4 years ago
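To illustrate why salted hashing removes the need to carry state between the compile and link stages, here is a rough sketch. It is not garble's real hashWith: the function name hashWithSketch, the use of SHA-256, the hex encoding, the "z" prefix, and the output length are all assumptions made for the example. What it shares with the description above is that the result depends only on the salt (standing in for an action ID) and the name, and that empty inputs panic.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// hashWithSketch is a hypothetical stand-in for a salted name hash. It panics
// on an empty salt or name, so a missing salt is caught early instead of
// silently producing an unsalted hash.
func hashWithSketch(salt []byte, name string) string {
	if len(salt) == 0 {
		panic("hashWithSketch: empty salt")
	}
	if name == "" {
		panic("hashWithSketch: empty name")
	}
	h := sha256.New()
	h.Write(salt)
	h.Write([]byte(name))
	sum := h.Sum(nil)
	// Prefix with a letter so the result is always a valid Go identifier.
	return "z" + hex.EncodeToString(sum[:4])
}

func main() {
	salt := []byte("example-action-id") // placeholder, not a real action ID
	atCompile := hashWithSketch(salt, "ExportedName")
	atLink := hashWithSketch(salt, "ExportedName")
	// The same inputs always give the same obfuscated name, so a later
	// stage can recompute it instead of loading a stored map.
	fmt.Println(atCompile == atLink) // true
}
```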
Daniel Martí	d8e8738216	initial support for reversing panic output (#225 ) For now, this only implements reversing of exported names which are hashed with action IDs. Many other kinds of obfuscation, like positions and private names, are not yet implemented. Note that we don't document this new command yet on purpose, since it's not finished. Some other minor cleanups were done for future changes, such as making transformLineInfo into a method that also receives the original filename, and making header names more self-describing. Updates #5.	4 years ago
Daniel Martí	39d60f91e6	include a "garble version" test (#221 ) Not sure why the last commit came with none. While at it, I noticed that the version command would ignore arguments. Error instead.	4 years ago
Daniel Martí	1db6e1e230	make -coverprofile include toolexec processes (#216 ) testscript already included magic to also account for commands in the total code coverage. That does not happen with plain tests, since those only include coverage from the main test process. The main problem was that, before, indirectly executed commands did not properly save their coverage profile anywhere for testscript to collect it at the end. In other words, we only collected coverage from direct garble executions like "garble -help", but not indirect ones like "go build -toolexec=garble". $ go test -coverprofile=cover.out PASS coverage: 3.6% of statements total coverage: 16.6% of statements ok mvdan.cc/garble 6.453s After the delicate changes to testscript, any direct or indirect executions of commands all go through $PATH and properly count towards the total coverage: $ go test -coverprofile=cover.out PASS coverage: 3.6% of statements total coverage: 90.5% of statements ok mvdan.cc/garble 33.258s Note that we can also get rid of our code to set up $PATH, since testscript now does it for us. goversion.txt needed minor tweaks, since we no longer set up $WORK/.bin. Finally, note that we disable the reuse of $GOCACHE when collecting coverage information. This is to do "full builds", as otherwise the cached package builds would result in lower coverage. Fixes #35.	4 years ago
Andrew LeFevre	2d720ae155	fixed some bugs related to additional linkname corner cases (#210 )	4 years ago
Daniel Martí	835f4aadf3	testdata: add test case for init funcs in many files (#208 ) We tested that init funcs within a single file retain their order, but not when they are split between many files. Add one such case, with ten files and the same global-var-append mechanism. While at it, add some docs and make each init func take just one line.	4 years ago
Daniel Martí	ba19a1d49c	do not try to obfuscate huge literals (#204 ) It's common for asset bundling code generators to produce huge literals, for example in strings. Our literal obfuscators are meant for relatively small string-like literals that a human would write, such as URLs, file paths, and English text. I ran some quick experiments, and it seems like "garble build -literals" appears to hang trying to obfuscate literals starting at 5-20KiB. It's not really hung; it's just doing a lot of busy work obfuscating those literals. The code it produces is also far from ideal, so it also takes some time to finally compile. The generated code also led to crashes. For example, using "garble build -literals -tiny" on a package containing literals of over a megabyte, our use of asthelper to remove comments and shuffle line numbers could run out of stack memory. This all points in one direction: we never designed "-literals" to deal with large sizes. Set a source-code-size limit of 2KiB. We alter the literals.txt test as well, to include a few 128KiB string literals. Before this fix, "go test" would seemingly hang on that test for over a minute (I did not wait any longer). With the fix, those large literals are not obfuscated, so the test ends in its usual 1-3s. As said in the const comment, I don't believe any of this is a big problem. Come Go 1.16, most developers should stop using asset-bundling code generators and use go:embed instead. If we wanted to somehow obfuscate those, it would be an entirely separate feature. And, if someone wants to work on obfuscating truly large literals for any reason, we need good tests and benchmarks to ensure garble does not consume CPU for minutes or run out of memory. I also simplified the generate-literals test command. The only argument that matters to the script is the filename, since it's used later on. Fixes #178.	4 years ago
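A size check like the one described in this commit could look roughly as follows. The names maxLiteralSize and shouldObfuscate are hypothetical, and whether the real limit is inclusive or exclusive at exactly 2KiB is an assumption; only the 2KiB figure and the 128KiB test literals come from the message above.

```go
package main

import (
	"fmt"
	"strings"
)

// maxLiteralSize mirrors the 2KiB source-code-size limit from the commit
// message; the constant name is hypothetical.
const maxLiteralSize = 2 << 10

// shouldObfuscate reports whether a literal, given as it appears in source
// code, is small enough to be worth obfuscating. Treating exactly 2KiB as
// allowed is an assumption.
func shouldObfuscate(litSource string) bool {
	return len(litSource) <= maxLiteralSize
}

func main() {
	small := `"https://example.com/path"`
	huge := `"` + strings.Repeat("x", 128<<10) + `"` // a 128KiB string literal
	fmt.Println(shouldObfuscate(small), shouldObfuscate(huge)) // true false
}
```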
Daniel Martí	249501b5e9	fix garbling names belonging to indirect imports (#203 ) main.go includes a lengthy comment that documents this edge case, why it happened, and how we are fixing it. To summarize, we should no longer error with a build error in those cases. Read the comment for details. A few other minor changes were done to allow writing this patch. First, the actionID and contentID funcs were renamed, since they started to collide with variable names. Second, the logging has been improved a bit, which allowed me to debug the issue. Third, the "cache" global shared by all garble sub-processes now includes the necessary parameters to run "go list -toolexec", including the path to garble and the build flags being used. Thanks to lu4p for writing a test case, which also applied gofmt to that testdata Go file. Fixes #180. Closes #181, since it includes its test case.	4 years ago
Daniel Martí	c0731921c2	rewrite go:linkname directives with garbled names (#200) If code includes a linkname directive pointing at a name in an imported package, like: //go:linkname localName importedpackage.RemoteName func localName() We should rewrite the comment to replace "RemoteName" with its obfuscated counterpart, if the package in question was obfuscated and that name was as well. We already had some code to handle linkname directives, but only to ensure that "localName" was never obfuscated. This behavior is kept, to ensure that the directive applies to the right name. In the future, we could instead rewrite "localName" in the directive, like we do with "RemoteName". Add plenty of tests, too. The linkname directive used to be tested in imports.txt and syntax.txt, but that was hard to maintain as each file tested different edge cases. Now that we have build caching, adding one extra testscript file isn't a big problem anymore. Add linkname.txt, which is self-explanatory. The other two scripts also get a bit less complex. Fixes #197.	4 years ago
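The directive rewrite described in this commit can be sketched as plain string handling. This is an illustration under assumptions, not garble's implementation: rewriteLinkname and the obfuscated map (keyed by "pkg.Name") are hypothetical, the obfuscated name is made up, and targets such as methods are ignored. As in the message above, only the remote name is replaced and "localName" is left alone.

```go
package main

import (
	"fmt"
	"strings"
)

// rewriteLinkname (hypothetical helper) rewrites the remote name in a
// "//go:linkname local pkg.Remote" directive. obfuscated maps "pkg.Remote"
// to the obfuscated name. Anything else is returned unchanged: lines that
// are not three-field linkname directives, and targets missing from the map.
func rewriteLinkname(line string, obfuscated map[string]string) string {
	fields := strings.Fields(line)
	if len(fields) != 3 || fields[0] != "//go:linkname" {
		return line
	}
	target := fields[2]
	dot := strings.LastIndex(target, ".")
	if dot < 0 {
		return line
	}
	newName, ok := obfuscated[target]
	if !ok {
		return line // package or name was not obfuscated
	}
	fields[2] = target[:dot+1] + newName
	return strings.Join(fields, " ")
}

func main() {
	names := map[string]string{"importedpackage.RemoteName": "zAbc123"}
	fmt.Println(rewriteLinkname("//go:linkname localName importedpackage.RemoteName", names))
	// //go:linkname localName importedpackage.zAbc123
	fmt.Println(rewriteLinkname("//go:linkname localName other.Name", names))
	// //go:linkname localName other.Name
}
```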
Daniel Martí	4f65e6f99c	testdata: avoid messing with the host's mod cache (#198 ) It turns out that the modules we include in testdata/mod via txtar-addmod don't result in the same h1 hash that one gets when using proxy.golang.org. As proof: $ go clean -modcache && go get -d rsc.io/quote@v1.5.2 && go test -short [...] --- FAIL: TestScripts/imports (0.06s) > garble build -tags buildtag go list error: exit status 1: verifying rsc.io/quote@v1.5.2: checksum mismatch downloaded: h1:w5fcysjrx7yqtD/aO+QwRjYZOKnaM9Uh2b40tElTs3Y= go.sum: h1:3fEykkD9k7lYzXqCYrwGAf7iNhbk4yCjHmKBN9td4L0= The added comment explains the situation in detail. For now, simply work around the issue by not sharing GOMODCACHE with the host.	4 years ago
Daniel Martí	c9deff810b	obfuscate fewer std packages (#196 ) Previously, we were never obfuscating runtime and its direct dependencies. Unfortunately, due to linkname, the runtime package is actually closely related to dozens of other std packages as well. Until we can obfuscate the runtime and properly support go:linkname directives, obfuscating fewer std packages is a better outcome than breaking and not producing any obfuscated code at all. The added test case is building runtime/pprof, which used to cause failures: # runtime/pprof /go/src/runtime/pprof/label.go:27:21: undefined: context.Context /go/src/runtime/pprof/label.go:59:21: undefined: context.Context /go/src/runtime/pprof/label.go:93:16: undefined: context.Context /go/src/runtime/pprof/label.go:101:20: undefined: context.Context The net package was also very close to obfuscating properly thanks to this change, so its test is now run as well. The only other remaining fix was to not obfuscate fields on cgo types, since those aren't obfuscated at the moment. The map is pretty long, but it's only a temporary solution and the command to obtain the list again is included. Never obfuscating the entire std library is also an option, but it's a bit unnecessary. Fixes #134.	4 years ago

162 Commits (05d35350cf3bd40b3ca4c8266843d98a0cab823b)