garble

Commit Graph

Author	SHA1	Message	Date
Daniel Martí	af517a20f8	make the handling of import paths more robust First, make isPrivate panic on malformed import paths, since that should never happen. This catches the errors that some users had run into with packages like gopkg.in/yaml.v2 and github.com/satori/go.uuid: panic: malformed import path "gopkg.in/garbletest%2ev2": invalid char '%' This seems to trigger when a module path contains a dot after the first element, and that module is fetched via the proxy. This results in the toolchain URL-encoding the second dot, and garble ends up seeing that encoded path. We reproduce this behavior with a fake gopkg.in module added to the test module proxy. Using yaml.v2 directly would have been easier, but it's pretty large. Note that we tried a replace directive, but that does not trigger the URL-encoding bug. Also note that we do not obfuscate the gopkg.in package; that's fine, as the isPrivate path validity check catches the bug either way. For now, make initImport use url.PathUnescape to work around this issue. The underlying bug is likely in either the goobj2 fork, or in the upstream Go toolchain itself. hashImport also gives a better error if it cannot find a package now, rather than just an "empty seed" panic. Finally, the sanity check in isPrivate unearthed the fact that we do not support garbling test packages at all, since they were invalid paths which never matched GOPRIVATE. Add an explicit check and TODO about that. Fixes #224. Fixes #228.	3 years ago
Daniel Martí	79c775e218	obfuscate unexported names like exported ones (#227 ) In `90fa325da7`, the obfuscation logic was changed to use hashes for exported names, but incremental names starting at just one letter for unexported names. Presumably, this was done for the sake of binary size. I argue that this is not a good idea for the default mode for a number of reasons: 1) It makes reversing of stack traces nearly impossible for unexported names, since replacing an obfuscated name "c" with "originalName" would trigger too many false positives by matching single characters. 2) Exported and unexported names aren't different. We need to know how names were obfuscated at a later time in both cases, thanks to use cases like -ldflags=-X. Using short names for one but not the other doesn't make a lot of sense, and makes the logic inconsistent. 3) Shaving off three bytes for unexported names doesn't seem like a huge deal for the default mode, when we already have -tiny to optimize for size. This saves us a bit of work, but most importantly, simplifies the obfuscation state as we no longer need to carry privateNameMap between the compile and link stages. name old time/op new time/op delta Build-8 153ms ± 2% 150ms ± 2% ~ (p=0.065 n=6+6) name old bin-B new bin-B delta Build-8 7.09M ± 0% 7.08M ± 0% -0.24% (p=0.002 n=6+6) name old sys-time/op new sys-time/op delta Build-8 296ms ± 5% 277ms ± 6% -6.50% (p=0.026 n=6+6) name old user-time/op new user-time/op delta Build-8 562ms ± 1% 558ms ± 3% ~ (p=0.329 n=5+6) Note that I do not oppose using short names for both exported and unexported names in the future for -tiny, since reversing of stack traces will by design not work there. The code can be resurrected from the git history if we want to improve -tiny that way in the future, as we'd need to store state in header files again. Another major cleanup we can do here is to no longer use the garbledImports map. From a look at obfuscateImports, we hash a package's import path with its action ID, much like exported names, so we can simply re-do that hashing for the linker's -X flag. garbledImports does have some logic to handle duplicate package names, but it's worth noting that should not affect package paths, as they are always unique. That area of code could probably do with some simplification in the future, too. While at it, make hashWith panic if either parameter is empty. obfuscateImports was hashing the main package path without a salt due to a bug, so we want to catch those in the future. Finally, make some tiny spacing and typo tweaks to the README.	3 years ago
Daniel Martí	d8e8738216	initial support for reversing panic output (#225 ) For now, this only implements reversing of exported names which are hashed with action IDs. Many other kinds of obfuscation, like positions and private names, are not yet implemented. Note that we don't document this new command yet on purpose, since it's not finished. Some other minor cleanups were done for future changes, such as making transformLineInfo into a method that also receives the original filename, and making header names more self-describing. Updates #5.	3 years ago
Daniel Martí	78b69bbdab	share a single temporary directory between all processes Each compile and link sub-process created its own temporary directory, to be cleaned up shortly after. Moreover, we also had the global gob-encoded temporary file. Instead, place all of those under a single, start-to-end temporary directory. This is cleaner for the end user, and easier to maintain for us. A big plus is that we can also get rid of the confusing deferred global, as it was mostly used to clean up these extra temp dirs. The only remaining use was post-compile code, which is now an explicit func returned by each "transform" func. While at it, clean up the math/rand seeding code a bit and add a debug log line, and stop shadowing a cmd string with a cmd *exec.Cmd. Fixes #147.	3 years ago
Daniel Martí	f667a7ad31	all: use better names than "blacklist", and docs (#206 ) The three transformer map fields are now very well documented, which was badly needed for anyone trying to understand the source code. ignoreObjects is also a better field name than blacklist, as it says what the map is indexed by (types.Object) and what we do with those: ignore them when we obfuscate code. The rewriting of go:linkname directives is moved to a separate func, so that we can name that func from the docs. Finally, the docs are overall improved a bit, as I was re-tracing all the pieces of code that used the ambiguous "blacklist" terminology. Fixes #169.	4 years ago
lu4p	2e2bd09b5e	Simplify maps to boolean value	4 years ago
Andrew LeFevre	6857aa1426	fix 2 small bugs in import obfuscation (#195 ) The first bug fixed is not garbling package names that don't contain any code. For example, given the import path "github.com/foo/bar", "github.com" was treated as a package name that should be garbled, which doesn't make sense. The other bug was incorrectly matching private package names inside import paths. Before, if "internal" was an imported package matched by GOPRIVATE, but "internal/foo" was not, any instance of "internal/foo" would still be garbled as "<garbled>/foo", which is incorrect.	4 years ago
lu4p	cf290b8e6d	Share data between processes via a shared file. (#192 ) Previously garble heavily used env vars to share data between processes. This also makes it easy to share complex data between processes. The complexity of main.go is considerably reduced.	4 years ago
Daniel Martí	dfa622fe50	simplify globals, split hash.go (#191 ) The previous globals worked, but were unnecessarily complex. For example, we passed the fromPath variable around, but it's really a static global, since we only compile or link a single package in each Go process. Use such global variables instead of passing them around, which currently include the package's import path, its build ID, and its import config path. Also split all the hashing and build ID code into hash.go, since that's a relatively well contained 200 lines of code that doesn't need to make main.go any bigger. We also split the code to alter Go's own version to a separate function, so that it can be moved out of main.go as well.	4 years ago
Andrew LeFevre	8c03afee95	Use latest Binject/debug version to support importmap directives, fixes #146 (#189 ) * Use latest Binject/debug version to support importmap directives in the importcfg file * Uncomment line in goprivate testscript to test ImportMap * Fixed issue where a package in specified in importmap would be hashed differently in a package that imported it, due to the mapping of import paths. Also commented out the 'net' import in the goprivate testscript (again) due to cgo compile errors	4 years ago
Andrew LeFevre	523cc4454f	Hash identical package names from different package paths differently (#172 )	4 years ago
Andrew LeFevre	047aa254e2	properly remove all filenames when -tiny is passed (#160 ) * properly remove all filenames when -tiny is passed * document filename symbol removal	4 years ago
pagran	803c1d9439	Store obfuscated sources in object files (#158 ) Now the flag "-debugdir" does not trigger a full recompilation. Obfuscated source files are saved to object files and are extracted during linking.	4 years ago
Daniel Martí	2a0ac434fb	initial support for build caching (#142 ) As per the discussion in https://github.com/golang/go/issues/41145, it turns out that we don't need special support for build caching in -toolexec. We can simply modify the behavior of "[...]/compile -V=full" and "[...]/link -V=full" so that they include garble's own version and options in the printed build ID. The part of the build ID that matters is the last, since it's the "content ID" which is used to work out whether there is a need to redo the action (build) or not. Since cmd/go parses the last word in the output as "buildID=...", we simply add "+garble buildID=_/_/_/${hash}". The slashes let us imitate a full binary build ID, but we assume that the other components such as the action ID are not necessary, since the only reader here is cmd/go and it only consumes the content ID. The reported content ID includes the tool's original content ID, garble's own content ID from the built binary, and the garble options which modify how we obfuscate code. If any of the three changes, we should use a different build cache key. GOPRIVATE also affects caching, since a different GOPRIVATE value means that we might have to garble a different set of packages. Include tests, which mainly check that 'garble build -v' prints package lines when we expect to always need to rebuild packages, and that it prints nothing when we should be reusing the build cache even when the built binary is missing. After this change, 'go test' on Go 1.15.2 stabilizes at about 8s on my machine, whereas it used to be at around 25s before.	4 years ago
Daniel Martí	859221a950	make import path obfuscation work with the build cache What obfuscateImports did was valid, but unfortunately made the build cache redo work. This is because we were modifying object files in-place in the build cache, meaning that the Go tool would think it had to re-compile those packages. Instead, write the modified object files in a temporary directory, and leave the input object files untouched. We require a bit of extra code to keep track of this and adjust the link argument as well as its importcfg file. The function of obfuscateImports, as well as the reasoning above, is now summarized in its godoc as well. This should be the last change in preparation for proper build caching support. Rebasing the build caching branch on this commit finally makes caching work reliably every single time.	4 years ago
pagran	406036d433	rewrite private name map storage to support build caching We now store how we obfuscated unexported names in the object file itself, not a separate file. This means that the data can survive in the build cache, whereas the separate file was being lost. Luckily, we can just add an extra header to the archive, and other programs like the Go linker will just ignore it.	4 years ago
pagran	90fa325da7	Rewrite renaming logic for private names and reduce length of public names (#135 ) 1. Now private names are obfuscated based on the counter in scope of the package. 2. The length of public names is reduced to 4 bytes.	4 years ago
Andrew LeFevre	d679944408	Strip all filename and position info when -tiny is passed (#128 ) Co-authored-by: pagran <pagran@protonmail.com> Co-authored-by: lu4p <lu4p@pm.me>	4 years ago
Daniel Martí	805c895d59	set up an AUTHORS file to attribute copyright Many files were missing copyright, so also add a short script to add the missing lines with the current year, and run it. The AUTHORS file is also self-explanatory. Contributors can add themselves there, or we can simply update it from time to time via git-shortlog. Since we have two scripts now, set up a directory for them.	4 years ago
Daniel Martí	f764467e9b	all: update the docs a bit Rework the features section in the README, leaving optional features at the end of the list. Simplify the caveats list, too; the build cache and exported field/method bits only need one point each. Overall, the section was far too wordy for little reason. Also redo the help text a bit. There's now a line to briefly introduce the tool, as well as a link to the README with all the details. Finally, the flags have shorter and more consistent help strings. While at it, remove two unused global vars as spotted by staticcheck.	4 years ago
Andrew LeFevre	c8d61c772f	Garble imports and package paths in GOPRIVATE (#116 ) Finally, finally this is done. This allows import paths to be obfuscated by modifying object/archive files and garbling import paths contained within. The bulk of the code that makes parsing and writing Go object/archive files possible lives at https://github.com/Binject/debug/tree/master/goobj2, which I wrote as well. I have tested by garbling and checking for import paths via strings and grep (in order of difficulty) https://github.com/lu4p/binclude, garble itself, and https://github.com/dominikh/go-tools/tree/master/cmd/staticcheck. This only supports object/archive files produced from the Go 1.15 compiler. The object file format changed at 1.15, and 1.14 and earlier is not supported. Fixes #13.	4 years ago

21 Commits (e33179d48056660898d70841538e9898bfd992c1)