Commit Graph

736 Commits (master)
 

Author SHA1 Message Date
Daniel Martí 8ee4c91196 make gotoolchain.txtar upgrade to the host's GOVERSION
On CI we test on go1.23.x and go1.24.x, so if we always upgrade
to the latest go1.24.x, that will cause garble to complain
when running on go1.23.x:

    garble was built with "go1.23.7" and can't be used with the newer "go1.24.1"

Moreover, the test hard-coded go1.24.1, which is currently the latest
go1.24.x but will not be for long, so this test was brittle.
3 days ago
Daniel Martí db3003b9fa use the correct toolchain "go" tool under GOTOOLCHAIN=auto
We call `go list` to collect information about all the packages
to obfuscate and build, which is crucial to be able to perform
obfuscation of names used across packages.

However, when GOTOOLCHAIN causes a toolchain upgrade,
we must ensure that we use the upgraded Go tool;
otherwise we are mixing information from different toolchain versions.

Fixes #934.
3 days ago
Daniel Martí 7f80dfb59d rebuild cmd/link with the correct toolchain under GOTOOLCHAIN=auto
When we build the patched cmd/link binary for use by garble,
we perform this build in a temporary directory so that the Go module
from the user does not get in the way.

When the user module made us upgrade the toolchain per GOTOOLCHAIN,
leaving that module's directory stops upgrading the toolchain,
so we patch a newer toolchain and build it with an older toolchain.
This is largely harmless, but it makes the newer toolchain think
it is actually an older toolchain, which leads to those pesky
"linker object header mismatch" version errors.

Updates #934.
3 days ago
Daniel Martí 6f7af3b785 add test reproducing gotoolchain upgrade errors
While here, make link.version more readable by adding a newline
and document the assumption it makes about GOVERSION.

For #934.
3 days ago
Daniel Martí ad47efbe72 actually collect intrinsics from Go 1.24
The diffstat for go_std_tables.go shows that we were missing
more than two dozen new intrinsic functions from Go 1.24,
which could lead to the intrinsification done by the toolchain
to no longer work and leave programs with slower generic functions.
5 days ago
Daniel Martí ff989a15b8 regenerate tables with the latest Go versions
These will typically not cause changes, but just in case.
4 weeks ago
Daniel Martí c23dd1672b use LC_ALL=C rather than LANG=en_US
LC_ALL overrides all settings rather than setting a default.
And C is the English locale that should be used for scripting.
4 weeks ago
Daniel Martí 96a954792d update dependencies 4 weeks ago
Daniel Martí cb83c50b13 all: run gopls's modernize -fix
Except on reflect_abi_code.go, as that needs to be compatible
with older versions of Go given that we inject its code.
1 month ago
Daniel Martí fa2e718bd1 start using go/ast.Preorder
Thanks to being able to use range-over-func, some control flow
in our code gets simplified.
1 month ago
Daniel Martí 275737aabd start using go/types.Func.Signature
Guaranteed to return a *types.Signature, so no need to type assert.
1 month ago
Daniel Martí 2e9cd84bde CHANGELOG: prepare for a bugfix release
A minor version bump seems unnecessary when we are just adding support
for a new Go version, and that support was very easy to do.
2 months ago
Daniel Martí 3936ebfe5d update to Go 1.24.0 and test with it on CI as well 2 months ago
Daniel Martí 2adfc43326 bump unsupportedGo to mark Go 1.24 as supported
debugdir.txtar also needed tweaking as runtime/map.go is gone
starting in Go 1.24.

Finally, modinfo.txtar needed tweaking since Go 1.24 started stamping
Go binaries with VCS-derived module versions, so we no longer end up
with empty "(devel)" versions.
2 months ago
Daniel Martí e8392a640a update go_std_tables.go with go1.24rc3 2 months ago
Daniel Martí 28ccfa094b internal/linker: add Go patches rebased on go1.24rc3
See https://github.com/burrowers/go-patches/pull/8.
2 months ago
Daniel Martí 561158dca9
CHANGELOG: v0.14.0 is happening today 2 months ago
Daniel Martí f90cc05f6d CHANGELOG: write release notes for the upcoming release 2 months ago
Paul Scheduikat 97833204f8 skip all type parameters in recordType
We only did this for Container in the type switch, but not for Struct.
The added test case panics otherwise.
Just like in the previous case, we still don't need to recurse
into type parameters for fieldToStruct to be filled correctly.

Fixes #899
2 months ago
Daniel Martí e6c0aeffe1 README: we require Go 1.23.5 now 2 months ago
Daniel Martí f5dc4e784a simplify reflectInspector method signatures
A parent paramter was unused, and a cache parameter could be reached
via the receiver.
2 months ago
Daniel Martí be2921064a temporarily disable staticcheck
It's not crucial for CI to work, and go1.24.0 is around the corner.
3 months ago
Daniel Martí e0dbea2b3d hash structs via the bundled and altered typeutil.hash
As spotted by the protobuf package via check-third-party.sh,
the two structs below are identical:

    type alias1 = int64
    type Struct1 struct { Alias alias1 }

    type alias2 = int64
    type Struct2 struct { Alias alias2 }

Our previous approach with stripStructTags dealt with struct tags,
but it did not deal with aliases, which are now present in go/types
thanks to the new alias tracking.

The new approach properly ignores struct tags and unaliases any
type aliases, resulting in correct hashes of any type.
3 months ago
Daniel Martí 8ca3d0adcf bundle x/tools/go/types/typeutil.hash and modify it
See the two "NOTE" comments below. In summary:

* struct field tags are not hashed
* named types are hashed by name rather than by pointer

We are keeping an eye on https://go.dev/issue/69420 as well.
3 months ago
Daniel Martí 41624da69b update golang.org/x/tools 3 months ago
Daniel Martí 4787f9bb3d internal/linker: use go/version to get the Go major version 3 months ago
Daniel Martí 97ae350b0e apply minor cleanups suggested by tools
GARBLE_PARENT_WORK hasn't been used for a while.
A couple of other minor simplifications.
3 months ago
Daniel Martí 83a06019be rely on go/types alias tracking
Allows us to remove EmbeddedAliasFields, recordedObjectString,
and all the logic around using them.

Resolves an issue where a user was running into a panic in
our logic to record embedded aliases.

Note that this means we require Go 1.23.5 or later now,
which also meant some changes to goversion.txtar to keep it green.

Fixes #827.
3 months ago
Daniel Martí deff1cb44d fix running BenchmarkBuild
It was accidentally broken during the last testscript refactor,
where we removed an os.Exit call around the main call there.
3 months ago
Daniel Martí 3c742dac65 centralize logic dealing with main package import paths
obfuscatedImportPath already handled ToObfuscate, so the callers
don't have to do that as well. Handle main packages too, whose logic
was sprinkled and repeated throughout the project.
3 months ago
Daniel Martí 066771481b obfuscate package names separately from import paths
Reflection can show package names alone via reflect.Type.String,
and I'm sure they are available in other ways.
We were obfuscating "package p" exactly like the import path "foo.com/p"
which worked OK for the most part, but not for reflection.

This is also a slight improvement to the quality of the obfuscation,
given that package names and import paths will no longer be aligned
when obfuscated.
3 months ago
Daniel Martí 0b69dcd472 use lpkg directly in reflectInspector
There's no need to reach for sharedCache.ListedPackages
when the caller is computePkgCache, which already has the lpkg value.

While here, compare *go/types.Package pointers directly
rather than via their import path strings.
3 months ago
Daniel Martí abdec5abda assume obj is non-nil in obfuscatedObjectName
Only two callers did pass nil, and there's no reason for them to do so.
They should be the ones to check that typeToObj did not return nil.
3 months ago
Daniel Martí 96356e9015 rename reflectedObjectString for clarity
This function returns obfuscated names, so use that as its name.
Moreover, some of the callers still called the result "objStr",
which misled me into thinking the string was a unique object path.

Leave a TODO behind about using go/types/objectpath too.
Our own recordedObjectString is sort of similar, but not as principled.
3 months ago
Daniel Martí 76905ba3bc use the latest testscript to drop func() int
And don't fail when requesting help, which is what Go's flag package
has been moving towards.
3 months ago
Jamison Lahman 7678613fd0
only parse stdout from "go env"
I use a wrapper around go that prints some debug to stderr
which causes the unmarshalling below to fail.
4 months ago
Daniel Martí 6ac80db02c simplify the use of toolexecCmd in reverse
The list flags are entirely unused, so they can be omitted.
The only argument that matters is the package argument to load.
While here, update the TODO, as it no longer applies.
4 months ago
Daniel Martí 4e71f1aee3 update go_std_tables.go with go1.23.3
No changes to the values. However, to avoid noise with every bugfix
release changing all the lines, only use go1.X rather than go1.X.Y
to annotate each of the entries in maps and lists.
4 months ago
Daniel Martí 30d1d8cbb7 go back to sorting _originalNamePairs lexicographically
Now that we only use the list to create a replacer at init time,
we no longer need to spend extra effort sorting by length first.
The benchmark shows no measurable difference in performance.
4 months ago
Daniel Martí ef76bf1c50 use a strings.Replacer to reverse names in internal/abi
This way, rather than using a double loop quadratic algorithm
to search for each name to replace in a string,
we can make use of the reasonably efficient generic replacer
which makes use of tries.

Copying some code from the strings package is not ideal,
but it beats having to re-implement such an algorithm ourselves.

Not only is the algorithm much faster, as we are no longer quadratic,
but the replacer also appends into a buffer to avoid repeated string
copies, which means we allocate fewer bytes per operation.

                       │      old       │                new                 │
                       │     sec/op     │   sec/op     vs base               │
    AbiOriginalNames-8   135708.0n ± 0%   391.1n ± 5%  -99.71% (p=0.001 n=7)

                       │     old      │                   new                    │
                       │     B/s      │      B/s        vs base                  │
    AbiOriginalNames-8   2.565Mi ± 0%   890.112Mi ± 4%  +34597.03% (p=0.001 n=7)

                       │     old     │                new                │
                       │    B/op     │    B/op     vs base               │
    AbiOriginalNames-8   5464.0 ± 0%   848.0 ± 0%  -84.48% (p=0.001 n=7)

                       │    old     │                new                │
                       │ allocs/op  │ allocs/op   vs base               │
    AbiOriginalNames-8   18.00 ± 0%   16.00 ± 0%  -11.11% (p=0.001 n=7)
4 months ago
Daniel Martí 8f71a501a9 remove now-redundant len shortcut in _originalNames
As Paul Scheduikat points out, the loop already does not start
if the length of name is less than minHashLength.

                       │     old     │             new              │
                       │   sec/op    │   sec/op     vs base         │
    AbiOriginalNames-8   135.6µ ± 1%   135.5µ ± 0%  ~ (p=1.000 n=7)
4 months ago
Daniel Martí 49cfcfbf95 clarify internal/abi name code a bit
Use the term "original name" rather than "real name" for the code
as it is clearer that we mean the unobfuscated name.

Update the comments at the top of the file to be clearer with the
explanation of what kinds of inputs we can expect.

While here, use fmt.Appendf to simplify the generation code a bit.
4 months ago
Daniel Martí c83c5ce3e6 sort _realNamePairs from shortest to longest obfuscated name
This lets us only check names up to the remaining input string length.

                  │     old     │                new                 │
                  │   sec/op    │   sec/op     vs base               │
    AbiRealName-8   196.7µ ± 1%   172.3µ ± 1%  -12.41% (p=0.001 n=7)

                  │     old      │                 new                 │
                  │     B/s      │     B/s       vs base               │
    AbiRealName-8   1.774Mi ± 1%   2.022Mi ± 0%  +13.98% (p=0.001 n=7)

                  │     old      │                new                 │
                  │     B/op     │     B/op      vs base              │
    AbiRealName-8   5.359Ki ± 0%   5.336Ki ± 0%  -0.44% (p=0.001 n=7)

                  │    old     │               new                │
                  │ allocs/op  │ allocs/op   vs base              │
    AbiRealName-8   19.00 ± 0%   18.00 ± 0%  -5.26% (p=0.001 n=7)
4 months ago
Daniel Martí c012f08c66 store name pairs for _realName as a slice of pairs
Iterating over a map is much more expensive than iterating over a slice,
given how it needs to work out which keys are present in each bucket
and then randomize the order in which to navigate the keys.
None of this work needs to happen when iterating over a slice.

A map would be nice if we were to actually do map lookups, but we don't.

                  │     old     │                new                 │
                  │   sec/op    │   sec/op     vs base               │
    AbiRealName-8   707.1µ ± 1%   196.7µ ± 1%  -72.17% (p=0.001 n=7)

                  │     old      │                  new                  │
                  │     B/s      │      B/s       vs base                │
    AbiRealName-8   517.6Ki ± 2%   1816.4Ki ± 1%  +250.94% (p=0.001 n=7)

                  │     old      │                new                 │
                  │     B/op     │     B/op      vs base              │
    AbiRealName-8   5.362Ki ± 0%   5.359Ki ± 0%  -0.05% (p=0.001 n=7)

                  │    old     │              new              │
                  │ allocs/op  │ allocs/op   vs base           │
    AbiRealName-8   19.00 ± 0%   19.00 ± 0%  ~ (p=1.000 n=7) ¹
4 months ago
Daniel Martí b14bf2019d optimize _realName a bit via shortcuts
│     old      │                new                 │
                  │    sec/op    │   sec/op     vs base               │
    AbiRealName-8   1025.9µ ± 6%   707.1µ ± 1%  -31.08% (p=0.001 n=7)

                  │     old      │                 new                 │
                  │     B/s      │     B/s       vs base               │
    AbiRealName-8   351.6Ki ± 6%   517.6Ki ± 2%  +47.22% (p=0.001 n=7)

                  │     old      │              new              │
                  │     B/op     │     B/op      vs base         │
    AbiRealName-8   5.363Ki ± 0%   5.362Ki ± 0%  ~ (p=0.178 n=7)

                  │    old     │              new              │
                  │ allocs/op  │ allocs/op   vs base           │
    AbiRealName-8   19.00 ± 0%   19.00 ± 0%  ~ (p=1.000 n=7) ¹
4 months ago
Daniel Martí 210b19ac59 add benchmark for the injected _realName abi code
│     new     │
                  │   sec/op    │
    AbiRealName-8   1.026m ± 6%

                  │     new      │
                  │     B/s      │
    AbiRealName-8   351.6Ki ± 6%

                  │     new      │
                  │     B/op     │
    AbiRealName-8   5.363Ki ± 0%

                  │    new     │
                  │ allocs/op  │
    AbiRealName-8   19.00 ± 0%
4 months ago
Daniel Martí 78a6a42ecf split code to inject into internal/abi into a separate file
This way we can edit it as regular Go code, with the help of an LSP,
and we can also test and benchmark it.
4 months ago
Paul Scheduikat 926f3de60d obfuscate all names used in reflection
Go code can retrieve and use field and method names via the `reflect` package.
For that reason, historically we did not obfuscate names of fields and methods
underneath types that we detected as used for reflection, via e.g. `reflect.TypeOf`.

However, that caused a number of issues. Since we obfuscate and build one package
at a time, we could only detect when types were used for reflection in their own package
or in upstream packages. Use of reflection in downstream packages would be detected
too late, causing one package to obfuscate the names and the other not to, leading to a build failure.

A different approach is implemented here. All names are obfuscated now, but we collect
those types used for reflection, and at the end of a build in `package main`,
we inject a function into the runtime's `internal/abi` package to reverse the obfuscation
for those names which can be used for reflection.

This does mean that the obfuscation for these names is very weak, as the binary
contains a one-to-one mapping to their original names, but they cannot be obfuscated
without breaking too many Go packages out in the wild. There is also some amount
of overhead in `internal/abi` due to this, but we aim to make the overhead insignificant.

Fixes #884, #799, #817, #881, #858, #843, #842

Closes #406
4 months ago
Daniel Martí 515358b18d remove two unused fields from sharedCache
ExecPath is only used within toolexecCmd, so make it a local variable.

GOMOD hasn't been used since we dropped the use of GOPRIVATE.
4 months ago
Daniel Martí b38f42da0f stop using listedPackage.IgnoredGoFiles
We used it to detect GOOS-specific packages and ignore their load errors
without having to do a substring search.
However, it turns out that repeatedly loading the string slice
from gob files in the cache is rather slow, particularly since many
Go packages have dozens of GOOS-specific files which can be ignored.

            │      old      │                 new                  │
            │ cached-sec/op │ cached-sec/op  vs base               │
    Build-8     340.3m ± 1%     335.8m ± 2%  -1.32% (p=0.002 n=10)

            │     old     │                new                 │
            │ mallocs/op  │ mallocs/op   vs base               │
    Build-8   35.73M ± 0%   35.09M ± 0%  -1.79% (p=0.000 n=10)
4 months ago