This value is hard-coded in the linker and written in a header. We could rewrite the final binary, like we used to do with import paths, but that would require once again maintaining libraries to do so. Instead, we're now modifying the linker to do what we want. It's not particularly hard, as every Go install has its source code, and rebuilding a slightly modified linker only takes a few seconds at most. Thanks to `go build -overlay`, we only need to copy the files we modify, and right now we're just modifying one file in the toolchain. We use a git patch, as the change is fairly static and small, and the patch is easier to understand and maintain. The other side of this change is in the runtime, as it also hard-codes the magic value when loading information. We modify the code via syntax trees in that case, like `-tiny` does, because the change is tiny (one literal) and the affected lines of code are modified regularly between major Go releases. Since rebuilding a slightly modified linker can take a few seconds, and Go's build cache does not cache linked binaries, we keep our own cached version of the rebuilt binary in `os.UserCacheDir`. The feature isn't perfect, and will be improved in the future. See the TODOs about the added dependency on `git`, or how we are currently only able to cache one linker binary at once. Fixes #622. |
2 years ago | |
---|---|---|
.github | 2 years ago | |
internal | 2 years ago | |
scripts | 2 years ago | |
testdata | 2 years ago | |
.gitattributes | 6 years ago | |
.gitignore | 3 years ago | |
AUTHORS | 3 years ago | |
CHANGELOG.md | 2 years ago | |
CONTRIBUTING.md | 3 years ago | |
LICENSE | 5 years ago | |
README.md | 2 years ago | |
bench_test.go | 2 years ago | |
cmdgo_quoted.go | 3 years ago | |
go.mod | 2 years ago | |
go.sum | 2 years ago | |
hash.go | 2 years ago | |
main.go | 2 years ago | |
main_test.go | 2 years ago | |
position.go | 3 years ago | |
reverse.go | 3 years ago | |
runtime_patch.go | 2 years ago | |
shared.go | 2 years ago |
README.md
garble
go install mvdan.cc/garble@latest
Obfuscate Go code by wrapping the Go toolchain. Requires Go 1.18 or later.
garble build [build flags] [packages]
The tool also supports garble test
to run tests with obfuscated code,
and garble reverse
to de-obfuscate text such as stack traces.
See garble -h
for up to date usage information.
Purpose
Produce a binary that works as well as a regular build, but that has as little information about the original source code as possible.
The tool is designed to be:
- Coupled with
cmd/go
, to support modules and build caching - Deterministic and reproducible, given the same initial source code
- Reversible given the original source, to de-obfuscate panic stack traces
Mechanism
The tool wraps calls to the Go compiler and linker to transform the Go build, in order to:
- Replace as many useful identifiers as possible with short base64 hashes
- Replace package paths with short base64 hashes
- Replace filenames and position information with short base64 hashes
- Remove all build and module information
- Strip debugging information and symbol tables via
-ldflags="-w -s"
- Obfuscate literals, if the
-literals
flag is given - Remove extra information, if the
-tiny
flag is given
By default, the tool obfuscates all the packages being built.
You can manually specify which packages to obfuscate via GOGARBLE
,
a comma-separated list of glob patterns matching package path prefixes.
This format is borrowed from GOPRIVATE
; see go help private
.
Note that commands like garble build
will use the go
version found in your
$PATH
. To use different versions of Go, you can
install them
and set up $PATH
with them. For example, for Go 1.17.1:
$ go install golang.org/dl/go1.17.1@latest
$ go1.17.1 download
$ PATH=$(go1.17.1 env GOROOT)/bin:${PATH} garble build
Literal obfuscation
Using the -literals
flag causes literal expressions such as strings to be
replaced with more complex expressions, resolving to the same value at run-time.
String literals injected via -ldflags=-X
are also replaced by this flag.
This feature is opt-in, as it can cause slow-downs depending on the input code.
Literals used in constant expressions cannot be obfuscated, since they are
resolved at compile time. This includes any expressions part of a const
declaration, for example.
Tiny mode
With the -tiny
flag, even more information is stripped from the Go binary.
Position information is removed entirely, rather than being obfuscated.
Runtime code which prints panics, fatal errors, and trace/debug info is removed.
All in all, this can make binaries 2-5% smaller.
With this flag, no panics or fatal runtime errors will ever be printed, but they
can still be handled internally with recover
as normal. In addition, the
GODEBUG
environmental variable will be ignored.
Note that this flag can make debugging crashes harder, as a panic will simply
exit the entire program without printing a stack trace, and all source code
positions are set to line 1. Similarly, garble reverse
is generally not useful
in this mode.
Speed
garble build
should take about twice as long as go build
, as it needs to
complete two builds. The original build, to be able to load and type-check the
input code, and then the obfuscated build.
Garble obfuscates one package at a time, mirroring how Go compiles one package
at a time. This allows Garble to fully support Go's build cache; incremental
garble build
calls should only re-build and re-obfuscate modified code.
Note that the first call to garble build
may be comparatively slow,
as it has to obfuscate each package for the first time. This is akin to clearing
GOCACHE
with go clean -cache
and running a go build
from scratch.
Determinism and seeds
Just like Go, garble builds are deterministic and reproducible in nature.
This has significant benefits, such as caching builds and being able to use
garble reverse
to de-obfuscate stack traces.
By default, garble will obfuscate each package in a unique way, which will change if its build input changes: the version of garble, the version of Go, the package's source code, or any build parameter such as GOOS or -tags. This is a reasonable default since guessing those inputs is very hard.
However, providing your own obfuscation seed via -seed
brings some advantages.
For example, builds sharing the same seed will produce the same obfuscation,
even if any of the build parameters or versions vary.
It can also make reverse-engineering harder, as an end user could guess what
version of Go or garble you're using.
Note that extra care should be taken when using custom seeds.
If a seed used to build a binary gets lost, garble reverse
will not work.
Rotating the seeds can also help against reverse-engineering in the long run,
as otherwise some bits of code may be obfuscated the same way over time.
An alternative approach is -seed=random
, where each build is entirely different.
Caveats
Most of these can improve with time and effort. The purpose of this section is to document the current shortcomings of this tool.
-
Exported methods are never obfuscated at the moment, since they could be required by interfaces. This area is a work in progress; see #3.
-
Garble aims to automatically detect which Go types are used with reflection, as obfuscating those types might break your program. Note that Garble obfuscates one package at a time, so if your reflection code inspects a type from an imported package, and your program broke, you may need to add a "hint" in the imported package:
type Message struct { Command string Args string } // Never obfuscate the Message type. var _ = reflect.TypeOf(Message{})
-
Go plugins are not currently supported; see #87.
-
Garble requires
git
to patch the linker. That can be avoided once go-gitdiff supports non-strict patches.
Contributing
We welcome new contributors. If you would like to contribute, see CONTRIBUTING.md as a starting point.