Byte Buddy is a code generation and manipulation library for Java (bytebuddy.net)

93 points by mooreds 4 days ago | 34 comments

pjmlp 19 hours ago [-]

Note that Java now has its own API for this purpose.

https://openjdk.org/jeps/484

atomicnumber3 18 hours ago [-]

for those who might be clicking through thinking "since when??", the emphasis is on "now" - this was released in JDK 24.

bytebuddy predates it by at least a decade.

pjmlp 18 hours ago [-]

Hence the now on my comment. :)

kiicia 5 hours ago [-]

Wasn’t there already like instrumentation api that was run via premain?
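
For context on the question above: the `java.lang.instrument` API lets an agent see (and optionally replace) class bytes as they are loaded, but it does not itself parse or generate bytecode, which is the gap that ASM, Byte Buddy, and the newer JDK API fill. A minimal sketch of a `premain` agent follows; the class names `LoggingAgent` and `CountingTransformer` are hypothetical, and the transformer only counts classes rather than rewriting them.

```java
import java.lang.instrument.ClassFileTransformer;
import java.lang.instrument.Instrumentation;
import java.security.ProtectionDomain;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical agent class; in a real agent jar this would typically be
// public and live in its own file.
final class LoggingAgent {
    // Number of classes the transformer has been offered so far.
    static final AtomicInteger SEEN = new AtomicInteger();

    // The JVM calls this before the application's main() when started with
    // -javaagent, provided the jar manifest names this class in Premain-Class.
    public static void premain(String agentArgs, Instrumentation inst) {
        inst.addTransformer(new CountingTransformer());
    }

    static final class CountingTransformer implements ClassFileTransformer {
        @Override
        public byte[] transform(ClassLoader loader,
                                String className,
                                Class<?> classBeingRedefined,
                                ProtectionDomain protectionDomain,
                                byte[] classfileBuffer) {
            SEEN.incrementAndGet();
            // Returning null tells the JVM to keep the original bytes.
            // A real agent would hand classfileBuffer to a bytecode library
            // here and return the rewritten array instead.
            return null;
        }
    }
}
```

Assuming the class is packaged as `agent.jar` with `Premain-Class: LoggingAgent` in its manifest, it would be attached with something like `java -javaagent:agent.jar -jar app.jar`.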

zappb 17 hours ago [-]

Ideally, tools like ByteBuddy will adopt that API as it's for low level concerns.

lbalazscs 16 hours ago [-]

We are already living in an (almost) ideal world: https://github.com/raphw/byte-buddy/discussions/1798

brabel 18 hours ago [-]

How does that compare in terms of usability and completeness?

aardvark179 15 hours ago [-]

It is complete, and I’ve found it extremely usable when writing code to trawl over a large number of class files. Looks like it should be good for code generation as well but I haven’t used that yet.

pjmlp 18 hours ago [-]

I have not yet used it, only raising awareness.

This came to be, because Oracle noticed everyone, including themselves, were depending on ASM, so the JEP was born.

cogman10 18 hours ago [-]

Yup, the ASM dependency is one that would constantly cause us headaches. A load of frameworks have a path to ASM for one reason or another and it requires an update every time you move up JVM runtimes.

It's usually not painful to update (just bump the version) but it's an annoyance.

In fact, Byte Buddy has a dep on ASM.
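
To illustrate why the bump is needed: every class file starts with a magic number followed by a minor and major version, and each Java release raises the major version by one (52 for Java 8, 68 for Java 24). ASM's `ClassReader` rejects major versions newer than the ones it knows about. A minimal sketch that reads that header, with the hypothetical helper name `ClassFileHeader`:

```java
import java.io.DataInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical helper that reads the first 8 bytes of a class file.
final class ClassFileHeader {
    final int magic;  // always 0xCAFEBABE for a valid class file
    final int minor;
    final int major;  // 44 + Java feature release, e.g. 52 for Java 8

    private ClassFileHeader(int magic, int minor, int major) {
        this.magic = magic;
        this.minor = minor;
        this.major = major;
    }

    // Loads the class file of the given type as a resource and parses its header.
    static ClassFileHeader read(Class<?> type) {
        String resource = "/" + type.getName().replace('.', '/') + ".class";
        try (InputStream in = type.getResourceAsStream(resource)) {
            if (in == null) {
                throw new IllegalArgumentException("class file not found: " + resource);
            }
            DataInputStream data = new DataInputStream(in);
            int magic = data.readInt();            // u4 magic
            int minor = data.readUnsignedShort();  // u2 minor_version
            int major = data.readUnsignedShort();  // u2 major_version
            return new ClassFileHeader(magic, minor, major);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Calling `ClassFileHeader.read(String.class).major` on a given JDK returns that JDK's own class file version, which is the number an outdated bytecode library would refuse to read.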

cogman10 18 hours ago [-]

It's complete but low level compared to Byte Buddy. A better comparison is to ASM (which is what it was meant to replace).

https://asm.ow2.io/

cogman10 18 hours ago [-]

If you are into code generation, another project of interest is JavaPoet

https://github.com/square/javapoet

I've used it to do a mass refactoring of an annotation-based library. Worked pretty great.

layer8 13 hours ago [-]

Should probably link to https://github.com/palantir/javapoet instead, as the Square version has been deprecated since 2020.

pandemic_region 3 hours ago [-]

Palantir does Java? Jikes !

selimco 17 hours ago [-]

It seems like micronaut has been able to avoid runtime bytecode generation by doing everything at compile-time. I wonder if there’s things that you can’t do the micronaut way.

hansvm 11 hours ago [-]

Sure:

- There are how many computer architectures? A compile-once-run-anywhere binary looks closer to shipping a fancy interpreter with your code than shipping a compiled project. Runtime bytecode generation is one technique for making that fast.

- More generally, anything you don't know till runtime generates a huge amount of bloat if you handle it at compile-time. Imagine, e.g., a UI for dragging and dropping ML components to create an architecture. For as much compute as you're about to pour into training, even for very simple problems, it's worth something that looks like a compilation pass to appropriately fuse everything together. You could probably get away with literally shipping a compiler, but bytecode generation is a reasonable solution too.

- Some things are literally impossible at compile-time without boxing and other overhead. E.g., once upon a time I made a zero-cost-abstraction library allowing you to specify an ML computational graph using the type system (most useful for problems where you're not just doing giant matmuls all day). It was in a language where mutually recursive generics are lazily generated, so you're able to express arbitrary nth derivatives still in the type system, still with zero overhead. What you can't do though is create a runtime program capable of creating arbitrary derivatives; there must be an upper bound for any finite-sized binary (for sufficiently complex starting functions) -- you could cap it at 2nd derivatives or 10th or whatever, but there would have to be a cap. If you move that to runtime though then you can have your cake and eat it too, less the cost of compiling (i.e., bytecode generation) at runtime.

Etc. It's a tradeoff between binary size (which might have to be infinite in the compiled case) and runtime overhead (having to "compile" for each new kind of input you find).

looperhacks 6 hours ago [-]

I haven't used micronaut specifically, but I remember using Quarkus when it was rather new. It also does a lot at compile-time compared to, say, spring. The one big disadvantage I noticed is that it's hard to eject if you need to defer something to runtime for some reason. Don't know if it's still an issue, but that's really the only disadvantage I remember.

exabrial 11 hours ago [-]

I think that's noteworthy, but just not necessary. Still really cool if memory usage and startup times are your constraints.

morkalork 19 hours ago [-]

Reminds me of a side project I did when first starting CS! The Java byte code specification is absolutely approachable and if you've never looked at it before I recommend it (although this project says you can still use it without that knowledge)

geokon 9 hours ago [-]

where to start?

ActorNightly 18 hours ago [-]

The better question is why use Java for anything these days. If you really need to run something with JVM, use Kotlin.

ackfoobar 17 hours ago [-]

As a Kotlin enjoyer, I find these comments counterproductive. Maybe they like the lack of extension functions?

kachapopopow 17 hours ago [-]

Kotlin is fatter, compiler is slower, code completion is slow as hell on large projects. Other than for building small applications, there's really no reason not to use kotlin, except that you need to actually learn the language, or else you're going to end up with a very, very slow codebase where opening a file and waiting for syntax highlighting takes 2-3 seconds and autocomplete is just painfully slow.

switchbak 15 hours ago [-]

"fatter, compiler is slower, code completion is slow as hell" - if that's all you want out of your programming language, then Java is probably a good choice for you.

For others that value the things that Kotlin brings over Java (even modern Java), and for the ways in which it delivers a simpler experience than Scala - I think it's a pragmatic and sensible decision.

vips7L 15 hours ago [-]

I do like the lack of extension functions. I find them confusing, especially when you can use them on things that are null.

simon_void 4 hours ago [-]

I wonder if that confusion is due to the fact that you haven't yet wrapped your head around the fact that extension functions are "just" syntactic sugar for static functions. The implicit "this" becomes the first parameter of the static function, and function parameters can be null. Now you might ask, "why not use static (/first class) functions then?" Because those "feel" less idiomatic to use than extension functions or methods that are defined on the object (hierarchy) itself. But understanding why the extension type can be nullable is not the same as using it on nullable types. I restrict my extension functions to non-nullable types most of the time as well. The best exception to this preference, just to see where it makes sense, is the built-in function [toString](https://kotlinlang.org/api/core/kotlin-stdlib/kotlin/to-stri...), since you want it to return "null" if you invoke it on null.
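
The desugaring described above can be sketched from the Java side. On the JVM, a Kotlin extension function is compiled to a static method whose first parameter is the receiver, so a nullable receiver is just a parameter that may be null. The class and method names below (`Extensions`, `toStringOrNull`, `shout`) are hypothetical and only illustrate the shape:

```java
// Hypothetical Java view of two Kotlin extension functions.
final class Extensions {
    private Extensions() {}

    // Roughly the shape of `fun Any?.toString(): String`:
    // the receiver becomes the first parameter and is allowed to be null.
    static String toStringOrNull(Object self) {
        return self == null ? "null" : self.toString();
    }

    // Roughly the shape of an extension on a non-nullable type,
    // e.g. `fun String.shout(): String`; no null handling is written here.
    static String shout(String self) {
        return self.toUpperCase() + "!";
    }
}
```

Here `Extensions.toStringOrNull(null)` returns the string `"null"` instead of throwing, which is the behavior the nullable receiver on `toString` is there to provide.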

AdieuToLogic 14 hours ago [-]

> The better question is why use Java for anything these days.

Java (the language) is pretty much "C for the JVM." By that, I mean frameworks/libraries intended for maximum potential use in languages running on the JVM (such as Kotlin, Scala, and of course Java) all support Java (the language) interoperability. Many written in alternate languages targeting the JVM, such as Akka[0], typically have some degree of Java (the language) support as well.

While I prefer to program in one of the alternate programming languages targeting the JVM, I understand why many OSS projects are implemented in Java (the language) for the reasons outlined above.

0 - https://github.com/akka/akka

ActorNightly 7 hours ago [-]

The problem is, if you are trying to optimize for the JVM, you are already down the wrong path. The JVM is useful in a very small niche: when you want something that is faster than Python/Node, but still want cross-platform support and somewhat rapid development.

It may allow closer to JVM access, however the entire ecosystem is a colossal mess. The main() implementation in having a class that wraps it is pretty dumb, standard stuff like Lombok hacks the AST (not to mention in general the annotation preprocessors work by printing code strings to file), and the whole dependency injection frameworks are very much separated from actual processing with how much stuff they do in the background.

And then there is the whole Apache foundation with its software being used widely as standard. The same foundation where someone wrote the code that allows log statements to pull arbitrary code from the internet and execute it, and that change made its way past multiple eyes before being merged to production without a single person realizing how crazy it is.

If you want speed, write stuff in C/Rust/Clean C++ (without templates, no C style memory access, etc.). If you want to be efficient, write stuff in Python/Node.

simon_void 3 hours ago [-]

I like Rust as much as the next guy, but Kotlin is the most ergonomic programming language I know. So my approach is to use Kotlin by default and, should it some day become clear that the service is a bottleneck (or if the cloud cost can be lowered substantially), to only then rewrite it in Rust. At that point the service has probably already gained most of the functionality it'll ever have, which should make the Rust conversion as straightforward as it can be.

selimco 6 hours ago [-]

You don’t really need any of the apache commons libraries with modern java versions, if that’s what you were referring to. Also I think that most people who are considering doing jvm development would consider kotlin as an alternative language or maybe c# and dotnet as an alternative ecosystem. I believe rust, c or cpp are rarely going to be considerations for most people in that situation.

peterashford 9 hours ago [-]

I've worked in both and I prefer Java

symbolicAGI 9 hours ago [-]

A recent good reason for using Java is that frontier LLMs are trained with very large amounts of high quality enterprise Java source code. Claude Code for example loves Java and its static type system.

I constrain my LLM-generated Java code to only static methods of 20 LOC or less, and limit data types to those that are JSON compatible. Both of these lead to more reliable code and data that Claude Code fully understands and generates.

I am preparing to auto-generate an agent-based application that might reach 1.5 million Java LOC. Hard to imagine accomplishing that with Javascript or Python or C++.

wvenable 7 hours ago [-]

> A recent good reason for using Java is that frontier LLMs are trained with very large amounts of high quality enterprise Java source code.

Where did it get it from?

gf000 5 hours ago [-]

Could you please expand on how you limit the generated code? I haven't dived deep into Claude code, mostly just familiar with OpenAI's offering.
