Speeding up Java with GraalVM

Feb 17, 2021

Microservices and programming languages

Microservices can be built using most modern languages. Suggesting any single language is best for building services undermines one of the key value propositions of the architecture: the freedom and flexibility to select the best technology for the job. A quick search for the best languages for building microservices yields a wide range of potential candidates. However, when looking at the intersection of candidate lists, the top three generally include Java, NodeJS, and Go. In this article, we will be focusing on Java.

Microservices and Java

Java has been a popular programming language for a long time. A visit to the Tiobe index or Redmonk index illustrates that, as of this writing, Java has held one of the top 3 positions since 2001.

Java microservice support

From its initial release, Java has included networking support. This factis not surprising considering the corporate motto of Sun Microsystems, the company that originally created Java was:

The Network is the Computer.

John Gage, Sun Microsystems

Distributed programming is in Java's DNA. While language support for networking makes microservices possible, it is painfully low-level. Fortunately, with Java, we have a host of microservice frameworks to mitigate our suffering. These include Dropwizard, Vert.x, Micronaut, Helidon, Quarkus, and of course, the leading Java microservice framework, Spring Boot.

The problem with java microservices

Before we get started, we must address several valid concerns often raised when discussing Java microservices. While Java has a wealth of support for building microservices, it suffers from a couple of critical issues, specifically: longer startup time and larger memory footprint. These issues arise from the use of the Java Virtual Machine (JVM). These issues are not exclusive to Java. Any language running in the JVM (e.g., Scala , Kotlin , Clojure , Groovy , JRuby, etc.) is also affected.

Service Startup Time

Before a Java microservice can begin executing, its host JVM must first launch and initialize itself. The time it takes for the JVM itself is an application that runs the Java application. Before the service can even be loaded, the JVM must first be started. The time it takes for the JVM to start determines the absolute minimum startup time for any Java application. Once initialized, the JVM must perform quite a bit of work under the covers to launch the service's bytecode. At a high level, the JVM performs three primary tasks when launching an application: loading, linking, and initialization.

loading

For dynamically loaded languages like Java,a portion of the application code is loaded at startup, with additional code asynchronously loaded as needed. The JVM specification defines loading as:

Loading is the process of finding the binary representation of a class or interface type with a particular name and creating a class or interface from that binary representation.

JVM Specification

The compiled bytecode for every class and interface that comprises the application must be retrieved from its jar file container and loaded into the JVM's memory. To accomplish this, Java depends on two types of classloaders: a Bootstrap ClassLoader, which is built into the JVM and has its loading policy defined by the JVM Specification, and User-Defined ClassLoaders which allow application developers to designate a custom loading policy. All classes loaded by the JVM must be loaded by one of these types of loaders.

Each application class and interface, as well as each of its transitive dependencies, must be loaded. The aggregate load time is then added to the JVM startup time to calculate the overall startup time. The complexity of the microservice measured by the total number of classes and interfaces loaded directly impacts the startup time.

linking

The next process the JVM must perform is linking. The JVM specification describes linking as:

Linking a class or interface involves verifying and preparing that class or interface, its direct superclass, its direct superinterfaces, and its element type (if it is an array type), if necessary. Resolution of symbolic references in the class or interface is an optional part of linking.

JVM Specification

Verfication
Verification ensures each class or interface conforms to the structural requirements of the JVM. Additionally, verification may require additional classes and interfaces to be loaded. During verification, the JVM will ensure that:
- There are no uninitialized variables.
- No access rules for private data and methods are violated.
- All method calls match the object reference.
- There are no operand stack overflows or underflows.
- All local variable uses and stores are valid.
- All JVM instruction arguments are of valid types.
- No final classes are subclassed and that no final methods are overridden.
- All field references and method references have valid names, valid classes, and a valid type descriptor.
If any verification check fails, the JVM throws a java.lang.VerifyError error.
Preparation
Preparation is the process of creating and initializing the static fields for a class or interface to its default values.
Resolution
For various JVM instructions that make symbolic references to the JVM's run-time constant pool (e.g., newarray, checkcast, getfield, getstatic, instanceof, invokedynamic, invokeinterface, invokespecial, invokestatic, invokevirtual, ldc, ldc_w, multianewarray, new, putfield, and putstatic ), the linking process requires an additional step. This resolution step dynamically determines concrete values from the symbolic references in the run-time constant pool.

JVM

startup time

initialization

Initialization

Initialization of a class or interface consists of executing the class or interface initialization method &lth;clinit> (§2.9.2).

JVM Specification

JVM

Service memory footprint

JVM

Java

Project Jigsaw and Java 9

Java SDK

jdeps

jlink

Jdeps is a class and module dependency analyzer that identifies the classes and modules required for a given application.
jlink allows us to build an optimized Java runtime image to include only those classes and modules need by the application.

Java 9

class memory footprint

Java Runtime

Introducing The GraalVM

Sun Microsystems

Oracle

Sun

Sun Labs

Maxine Virtual Machine Project

modular design and code reuse

Java

meta-circular

Hotspot VM

GraalVM (v19.0)

Maxine VM

GraalVM

JRE

GraalVM

GraalVM Compiler

Truffle Language Implementation Framework

LLVM runtime

Javascript runtime

GraalVM Native Image

GraalVM Compiler

GraalVM compiler

Java

Streams

Lambdas

JRE

Truffle Language Implementation Framework

LLVM runtime

polyglot

LLVM runtime

LLVM bitcode

C++

Javascript Runtime

JavaScript runtime

ECMAScript-compliant

JavaScript

Node.js

Native Image (AOT)

Java

startup time

memory footprint

GraalVM

Native Image

GraalVM Native Image

JVM

ahead-of-time (AOT)

GraalVM

Substrate VM

GraalVM

Faster Startup

GraalVM

classloading

AOT

Just-In-Time Compiler(JIT)

Smaller memory footprint

jdeps

jlink

JVM

JIT

Native Image limitations

JVM

Reflection

Java

thread

heap

JVM

SubstrateVM

The serial collector is usually adequate for most small applications, in particular those requiring heaps of up to approximately 100 megabytes on modern processors. The other collectors have additional overhead or complexity, which is the price for specialized behavior. If the application does not need the specialized behavior of an alternate collector, use the serial collector. One situation where the serial collector isn't expected to be the best choice is a large, heavily threaded application that runs on a machine with a large amount of memory and two or more processors.

JVM

JIT

Java

JVM

reflection

dynamic class loading

classpath handling

dynamic proxies

classpath resources

JVM

GraalVM Native Image Configuration

JVM

META_INF/native-image

native-image

native-image.properties - This file is used to configure the native image builder's native-image command-line arguments.
reflect-config.json - Native image has partial support for reflection. However, to take advantage of this, we may need to provide the builder with additional metadata for those program elements. The reflection-config.json file provides the configuration information. During the build, the native image builder performs a static analysis of the application and attempts to automatically detect calls to the Reflection API. In situtations where the builder is unable to automatically detect reflection calls, they must be manually added to this file.
proxy-config.json - Because Java Dynamic Proxies can also be detected autmatically, we can also specify the dynamic proxy classes be generated by the native image builder manually by editing this configuration file.
resource-config.json - Application resources on the classpath are not automatically added to the native image by default. Resources called by Class.getResource(), Class.getResourceAsStream() or similar classloader methods must be explicitly configured in the resource-config.json file.
jni-config.json - GraalVM native image supports JNI reflection, Java to Native method calls, Native to Java method calls, object creation through configuration.

Runtime vs. Build-time initialization

--initialize-at-build-time and --initialize-at-run-time

Java Frameworks using Graalvm

GraalVM Native Image

Java

Java

Java

Coming Up

GraalVM Native Image

Spring Native

beta

Spring Framework

GraalVM Native Image

ThinkMicroservices.com

Recent Posts

Speeding up Java with GraalVM

Microservices and programming languages

Microservices and Java

Java microservice support

The problem with java microservices

Service Startup Time

loading

linking

Verfication

Preparation

Resolution

initialization

Service memory footprint

Introducing The GraalVM

GraalVM Compiler

Truffle Language Implementation Framework

LLVM runtime

Javascript Runtime

Native Image (AOT)

Faster Startup

Smaller memory footprint

Native Image limitations

GraalVM Native Image Configuration

Runtime vs. Build-time initialization

Java Frameworks using Graalvm

Coming Up

Think Microservices

Recent Posts

Object Caching, Redis, and Kubernetes

Object Storage, Minio, and Kubernetes

OpenFaaS NATS Event Connector

Tags

About