net.corda.v5.serialization
Object serialization is the process of converting objects into a stream of bytes while deserialization is the reverse process of creating objects from a stream of bytes. It takes place every time we store transactions in the database.
Corda uses a custom form of typesafe binary serialisation. The primary drivers for this were:
- A desire to have a schema describing what has been serialized alongside the actual data:
- To assist with versioning, both in terms of being able to interpret data archived long ago (e.g. trades from a decade ago, long after the code has changed) and between differing code versions.
- To make it easier to write generic code; e.g. user interfaces that can navigate the serialized form of data.
- To support cross platform (non-JVM) interaction, where the format of a class file is not so easily interpreted.
- A desire to use a documented and static wire format that is platform independent, and is not subject to change with 3rd party library upgrades, etc.
- A desire to support open-ended polymorphism, where the number of subclasses of a superclass can expand over time and the subclasses do not need to be defined in the schema upfront. This is key to many Corda concepts, such as states An immutable object representing a fact known by one or more participants at a specific point in time. You can use states to represent any type of data, and any kind of fact. .
- Increased security when deserializing objects by constructing them through supported constructors, rather than having data inserted directly into their fields without an opportunity to validate consistency or intercept attempts to manipulate supposed invariants.
- Binary formats work better with digital signatures than text based formats, as there’s much less scope for changes that modify syntax but not semantics.
For more information, see the documentation for the package in the Java API documentation.
Allow List
In classic Java serialization, any class on the JVM classpath can be deserialized. This is a source of exploits and vulnerabilities that exploit the large set of third-party libraries that are added to the classpath as part of a JVM application’s dependencies and carefully craft a malicious stream of bytes to be deserialized. In Corda, we strictly control which classes can be deserialized (and, pro-actively, serialized) by insisting that each (de)serializable class is part of an allow list of allowed classes.
To add a class to the allow list, you must use either of the following mechanisms:
- Add the
@CordaSerializable
annotation to the class. This annotation can be present on the class itself, on any super class of the class, on any interface implemented by the class or its super classes, or any interface extended by an interface implemented by the class or its super classes. - Add a custom serializer for the class
There is also a built-in Corda allow list that allows common JDK classes for convenience. This allow list is not user-editable.
The annotation is the preferred method for allowing serialization. Here is an example of both ways you can do this for a couple of example classes.
// Not annotated, so need to add custom serializer.
data class ExampleRESTValue(val foo: String)
// Annotated, so no need to add custom serializer.
@CordaSerializable
data class ExampleRESTValue2(val bar: Int)
// Define a custom serializer for ExampleRESTValue
class ExampleRESTValueSerializer : SerializationCustomSerializer<ExampleRESTValue, String> {
override fun toProxy(obj: ExampleRESTValue): String {
return obj.foo
}
override fun fromProxy(proxy: String): ExampleRESTValue {
return ExampleRESTValue(proxy)
}
}
Runnable r = (Runnable & Serializable) () -> System.out.println("Hello World");
, or
Callable<String> c = (Callable<String> & Serializable) () -> "Hello World";
.AMQP
Corda uses an extended form of AMQP 1.0 as its binary wire protocol.
Corda serialisation is currently used for:
- Passing objects between the flow worker A worker that runs the CorDapp application code and translates flow API calls into function requests to the relevant workers. The flow workers are designed to share work between themselves and to record checkpoints at each stage of the application's progress, so that in the event of worker failure, the operations can be retried. and database worker A worker that connects to, manages, and operates upon the database(s) used by the Corda cluster. This includes the cluster-level database schemas needed to store configuration data for the cluster, but also the separate databases/schemas used by each virtual node.
For the checkpointing of flows Corda uses a private scheme that is subject to change. It is currently based on the Kryo framework, but this may not be true in future.
This separation of serialization schemes into different contexts allows us to use the most suitable framework for that context rather than attempting to force a one-size-fits-all approach. Kryo is more suited to the serialization of a program’s stack frames, as it is more flexible than our AMQP framework in what it can construct and serialize. However, that flexibility makes it exceptionally difficult to make secure. Conversely, our AMQP framework allows us to work with a safer, secure framework with far fewer security holes.
Selection of serialization context should, for the most part, be opaque to CorDapp Corda Distributed Application. A Java (or any JVM targeting language) application built using the Corda build toolchain and CorDapp API to solve some problem that is best solved in a decentralized manner. Developers, the Corda framework selecting the correct context as configured.
This document describes what is currently and what will be supported in the Corda AMQP format from the perspective of CorDapp Developers, to allow CorDapps to take into consideration the future state. The AMQP serialization format will continue to apply the allow list functionality that is already in place and described in this page.
Core Types
This section describes the classes and interfaces that the AMQP serialization format supports.
Collection Types
The following collection types are supported. Any implementation of the following will be mapped to an implementation of the interface or class on the other end. For example, if you use a Guava implementation of a collection, it will deserialize as the primitive collection type.
The types of properties should only be of these types, and not any concrete implementation types (for example, Guava). Collections must specify their generic type(s), the generic type parameters will be included in the schema, and the collection’s contained objects will be checked against the generic parameter(s) type(s) when deserialized.
java.util.Collection
java.util.List
java.util.Set
java.util.SortedSet
java.util.NavigableSet
java.util.NonEmptySet
java.util.Map
java.util.SortedMap
java.util.NavigableMap
However, as a convenience, we explicitly support the concrete implementation types listed below that are AMQP serializable, and therefore they can be used as the declared types of properties.
java.util.LinkedHashMap
java.util.TreeMap
java.util.EnumSet
java.util.EnumMap (but only if there is at least one entry)
JVM Primitives
All the primitive types are supported.
boolean
byte
char
double
float
int
long
short
Arrays
Arrays of any type are supported, primitive or otherwise.
JDK Types
The following JDK library types are supported:
java.io.InputStream
java.lang.Boolean
java.lang.Byte
java.lang.Character
java.lang.Class
java.lang.Double
java.lang.Float
java.lang.Integer
java.lang.Long
java.lang.Short
java.lang.StackTraceElement
java.lang.String
java.lang.StringBuffer
java.math.BigDecimal
java.math.BigInteger
java.security.PublicKey
java.security.cert.CertPath
java.security.cert.X509CRL
java.security.cert.X509Certificate
java.time.DayOfWeek
java.time.Duration
java.time.Instant
java.time.LocalDate
java.time.LocalDateTime
java.time.LocalTime
java.time.Month
java.time.MonthDay
java.time.OffsetDateTime
java.time.OffsetTime
java.time.Period
java.time.YearMonth
java.time.Year
java.time.ZonedDateTime
java.time.ZonedId
java.time.ZoneOffset
java.util.BitSet
java.util.Currency
java.util.Optional
java.util.UUID
javax.security.auth.x500.X500Principal
Third-Party Types
The following 3rd-party types are supported:
kotlin.Unit
kotlin.Pair
Corda Types
Any classes and interfaces in the Corda codebase annotated with @CordaSerializable
are supported.
All Corda exceptions that are expected to be serialized inherit from CordaThrowable
via
CordaRuntimeException
(for unchecked exceptions). Any Throwable
that is serialized but does
not conform to CordaThrowable
will be converted to a CordaRuntimeException
, with the original exception type
and other properties retained within it.
Custom Types
Your own types must adhere to the following rules to be supported:
Classes
General Rules
- The class must be compiled with parameter names included in the
.class
file. This is the default in Kotlin but must be turned on in Java using the-parameters
command line option tojavac
In circumstances where classes cannot be recompiled, such as when using a third-party library, a proxy serializer can be used to avoid this problem. Details on creating such an object can be found on the Pluggable Serializers for CorDapps page. - The class must be annotated with
@CordaSerializable
- The declared types of constructor arguments, getters, and setters must be supported, and where generics are used, the
generic parameter must be a supported type, an open wildcard (
*
), or a bounded wildcard which is currently widened to an open wildcard - Any superclass must adhere to the same rules, but can be abstract
- Object graph cycles are not supported, so an object cannot refer to itself, directly or indirectly
Constructor Instantiation
The primary way Corda’s AMQP serialization framework instantiates objects is via a specified constructor. This is used to first determine which properties of an object are to be serialised, then, on deserialization, it is used to instantiate the object with the serialized values.
R3 recommends that serializable objects in Corda adhere to the following rules, as they allow immutable state objects to be deserialised:
- A Java Bean getter for each of the properties in the constructor, with a name of the form
getX
. For example, for a constructor parameterfoo
, there must be a getter calledgetFoo()
. Iffoo
is a boolean, the getter may optionally be calledisFoo()
. This is why the class must be compiled with parameter names turned on. - A constructor which takes all of the properties that you wish to record in the serialized form. This is required in order for the serialization framework to reconstruct an instance of your class
- If more than one constructor is provided, the serialization framework needs to know which one to use. The
@ConstructorForDeserialization
annotation can be used to indicate which one. For a Kotlin class, without the@ConstructorForDeserialization
annotation, the primary constructor will be selected
In Kotlin, this maps cleanly to a data class where there getters are synthesized automatically. For example, suppose we have the following data class:
data class Example (val a: Int, val b: String)
Properties a
and b
will be included in the serialised form.
However, properties not mentioned in the constructor will not be serialised. For example, in the following code,
property c
will not be considered part of the serialised form:
data class Example (val a: Int, val b: String) {
var c: Int = 20
}
var e = Example (10, "hello")
e.c = 100;
val e2 = e.serialize().deserialize() // e2.c will be 20, not 100!!!
Setter Instantiation
As an alternative to constructor-based initialisation, Corda can also determine the important elements of an object by inspecting the getter and setter methods present on the class. If a class has only a default constructor and properties then the serializable properties will be determined by the presence of both a getter and setter for that property that are both publicly visible (i.e. the class adheres to the classic idiom of mutable JavaBeans).
On deserialization, a default instance will first be created, and then the setters will be invoked on that object to populate it with the correct values.
For example:
class Example {
var a: Int = 0
var b: Int = 0
var c: Int = 0
}
class Example {
private int a;
private int b;
private int c;
public int getA() { return a; }
public int getB() { return b; }
public int getC() { return c; }
public void setA(int a) { this.a = a; }
public void setB(int b) { this.b = b; }
public void setC(int c) { this.c = c; }
}
Inaccessible Private Properties
The Corda AMQP serialization framework does not support private properties without publicly accessible getter methods. The below example fails at serialization.
class C(val a: Int, private val b: Int)
class C {
private Integer a;
private Integer b;
public C(Integer a, Integer b) {
this.a = a;
this.b = b;
}
public Integer getA() {
return a;
}
}
When designing Corda states, it should be remembered that they are not, despite appearances, traditional OOP style objects. They are signed over, transformed, serialised, and relationally mapped. As such, all elements should be publicly accessible by design.
Providing a public getter for property C.b
, as per the following example, makes type C
AMQP serializable:
class C(val a: Int, val b: Int)
class C {
private Integer a;
private Integer b;
C(Integer a, Integer b) {
this.a = a;
this.b = b;
}
public Integer getA() {
return a;
}
public Integer getB() {
return b;
}
}
Mismatched Class Properties and Constructor Parameters
Consider an example where you wish to ensure that a property of class whose type is some form of container is always sorted using some specific criteria yet you wish to maintain the immutability of the class.
This could be codified as follows:
@CordaSerializable
class ConfirmRequest(statesToConsume: List<StateRef>, val transactionId: SecureHash) {
companion object {
private val stateRefComparator = compareBy<StateRef>({ it.txhash }, { it.index })
}
private val states = statesToConsume.sortedWith(stateRefComparator)
}
The intention in the example is to always ensure that the states are stored in a specific order regardless of the ordering
of the list used to initialise instances of the class. This is achieved by using the first constructor parameter as the
basis for a private member
Corda identity that has been granted admission to a membership group. Synonym for a virtual node or group member.
. However, because that member is not mentioned in the constructor (whose parameters determine
what is serializable as discussed above) it would not be serialized. In addition, as there is no provided mechanism to retrieve
a value for statesToConsume
we would fail to build a serializer for this Class.
In this case a secondary constructor annotated with @ConstructorForDeserialization
would not be a valid solution as the
two signatures would be the same. Best practice is thus to provide a getter for the constructor parameter which explicitly
associates it with the actual member variable.
@CordaSerializable
class ConfirmRequest(statesToConsume: List<StateRef>, val transactionId: SecureHash) {
companion object {
private val stateRefComparator = compareBy<StateRef>({ it.txhash }, { it.index })
}
private val states = statesToConsume.sortedWith(stateRefComparator)
//Explicit "getter" for a property identified from the constructor parameters
fun getStatesToConsume() = states
}
Mutable Containers
Because Java fundamentally provides no mechanism by which the mutability of a class can be determined this presents a problem for the serialization framework. When reconstituting objects with container properties (lists, maps, etc) we must chose whether to create mutable or immutable objects. Given the restrictions, we have decided it is better to preserve the immutability of immutable objects rather than force mutability on presumed immutable objects.
data class C(val l : MutableList<String>)
val bytes = C(mutableListOf ("a", "b", "c")).serialize()
val newC = bytes.deserialize()
newC.l.add("d")
The call to newC.l.add
will throw an UnsupportedOperationException
.
There are several workarounds that can be used to preserve mutability on reconstituted objects. Firstly, if the class is not a Kotlin data class and thus is not restricted by having to have a primary constructor.
class C {
val l : MutableList<String>
@Suppress("Unused")
constructor (l : MutableList<String>) {
this.l = l.toMutableList()
}
}
val bytes = C(mutableListOf ("a", "b", "c")).serialize()
val newC = bytes.deserialize()
// This time this call will succeed
newC.l.add("d")
Secondly, if the class is a Kotlin data class, a secondary constructor can be used.
data class C (val l : MutableList<String>){
@ConstructorForDeserialization
@Suppress("Unused")
constructor (l : Collection<String>) : this (l.toMutableList())
}
val bytes = C(mutableListOf ("a", "b", "c")).serialize()
val newC = bytes.deserialize()
// This will also work
newC.l.add("d")
Thirdly, to preserve immutability of objects (a recommend design principle - Copy on Write semantics) then mutating the contents of the class can be done by creating a new copy of the data class with the altered list passed (in this example) passed in as the Constructor parameter.
data class C(val l : List<String>)
val bytes = C(listOf ("a", "b", "c")).serialize()
val newC = bytes.deserialize()
val newC2 = newC.copy (l = (newC.l + "d"))
init
block reassigning the property
to a mutable instanceEnums
All enums are supported, provided they are annotated with @CordaSerializable
. Corda supports interoperability of
enumerated type versions. This allows such types to be changed over time without breaking backward (or forward)
compatibility. The rules and mechanisms for doing this are discussed in Enum Evolution.
Exceptions
The following rules apply to supported Throwable
implementations.
- If you wish for your exception to be serializable and transported type safely it should inherit from
CordaRuntimeException
- If not, the
Throwable
will deserialize to aCordaRuntimeException
with the details of the originalThrowable
contained within it, including the class name of the originalThrowable
Kotlin Objects
Kotlin’s non-anonymous object
s (i.e. constructs like object foo : Contract {...}
) are singletons and
treated differently. They are recorded into the stream with no properties, and deserialize back to the
singleton instance. Currently, the same is not true of Java singletons, which will deserialize to new instances
of the class. This is hard to fix because there’s no perfectly standard idiom for Java singletons.
Kotlin’s anonymous object
s (i.e. constructs like object : Contract {...}
) are not currently supported
and will not serialize correctly. They need to be re-written as an explicit class declaration.
Type Evolution
Type evolution is the mechanism by which classes can be altered over time yet still remain serializable and deserializable across all versions of the class. This ensures an object serialized with an older idea of what the class “looked like” can be deserialized and a version of the current state of the class instantiated. You can learn more in Default Class Evolution.
Was this page helpful?
Thanks for your feedback!
Chat with us
Chat with us on our #docs channel on slack. You can also join a lot of other slack channels there and have access to 1-on-1 communication with members of the R3 team and the online community.
Propose documentation improvements directly
Help us to improve the docs by contributing directly. It's simple - just fork this repository and raise a PR of your own - R3's Technical Writers will review it and apply the relevant suggestions.
We're sorry this page wasn't helpful. Let us know how we can make it better!
Chat with us
Chat with us on our #docs channel on slack. You can also join a lot of other slack channels there and have access to 1-on-1 communication with members of the R3 team and the online community.
Create an issue
Create a new GitHub issue in this repository - submit technical feedback, draw attention to a potential documentation bug, or share ideas for improvement and general feedback.
Propose documentation improvements directly
Help us to improve the docs by contributing directly. It's simple - just fork this repository and raise a PR of your own - R3's Technical Writers will review it and apply the relevant suggestions.