# External definitions A lot of modern blockchains support _tokens_ tradeable on top of the base chain. For proper support in Trezor, we need to know parameters of the tokens that are not part of the signed data -- most typically, the name, currency symbol and the number of decimal places. Similarly, the "Ethereum" implementation in Trezor actually supports any EVM chain, but we again need some identifying data about the chain in order to display amounts in an user-friendly way. Currently, Trezor has the capability to load external definitions of: * EVM chains (networks), identified by their chain ID, * ERC20 tokens, identified by the chain ID and token address, * Solana tokens, identified by the mint account. ## Built-in definitions A subset of definitions for the most common EVM chains and ERC20 tokens is baked into the firmware image. The set of built-in definitions is declared in the following files: * networks - [`networks.json`](https://github.com/trezor/trezor-firmware/blob/main/common/defs/ethereum/networks.json) * tokens - [`tokens.json`](https://github.com/trezor/trezor-firmware/blob/main/common/defs/ethereum/tokens.json) These definitions need to be modified manually. ## External definitions A full list of definitions is compiled from multiple sources and is available [in a separate repository](https://github.com/trezor/definitions). From this list, a collection of binary blobs is generated, signed, and made available online. A given Trezor firmware will only accept signed definitions newer than a certain date, typically one month before firmware release. This means that a client application should either always fetch fresh definitions from the official URLs, or refresh its local copy frequently. ## Retrieving the definitions The base URL for the definitions is `https://data.trezor.io/firmware/definitions/`. ### EVM ecosystem #### Chain ID is known To look up a network definition by its chain ID, use the following URL: `https://data.trezor.io/firmware/definitions/eth/chain-id//network.dat` `` is a decimal number, e.g., `1` for Ethereum mainnet. To look up a token definition for a given chain ID and token address, use the following URL: `https://data.trezor.io/firmware/definitions/eth/chain-id//token-.dat` `` is again a decimal number.
`` is all lowercase (no checksum) token address hex without the `0x` prefix. E.g., this is the URL for Görli TST token: [https://data.trezor.io/firmware/definitions/eth/chain-id/5/token-7af963cf6d228e564e2a0aa0ddbf06210b38615d.dat] #### Chain ID is not known Certain Ethereum calls, such as `EthereumGetAddress` and `EthereumSignMessage`, do not require the caller to know the chain ID, because their results do not depend on it. For this situation, it is possible to look up a network definition by a SLIP-44 identifier on the following URL: `https://data.trezor.io/firmware/definitions/eth/slip44//network.dat` `` is a decimal number, e.g., `60` for Ethereum mainnet. In some cases, multiple network definitions can be registered for the same SLIP-44 number. The retrieved definition is valid for an unspecified one of those colliding networks. This does not matter for purposes of `EthereumGetAddress` and the like, because the information in the network definition is only used to prove validity of the derivation path. When using Ethereum's SLIP-44 number 60 in the derivation path, the caller does not need to provide the network definition, because Ethereum network is always built-in. ### Solana To look up a token definition for a given token mint account, use the following URL: `https://data.trezor.io/firmware/definitions/solana/token/.dat` `` is base58-encoded mint account of the token, e.g., `So11111111111111111111111111111111111111112`. ### Full set of definitions It is possible to download the full set of signed definitions in a single tar archive from the following URL: [`https://data.trezor.io/firmware/definitions/definitions.tar.xz`](https://data.trezor.io/firmware/definitions/definitions.tar.xz). ## Definition format Each definition is encoded as a protobuf message specified in the file [`messages-definitions.proto`](https://github.com/trezor/trezor-firmware/blob/main/common/protob/messages-definitions.proto) and packaged in the following binary format. All numbers are unsigned little endian. 1. magic string `trzd1` (5 bytes) 2. definition type according to `DefinitionType` enum (1 byte) 3. data version of the definition (4 bytes) 4. protobuf payload length (2 bytes) 5. protobuf payload (N bytes) A Merkle tree is constructed from all binary definitions (see below) and its root is signed by the CoSi algorithm. The full format of the definition is as follows: 1. Data payload (see above) 2. Number of Merkle proof entries (1 byte) 3. Sequence of 32-byte proof entries (N * 32 bytes) 4. CoSi sigmask (1 byte) 5. CoSi signature (64 bytes) ### Merkle tree algorithm The input for the Merkle tree calculation is a collection of binary values. 1. For each entry, calculate a _leaf hash_: `SHA256(0x00 || entry)`, with `||` denoting string concatenation. 2. Sort the leaf hashes lexicographically in ascending order. This is the base level of a binary tree. 3. For each level of the tree, build the next level by taking a pair of entries from the left and calculating an _internal hash_: a. Set `min` to the smaller of the two entries, and `max` to the larger one. b. The internal hash is `SHA256(0x01 || min || max)`. 4. If there is a left-over odd entry, append it to the end of the next level. 5. Continue until there is only one entry left. This is the root hash. For each leaf, its proof is a sequence of neighbor hashes going up the tree. One way to keep track of the proof is, whenever constructing an internal node, add the right hash to the left child's proof list and vice versa. A [reference implementation](https://github.com/trezor/trezor-firmware/blob/main/python/src/trezorlib/merkle_tree.py) is provided. ## Data sources External Ethereum definitions are generated based on data from external APIs and repositories: * [CoinGecko](https://www.coingecko.com/) for most of the info about networks and tokens * [defillama](https://defillama.com/) to pair as much networks as we can to CoinGecko ID * [Ethereum Lists - chains](https://github.com/ethereum-lists/chains) as the only source of EVM-based networks * [Ethereum Lists - tokens](https://github.com/ethereum-lists/tokens) as another source of tokens