aboutsummaryrefslogtreecommitdiff
path: root/README.md
blob: a05518843575c45f410c02898877525f9fb38009 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85

   

ecdsa384

Core Description

This core implements the scalar base point multiplier for ECDSA curve P-384. It can be used during generation of public keys, the core can also be used as part of the signing operation.

API Specification

The core interface is similar to other Cryptech cores. FMC memory map looks like the following:

0x0000 | NAME0 0x0004 | NAME1 0x0008 | VERSION

0x0020 | CONTROL 0x0024 | STATUS

0x0100 | K0 0x0104 | K1 ... 0x012C | K11

0x0140 | X0 0x0144 | X1 ... 0x017C | X11

0x0180 | Y0 0x0184 | Y1 ... 0x01AC | Y11

The core has the following registers:

  • NAME0, NAME1 Read-only core name ("ecdsa384").

  • VERSION Read-only core version, currently "0.11".

  • CONTROL Control register bits: [31:2] Don't care, always read as 0 [1] "next" control bit [0] Don't care, always read as 0 The core starts multiplication when the "next" control bit changes from 0 to 1. This way when the bit is set, the core will only perform one multiplication and then stop. To start another operation, the bit must be cleared at first and then set to 1 again.

  • STATUS Read-only status register bits: [31:2] Don't care, always read as 0 [1] "valid" control bit [0] "ready" control bit (always read as 1) The "valid" control bit is cleared as soon as the core starts operation, and gets set after the multiplication operations is complete. Note, that unlike some other Cryptech cores, this core doesn't need any special initialization, so the "ready" control bit is simply hardwired to always read as 1. This is to keep general core interface consistency.

  • K0-K11 Buffer for the 384-bit multiplication factor (multiplier) K. The core will compute Q = K * G (the base point G is the multiplicand). K0 is the least significant 32-bit word of K, i.e. bits [31:0], while K11 is the most significant 32-bit word of K, i.e. bits [383:352].

  • X0-X11, Y0-Y11 Buffers for the 384-bit coordinates X and Y of the product Q = K * G. Values are returned in affine coordinates. X0 and Y0 contain the least significant 32-bit words, i.e. bits [31:0], while X11 and Y11 contain the most significant 32-bit words, i.e. bits [383:352].

Implementation Details

The top-level core module contains block memory buffers for input and output operands and the base point multiplier, that reads from the input buffer and writes to the output buffers.

The base point multiplier itself consists of the following: * Buffers for storage of temporary values * Configurable "worker" unit * Microprograms for the worker unit * Multi-word mover unit * Modular inversion unit

The "worker" unit can execute five basic operations: * comparison * copying * modular addition * modular subtraction * modular multiplications

There are two primary microprograms, that the worker runs: curve point doubling and addition of curve point to the base point. Those microprograms use projective Jacobian coordinates, so one more microprogram is used to convert the product into affine coordinates with the help of modular inversion unit.

Note, that the core is supplemented by a reference model written in C, that has extensive comments describing tricky corners of the underlying math.

Vendor-specific Primitives

Cryptech Alpha platform is based on Xilinx Artix-7 200T FPGA, so this core takes advantage of Xilinx-specific DSP slices to carry out math-intensive operations. All vendor-specific math primitives are placed under /rtl/lowlevel/artix7, the core also offers generic replacements under /rtl/lowlevel/generic, they can be used for simulation with 3rd party tools, that are not aware of Xilinx-specific stuff. Selection of vendor/generic primitives is done in ecdsa_lowlevel_settings.v, when porting to other architectures, only those four low-level modules need to be ported.