sw/libhal - Cryptech libhal: crypto software, HSM management, RPC

Age	Commit message (Collapse)	Author
2018-04-02	First cut at reusing RSA blinding factors.rsa-blind-mutation	Rob Austein
	General technique here suggested by Peter Gutman. If I got the math wrong, blame me, not Peter. This has not yet been tested to confirm that it returns correct results when using the blinding factors cache, and preliminary timing results suggest that we may be chasing the wrong performance problem. Unclear whether we'll ever really want to integrate this change, but pushing it on a branch to get it into repository history. If we do end up using this, the blinding factors cache will need minor redesign, principally to use the external SDRAM because main memory has gotten kind of full. Some way to clear the cache when restarting the HSM would be nice, probably requires a `hal_rsa_init()` function.
2018-03-30	More accurate timing.	Rob Austein

2018-03-30	Tweak report output.	Rob Austein
	Copy ContextManagedUnpacker from latest version of libhal.py so that this script won't depend on the current development code.
2018-03-30	First cut at parallel signature timing test.	Rob Austein
	At the moment this only handles RSA keys, and can only handle one size of key at a time. More bells and whistles will follow eventually, now that the basic asynchronous API to our RPC protocol works.
2018-03-25	Clear search state variables in rsa.c's find_prime().	Rob Austein
	Failing to clear the temporary buffer used to transfer bits from the TRNG into a bignum was a real leak of something very close to keying material, albeit only onto the local stack where it was almost certain to have been overwritten by subsequent operations (generation of other key components, wrap and PKCS #8 encoding) before pkey_generate_rsa() ever returned to its caller. Still, bad coder, no biscuit. Failing to clear the remainders array was probably harmless, but doctrine says clear it anyway.
2018-03-21	Supply our own context manager instead of using contextlib.	Rob Austein
	contextlib is cute, but incompatible with other coroutine schemes like Tornado, so just write our own context manager for xdrlib.Unpacker.
2018-01-04	Add hal_core_alloc2() to fix a dining philosophers problem in hal_modexp2().	Paul Selkirk
	Uncoordinated attempts to allocate two modexpa7 cores leads to deadlock if multiple clients try to do concurrent RSA signing operations. The simplest solution (back off and retry) could theoretically lead to resource starvation, but we haven't seen it in actual testing.
2017-12-13	Merge branch systolic_crt into master.	Rob Austein
	This branch was sitting for long enough that master had been through a cleanup pass, so beware of accidental reversions.
2017-10-27	~0 is actually more correct, or more portable	Paul Selkirk

2017-10-23	Cleanup signed/unsigned mismatches, mostly in loop counters	Paul Selkirk

2017-10-15	Correctly mark "unsecure" KEK as erased.	Paul Selkirk

2017-10-15	Cleanup: keystore driver functions return HAL_StatusTypeDef instead of magic ↵	Paul Selkirk
	values.
2017-10-11	Cleanup: Remove "const" qualifiers from function return types.	Paul Selkirk

2017-09-15	4096-bit RSA keys working again, with 8k keystore "blocks".	Rob Austein

2017-09-13	Oops, forgot hal_io.c.	Rob Austein

2017-09-13	Preliminary support for parallel core RSA CRT.	Rob Austein

2017-09-13	Sort-of-working, large (4096-bit) RSA keys broken.	Rob Austein
	Snapshot of mostly but not entirely working code to include the extra ModExpA7 key components in the keystore. Need to investigate whether a more compact representation is practical for these components, as the current one bloats the key object so much that a bare 4096-bit key won't fit in a single hash block, and there may not be enough room for PKCS #11 attributes even for smaller keys. If more compact representation not possible or insufficient, the other option is to double the size of a keystore object, making it two flash subsectors for a total of 8192 octets. Which would of course halve the number of keys we can store and require a bunch of little tweaks all through the ks code (particularly flash erase), so definitely worth trying for a more compact representation first.
2017-09-12	Silly macro bugs.	Rob Austein

2017-09-12	Untested ASN.1 support for ModExpA7 private speedup factors.	Rob Austein

2017-09-11	Reverse the polarity of the neutron flow	Rob Austein

2017-09-09	Far too much fun with modexpa7 operand lengths and locations.	Rob Austein

2017-09-09	Whack with club until compiles.	Rob Austein

2017-09-09	Start hacking for systolic modexp.	Rob Austein
	Work in progress. Probably won't even compile, much less run. Requires corresponding new core/math/modexpa7 core. No support (yet) for ASN.1 encoding of speedup factors or storage of same in keystore. No support (yet) for running CRT algorithm in parallel cores. Minor cleanup of ancient bus I/O code, including EIM and I2C bus code we'll probably never use again.
2017-07-31	Unconditionally set the allocated flag when initializing a hash state	Paul Selkirk
	structure. When running multiple concurrent unit tests, I observed multiple failures in the hmac tests, which I ultimately tracked down to different clients sharing the same hal_hmac_state struct. hal_hash_initialize is called twice in hal_hmac_initialize (once to get the state structure, then again if the supplied key is too long), and is called in hal_hmac_finalize, to hash the digest with the supplied key. In these subsequent cases, the caller supplies the state structure, which hal_hash_initialize zeroes, but it doesn't set the allocated flag. This marks an in-use struct as available, so it gets reassigned and reinitialized, and Bad Things Happen for both clients that are trying to use it.
2017-07-24	Split compile-time control of RSA ModExp.	Rob Austein
	At least for now, the speed tradeoff between software ModExp and our Verilog ModExp core differs significantly between signature and key generation. We don't really know why, but since key generation does not need to be constant time, we split out control over whether to use the software or FPGA implementation, so that we can use the FPGA for signature while using software for key generation. Revisit this if and when we figure out what the bottleneck is, as well as any time that the FPGA core itself changes significantly.
2017-07-24	Use ModExp fast mode for Miller-Rabin tests.	Rob Austein
	Trying to make RSA key generation run in constant time is probably both futile and unnecessary, so we can speed it up a bit by switching the ModExpA7 core to use "fast" mode rather than "constant time" mode. Sadly, while this change produces a measureable improvement, it doesn't bring FGPA ModExp anywhere near the speed of the software equivalent in this case. Don't really know why.
2017-07-03	Fencepost error (1-based counting using xrange(), sigh).	Rob Austein

2017-06-27	Key generation timing.	Rob Austein
	Initial version, very basic, RSA-only. Gussy up later.
2017-06-14	Tidy up new prime generation code.	Rob Austein

2017-06-14	Faster prime generation algorithm for RSA.	Rob Austein
	Algorithm suggested by a note in Handbook of Applied Cryptography, motivated by profiling of libtfm fp_isprime() function showing something close to 50% of CPU time spent running Montgomery reductions in the small primes test, before we even get to Miller-Rabin.
2017-06-11	Flesh out digest and HMAC tests.	Rob Austein

2017-06-11	None is more Pythonic than "" as indicator for "no key"	Rob Austein

2017-06-08	Merge branch 'ks9'	Rob Austein

2017-06-08	Document the keystore design.	Rob Austein

2017-06-04	Include file dependencies were badly out of date.	Rob Austein

2017-06-04	Tweak CRC input to be backwards compatabile with ksng.	Rob Austein
	Except for torture tests, we never really used the hideously complex multi-block capabilities of the ksng version of the flash keystore, among other reasons because the only keys large enough to trigger the multi-block code were slow enough to constitute torture on their own. So we can preserve backwards compatabliity simply by including the former chunk fields (renamed legacy here) in the CRC and checking for the expected single-block key values. We probably want to include everything in the CRC in any case except when there's an explicit reason omit something, so, this is cheap, just a bit obscure. At some point in the future we can phase out support for the backwards compatible values, but there's no particular hurry about it unless we want to reuse those fields for some other purpose.
2017-06-03	Add --soft-backup option to cryptech_backup.	Rob Austein
	cryptech_backup is designed to help the user transfer keys from one Cryptech HSM to another, but what is is a user who has no second HSM supposed to do for backup? The --soft-backup option enables a mode in which cryptech_backup generates its own KEKEK instead of getting one from the (nonexistent) target HSM. We make a best-effort attempt to keep this soft KEKEK secure, by wrapping it with a symmetric key derived from a passphrase, using AESKeyWrapWithPadding and PBKDF2, but there's a limit to what a software-only solution can do here. The --soft-backup code depends (heavily) on PyCrypto.
2017-06-03	Add --soft-backup option to cryptech_backup.	Rob Austein
	cryptech_backup is designed to help the user transfer keys from one Cryptech HSM to another, but what is is a user who has no second HSM supposed to do for backup? The --soft-backup option enables a mode in which cryptech_backup generates its own KEKEK instead of getting one from the (nonexistent) target HSM. We make a best-effort attempt to keep this soft KEKEK secure, by wrapping it with a symmetric key derived from a passphrase, using AESKeyWrapWithPadding and PBKDF2, but there's a limit to what a software-only solution can do here. The --soft-backup code depends (heavily) on PyCrypto.
2017-06-02	Fix AESKeywrapWithPadding handling of very long messages.	Rob Austein
	We were XORing the low 32 bits of R[0] instead of the full 64 bits. Makes no difference for small values of n, so we never detected it.
2017-05-31	Automatic logout when client disconnects or muxd restarts.	Rob Austein
	The HSM itself should be detecting carrier drop on its RPC port, but I haven't figured out where the DCD bit is hiding in the STM32 UART API, and the muxd has to be involved in this in any case, since only the muxd knows when an individual client connection has dropped. So, for the moment, we handle all of this in the muxd.
2017-05-30	Hold keystore lock before calling keystore driver methods.	Rob Austein
	Most keystore methods already followed this rule, but hal_ks__init() and hal_ks__logout() were confused, in different ways.
2017-05-30	Missed one instance of one API change in "logout" branch merge.	Rob Austein

2017-05-30	Merge branch 'logout' into ks9	Rob Austein
	The internal keystore API has changed enough since where the "logout" branch forked that a plain merge would have no prayer of compiling, must less running. So this merge goes well beyond manual conflict resolution: it salvages the useful code from the "logout" branch, with additional code as needed to reimplement the functionality. Sorry.
2017-05-29	Adjust "bloat" unit tests for ks9 keystore parameters.	Rob Austein

2017-05-29	Missed a few references to old pkey_slot field names, oops.	Rob Austein

2017-05-29	Simplify per-session keys.	Rob Austein
	Cosmetic cleanup of pkey_slot along the way.
2017-05-29	Indentation.	Rob Austein

2017-05-29	Debug per-session keys.	Rob Austein

2017-05-28	Debug new keystore init code.	Rob Austein

2017-05-28	Almost compiles.	Rob Austein
	Need to refactor init sequence slightly (again), this time to humor the bootloader, which has its own special read-only view of the PIN block in the token keystore.