Hi folks,

Replying to DJB's email but not really in direct response to him.

I'm not a cryptographer and don't have a strong opinion on the technical merits of X-wing in particular, but I've been following this thread (lots of messages) and was hoping to summarize what I think is common ground and locate the points of contention.

Does anyone disagree with the following claims?

- X-wing depends on certain properties of ML-KEM which aren't generically true of all KEMs, so if we just substituted in another KEM it might not be safe (assuming for the purposes of this discussion that sntrup761 and X25519 behave as advertised).
- When used in TLS 1.3, this dependency is relaxed because TLS 1.3 hashes the whole transcript into the key.
- The combiner that DJB proposes would also relax this dependency at some performance cost (see below) and should otherwise be as safe as X-wing.

I also think that people agree on the design principle that CFRG should propose a specific set of hybrid algorithms that are safe to use in most if not all contexts and that do not require users to be aware of the specific internal details of how they are put together. So, for instance, if CFRG were to specify two hybrids, X25519/ML-KEM and X448/ML-KEM, it shouldn't matter to protocols whether these use the same or different combiners or other internal structure.

Similarly, I think people agree that even if we have a generic combiner that CFRG uses for multiple input algorithms, we should discourage people from using it with novel algorithms (though people will probably do this whether or not CFRG publishes such a combiner).

Hopefully, this brings us to the point(s?) of disagreement. Unless I've misunderstood, the questions are: (1) whether it would be better to use a somewhat safer generic combiner even when we don't believe it's necessary, even at some performance cost; (2) if the answer to (1) is "yes", what cost would be acceptable; and (3) what the actual incremental cost of doing that is.

Is this a fair summary?

-Ekr

On Mon, Jan 29, 2024 at 7:12 AM D. J. Bernstein <d...@cr.yp.to> wrote:

> Ilari Liusvaara writes:
> > Security review of X-wing only needs to be done once.
>
> "Of course we hope that any particular piece of security review can be done just once and that's the end of it (OK Google, please read once through the Chrome source code and remove the buffer overflows), but the bigger picture is that many security failures are easily explained by reviewer overload, so minimizing the complexity of what has to be reviewed is a useful default policy. Minimizing TCBs is another example of the same approach."
>
> > Generic combiner would likely lead to lots of combinations.
>
> What does "generic" mean here? What's the mechanism that will have a "generic" combiner triggering "lots of combinations", and why _won't_ the same mechanism apply to QSF---the construction presented in the X-Wing paper, used in X-Wing, and labeled in the paper as "generic"?
>
> Some arguments seem to be starting from the idea that X-Wing _isn't_ using a "generic" combiner---but the paper says it is. People seem to be using multiple incompatible meanings of the word "generic" here. How about we just stop using the word? Any valid argument will survive elimination of ambiguous terminology.
>
> The actual technical difference is between (A) combiners making security guarantees assuming the input KEM is IND-CCA2 and (B) combiners making security guarantees assuming the input KEM is IND-CCA2 and "ciphertext collision resistant". (Plus further assumptions in B if the goals of https://eprint.iacr.org/2021/708 and https://eprint.iacr.org/2021/1351 are within scope. The specific A proposals that we're looking at handle those goals automatically, since they hash the full public keys.)
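
To make sure I have the (A)/(B) distinction right, here is a rough Python sketch of the two styles of combiner as I understand them. This is not the exact X-Wing or Chempat-X construction; the labels and argument names are made up, and all inputs are assumed to be fixed-length byte strings:

    import hashlib

    def combine_style_b(ss_pq, ss_ecdh, ct_ecdh, pk_ecdh):
        # Style (B): the PQ ciphertext is *not* hashed in. This is only safe
        # when the PQ KEM is "ciphertext collision resistant" (true of ML-KEM,
        # not guaranteed for an arbitrary KEM someone substitutes later).
        return hashlib.sha3_256(
            b"example-label-B" + ss_pq + ss_ecdh + ct_ecdh + pk_ecdh).digest()

    def combine_style_a(ss_pq, ss_ecdh, ct_pq, ct_ecdh, pk_pq, pk_ecdh):
        # Style (A): hash the full transcript (both shared secrets, both
        # ciphertexts, both public keys), so IND-CCA2 of the inputs suffices.
        return hashlib.sha3_256(
            b"example-label-A" + ss_pq + ss_ecdh +
            ct_pq + ct_ecdh + pk_pq + pk_ecdh).digest()

If that is a fair caricature, then the rest of the argument below is largely about whether hashing the extra inputs in style (A) is worth the cost.
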
> > And I think it much easier to understand integrated thing that is unsafe to change than that the component KEMs must be <some-gibberish> (oh, and then being used with stuff that fails the requirement).
>
> I agree that what's presented to applications (or to higher-level protocols such as TLS) should be a full combined KEM reviewed by CFRG. All of the proposals on the table (original X-Wing; X-Wing with the modifications that I recommended; Chempat-X) are full combined KEMs.
>
> CFRG approval (or approval by a WG such as TLS) is always for exactly the specified mechanism. I don't object to systematically saying so in CFRG RFCs---but I also see no evidence that _implementors who read the RFCs_ are misunderstanding this in the first place.
>
> Meanwhile there's ample evidence of implementors doing things that specs _don't_ allow. Paying attention to how this happens often allows it to be predicted and mitigated. This is better---for the end users, which is what matters!---than blaming implementors.
>
> Example: there are books and papers stating variable-width ECC ladders (for obvious reasons: this works for all scalars without the width as an extra parameter). Telling implementors to use fixed-width ladders sometimes works, but what if the implementors aren't listening?
>
> X25519 arranges ECC secret keys to have the leading 1 at a fixed position. Ed25519 uses double-length nonces. Sure, these protections aren't guaranteed to be used (implementors without good tests aren't _forced_ to use the 1; implementors often reduce nonces mod the group order), but common sense says that they reduce risks, and subsequent ecosystem observations (https://blog.cr.yp.to/20191024-eddsa.html) showed that this proactive strategy was successful.
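
For anyone who hasn't seen it, the "leading 1 at a fixed position" is the standard X25519 scalar clamping from RFC 7748; roughly:

    def clamp_x25519_scalar(k: bytes) -> bytes:
        # RFC 7748 decodeScalar25519: clear the low 3 bits, clear bit 255,
        # and set bit 254, so every secret scalar has the same bit length.
        assert len(k) == 32
        e = bytearray(k)
        e[0] &= 248    # scalar becomes a multiple of the cofactor 8
        e[31] &= 127   # clear the top bit
        e[31] |= 64    # the "leading 1 at a fixed position"
        return bytes(e)

As I read the argument, the point is that even a naive variable-width ladder then runs the same number of iterations for every key, so implementors who ignore the fixed-width advice still get a measure of protection.
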
> See https://cr.yp.to/talks.html#2015.01.07 for more examples, starting with the Sony PS3 disaster, which was predicted by Rivest in 1992: "The poor user is given enough rope with which to hang himself---something a standard should not do." Sure, NIST never admitted responsibility for the disaster (NIST hides behind the standard clearly saying nonces have to be random), but we can and should do better than that.
>
> Similarly, making sure that combiners hash the full transcript, so that implementors plugging in KEMs that don't do this are protected, is more robust than warning implementors not to plug in those KEMs.
>
> > And things are made worse by different protocols tending to do things subtly differently even when using the same KEMs.
>
> Certainly having multiple combiners adds to the security-review load. This argument favors having just one combiner across protocols _and_ having just one combiner across underlying KEMs. What's the contrary argument?
>
> > None of this is theoretical risk, this all has been seen in practice.
>
> Unlike unsafe modified versions of X-Wing.
>
> I do not agree that we should be waiting to see X-Wing-derived failures before taking steps to prevent those failures.
>
> > > In context, this speedup is negligible: about 1% of the cost of communicating that data in the first place, never mind other application costs.
>
> > The cost is not negligible.
>
> Sorry, can you clarify which step you're disagreeing with?
>
> https://cr.yp.to/papers.html#pppqefs includes dollar costs for Internet communication (roughly 2^-40 dollars/byte) and for CPU cycles (roughly 2^-51 dollars/cycle). Cycle counts for hashing (under 2^4 cycles/byte) are readily available from https://bench.cr.yp.to. Combining these numbers produces the 1% figure (2^-47/2^-40). 1% is negligible.
>
> There was also a question about security tokens, where I laid out details of a calculation concluding that communication takes 82ms (plus lower-layer overhead) while this hashing takes 2ms. The CPUs there cost roughly $1, a smaller fraction of the token cost than desktop CPUs inside desktop computers.
>
> > If somebody starts blindly replacing algorithms, there is much higher risk of actual security problem caused by choosing bad algorithms than omitting some hashing.
>
> How much higher? Where is that number coming from? Why is "blindly" a useful model for real-world algorithm choice?
>
> To be clear, I'm also concerned about the risk of people plugging in KEMs that turn out to be breakable. The basic reason we're applying double encryption is that, as https://kyberslash.cr.yp.to illustrates, this is a serious risk even for NIST's selected KEM! But I don't see how that risk justifies picking a combiner that's unnecessarily fragile and unnecessarily hard to review.
>
> > No real attacker is looking how to break IND-CCA. They are looking at how to attack actual systems. The value of IND-CCA is that despite unrealistically strong attack model, most algorithms that fail it tend to fail in rather dangerous ways. Which is not at all clear for combiner with insufficient hashing.
>
> What's the source for these statistics? Is the long series of Bleichenbacher-type vulnerabilities, including
>
>    https://nvd.nist.gov/vuln/detail/CVE-2024-23218
>
> from Apple a week ago, being counted as just one failure?
>
> I agree that the security goals we're aiming for here aren't as _obviously_ necessary as, e.g., stopping attackers from figuring out secret keys. But I'd be horrified to see a car manufacturer or plane manufacturer saying "We don't really have to worry whether our seatbelts work since they're only for disasters". We should be assuming there's a disaster and making sure our seatbelts do what they're supposed to do.
>
> ---D. J. Bernstein
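
Purely to check my own understanding of the 1% figure above, here is the arithmetic with the quoted estimates plugged in (these are DJB's figures, not my own measurements):

    # Quoted estimates: ~2^-40 dollars per byte communicated, ~2^-51 dollars
    # per CPU cycle, and hashing at under 2^4 cycles per byte.
    dollars_per_byte_comm = 2.0 ** -40
    dollars_per_cycle = 2.0 ** -51
    cycles_per_byte_hash = 2.0 ** 4

    dollars_per_byte_hash = dollars_per_cycle * cycles_per_byte_hash  # 2^-47
    ratio = dollars_per_byte_hash / dollars_per_byte_comm             # 2^-7
    print(f"hashing cost / communication cost ratio: {ratio:.4f}")    # ~0.0078

So the ratio comes out to about 2^-7, a bit under 1%, which matches the figure quoted; the disagreement seems to be about whether that is negligible, not about the arithmetic.
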
_______________________________________________
TLS mailing list
TLS@ietf.org
https://www.ietf.org/mailman/listinfo/tls