Re: [bitcoin-dev] UNCHECKED Client side coinjoin amount organization with WabiSabi

Max Hillebrand via bitcoin-dev Sat, 09 Apr 2022 01:45:09 -0700

As expected, I sent some wrong explanations regarding input selection.The coin grouping and consolidation penalty seems to be correct, but Iwas wrong about the final best group selection. Let me try to correct this.

There are often many groups which have the same consolidation penalty.In one testnet example, a 1 btc wallet with a 20% privacy level had 64coins, and when tasked to find groups of 4 coins, it found 20 groups,which all had exactly 0 anonscore consolidation penalty, meaning allinputs had the same anonscore. All groups with the lowest consolidationpenalty advance to the next step. Notice however, there could be onlyone group with the lowest penalty, then the following would bedeterministic.

For all groups with the lowest consolidation penalty, we find out howmany of its coins come from the same previous transaction. The list ofgroups gets shuffled, then sorted ascending by count of same transactioncoins, and we pick the top one. There will likely be many groups with nosame transaction inputs, and as the list is shuffled, we pick randomlyone of them.

To summarize, the input count is a biased random choice. In some cases,especially for wallets with low utxo count, there is only one goodgroup, so the input selection is deterministic. However, often there aremany possible input groups with low consolidation penalty and low sametransaction count, and in these cases there is another random choice ofwhich inputs get registered. So even if the adversary knows the entirewallets utxo set and anonscore, in many cases he will not be able tofind out which inputs will be selected in the next round.

The big question is, if we should try to protect optimally against suchan adversary, especially if the defense strategy comes at extrablockspace cost. If yes, we can add further ambiguity, by not onlycreating these "rolling groups", but creating groups with random inputs,or even brute-forcing all possible groups [with some time-out].

Static link:https://github.com/zkSNACKs/WalletWasabi/blob/8016404503bdffa475d8b219a6fe019a1d5775aa/WalletWasabi/WabiSabi/Client/CoinJoinClient.cs#L366-L433

WIP max suggested input value:https://github.com/zkSNACKs/WalletWasabi/pull/7748



On 4/6/22 18:05, Max Hillebrand via bitcoin-dev wrote:

Hello list,
tl;dr: client side coinjoin amount organization is bloody difficult.Our current approach: pick random number of inputs based on walletutxo count; pick that group of inputs which result in the lowestanonscore consolidation penalty; generate deterministic frequencytable as Schelling point; brute force decompose input sum into likelydenominations and pick randomly one of the good ones.
In previous coinjoin implementations, round parameters like the equaldenomination are dictated by the coordinator. This is in part becauseof the design constraints of the Chaumian blind signature coordinationprotocol. Given knowledge of the input sum of a user, an adversary canfind out which denominations the user received, even though it is moredifficult to find out exactly which equal amount output coin wasreceived. Furthermore, this leads to a worse usability as well as moreblockspace consumption. However, the coordinator can enforce forexample, that every user ends up in the same denomination, and thus avery large anonymity set is achieved.
This can be improved by using a coinjoin coordination protocol likeWabiSabi with less constraints, specifically no input-input linkage,and arbitrary input/output amount registration. Now the coordinatordoes not dictates round parameters like minimum equal amountdenomination nor the decomposition algorithm used. The idea is to makemore decisions client side, without substantially sacrificing theprivacy guarantees and anonymity set size of outputs.
This turns out to be a quite difficult problem. I will try my best toexplain the approach that is currently implemented in Wasabi Wallet'sthird release candidate. The code is linked below, sorry in advancefor any discrepancy or confusion in my explanation. Even though theresults seem to be alright, this is probably not the optimal approach,so I kindly ask you grey-bearded Bitcoin wizards to review, break andimprove it.
## Input Selection
First, the client finds out how many coins to select in this round.This is a random choice between the numbers 1 and 10. However, if thewallet currently has less than 35 utxos, there is a preference ofchoosing 1. If the wallet has more than 125 utxos, there is apreference of choosing 10. With a gradient in between. This is tocontrol the utxo count of the wallet. Noticeably this does not takeinto account the sats amount in the utxo set, so a user with 0.1 btcwill behave the same as one with 1000 btc. Maybe the target utxo countshould be adjusted based on value.
Next, the question of which coins to register: Ideally, those coinswhich result in the least anonscore loss possible. Shuffle allsuitable utxos [i.e. confirmed, below max anonscore target etc], andsort them ascending by anonscore, then descending by amount. Nowcreate groups with the size of the previously established input countX. The first coin until the X coin of the sorted list are the firstgroup, then shift one down, so the second group is the second coinuntil the X+1 coin. Do these "rolling groups" all the way to thebottom of the list. This way, coins which have a anonscore close toeach other are selected.
Remove those groups which have many coins coming from the sametransaction.
For each group, calculate the anonscore cost of input consolidationweighted by amount. If the selected coins have anonscore 3, 5 and 10,then the group has a anonscore of 3. The input with 10 anonscore thushas a 7 anonscore cost. Now weight this to the input value of therelevant coin in the group, so that a loss of anonscore in a highvalue coin is more costly than if it were a low value coin.
Pick that input group with the lowest weighted anonscore cost.
There is randomness in the number of inputs chosen, but the selectionof the best coin group is deterministic. Maybe there can be somerandomness in the final group selection, without suffering from toomuch anonscore consolidation penalty.
One additional idea [which is not yet implemented] is that thecoordinator suggests [not dictates] a maximum input value, whichchanges across different rounds. Large value inputs are not consideredsuitable, if the maximum suggested input value of the current round issmaller.
It is important to note that currently users choose their inputswithout knowing the inputs that other users have already registered.It should be possible to design the protocol in a way to share theinputs that were already registered, even if input registration is notyet complete. There are however some privacy concerns, like timingattacks, or de-registration of an input after it was announced toother users.
## Output Selection
The coordinator collects all input registrations, and forwards them toall users. At this point, all clients knows all inputs of thistransaction. The goal now is to get a Schelling point among users ofwhich output denominations to choose, so that the anonset size of eachdenomination is sufficiently large.
First of all, it's a good idea to limit the denominations that theclient will register. Some simulations confirmed that low Hemmingweight numbers are efficient, thus clients generate a list of standarddenominations which are: powers of two; powers of three; two timespowers of three; powers of ten; two times powers of ten; and fivetimes powers of ten. However, remove some of those denominations whichare very close to each other, more so for larger values. Notice thatthis list of standard denominations is the same across all rounds, itdoes not depend on specific inputs.
We can further decrease the list of potential denominations that theclient chooses, but specifically for every round. This is a furtherSchelling point of which denominations the client prefers to choose.This is done with a deterministic frequency table, based on the inputsof the proposed transaction.
Take each input and greedily decompose it into the standarddenominations, meaning every input has precisely one decomposition.[45 decomposes greedily into 32+10+3] Count the occurrences of everystandard denomination into a frequency table. All those standarddenominations, which have a count of 2 or larger, are consideredlikely denominations.
Notice that currently we remove the largest input from this frequencytable calculation. This is so that the whale does not mix alone byhimself. Also notice that individual inputs, and not input sums aredecomposed. This is because we found that generating the frequencytable based on only one input leads to a more accurate Schellingpoint, which increases anonset count of the finally chosendenominations. Finally, notice that we only calculate one singledecomposition for each input, the greedy one, but we could alsocalculate multiple different [or all possible] decompositions for eachinput, thus generate a larger frequency table and more likelydenominations.
Whereas the frequency table should be deterministic as a Schellingpoint, the actual user's input sum must not be deterministicallydecomposed, otherwise an adversary who knows the input sum would findout which denominations the client chose. [but not which of the equaloutputs he got]
The client takes his input sum [minus fees] and brute-force decomposesinto all possible groups of the likely denominations [those with highcount in this rounds' frequency table]. We found that in most cases,even with this reduced list of likely denominations, any input sum canbe decomposed into up to eight outputs. [Sometimes the wealthiest usergets a non-standard change amount] However, each decomposition hassome small amount of sats left over, which is is not put into anoutput value, but instead pays miner fees.
Sort this list of all possible output groups ascending by leftoveramount, and remove those groups which have a leftover amount 1.3xlarger than the best option. Further, remove a group if it has asimilar largest denomination as another one. [So far everythingdeterministic, given all coinjoin inputs and the users' input sum]
Out of this shorter list of output amount groups, shuffle and pickrandomly one of them. These are non-deterministic denominations whichwill be registered for the actual coinjoin outputs. If there were noshuffle, but a selection of the amount group with the lowest loss,users would save sats. But arguably having this randomness hereincreases privacy sufficiently to justify the slight increase inleftover amount cost.
Again, while choosing their own outputs, clients do not know whichoutputs other clients registered. If the client would have thisinformation, it could possibly increase the quality of it's own outputregistration substantially.
Notice there is a different decomposition strategies for the frequencytable [greedy] and the input sum [brute-force all]. Maybe, having thesame decomposition strategy here would lead to better results.
Notice further that there is no rank ordering of the possibledenominations based on some ambiguity score or entropy score. Rather,the choice is random, and in some cases, this might result in notoptimal outcomes.
Here are some results of our simulation of the current algorithm:

50 inputs 15 users

Median output count:    98
Median change count:    4
Median change percent:  3.2
Median out anonsets:    3.5
Median leftovers:       481

300 inputs 70 users

Median output count:    442
Median change count:    0.5
Median change percent:  0.3
Median out anonsets:    9.6
Median leftovers:       394


Thank you for your consideration to review!

Skol
Max
Third Wasabi 2.0 Release Candidate:https://github.com/zkSNACKs/WalletWasabi/releases/tag/v1.98.2.0
Input selection code:https://github.com/zkSNACKs/WalletWasabi/blob/master/WalletWasabi/WabiSabi/Client/CoinJoinClient.cs#L366-L492
Amount decomposer code:https://github.com/zkSNACKs/WalletWasabi/blob/master/WalletWasabi/WabiSabi/Client/AmountDecomposer.cshttps://github.com/zkSNACKs/WalletWasabi/blob/master/WalletWasabi/WabiSabi/Client/Decomposer.cs
Decomposition simulation: https://github.com/nopara73/sake


_______________________________________________
bitcoin-dev mailing list
bitcoin-dev@lists.linuxfoundation.org
https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev

_______________________________________________
bitcoin-dev mailing list
bitcoin-dev@lists.linuxfoundation.org
https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev

Re: [bitcoin-dev] ***UNCHECKED*** Client side coinjoin amount organization with WabiSabi

Reply via email to

Re: [bitcoin-dev] UNCHECKED Client side coinjoin amount organization with WabiSabi