ah, I see - that makes a lot more sense, thanks. Should be a lot more scalable for large numbers of users.
On 22 June 2013 21:26, Michael MacFadden <michael.macfad...@gmail.com> wrote: > Thomas, > > At the moment we are talking about the OT algorithms, not how the nodes > physically communicate. As was previously discussed, there very well > could still be one or more servers in the picture. This is about where > and how the concurrency control occurs. At the moment a single sever > (even when federating) is in control of all concurrency control. This is > what we are trying to avoid. Even if there are still servers, the OT > should be a P2P OT algorithm. > > ~Michael > > On 6/22/13 12:03 PM, "Thomas Wrobel" <darkfl...@gmail.com> wrote: > >>Sorry as this has probably been explained - but if client p2p is a >>possibility, how will clients find eachother :? >>Surely everyone would need static external IPs - which simply isn't >>the case with the web today. >> >>This is mostly all over my head - but I have never even seen a web >>based p2p chat program before. I thought it simply wasn't possible to >>communicate between two browsers without a sever between them. >> >>or does p2p mean something different in this context? >> >>On 22 June 2013 18:53, John Blossom <jblos...@gmail.com> wrote: >>> Michael, >>> >>> I like that way that you put it. Hopefully what we wind up with is a >>> protocol that is fundamentally P2P but which interacts with servers as >>> ultra-efficient UI-less peers equipped with some super-powers and >>>services >>> that may not be present in the P2P domain persistently. Call it p2p2P, >>>so >>> to speak. John >>> >>> On Fri, Jun 21, 2013 at 10:04 PM, Michael MacFadden < >>> michael.macfad...@gmail.com> wrote: >>> >>>> MY humble opinion on the P2P issue is as follows. I think if we >>>>develop >>>> algorithms that can work in a P2P mode, then we can also support a >>>> client/server architecture as well by just controlling who the peers >>>>talk >>>> to. I think the problem with wave was not the client server >>>>architecture, >>>> but rather the way the servers interacted with each other. The servers >>>> themselves implemented something like a client/server relationship >>>>within >>>> federation. This meant that even if you were connected to your local >>>>wave >>>> server, if that wave server could not communicate with the wave server >>>> that initiated the wave, you were out of luck. >>>> >>>> I am not against having servers at all. In fact I think that things >>>>get >>>> very complicated if you have no servers what so ever (document storage, >>>> discovery, users, etc.). But if we need to make sure servers are >>>>peers. >>>> So we need a P2P style OT algorithm. Again do not confuse a P2P >>>>network >>>> topology (DHT, etc) with P2P OT Algorithms. A P2P OT Algorithm can >>>>also >>>> easily be made to behave like a Client/Server OT algorithm, where as >>>>the >>>> reverse is not feasible. >>>> >>>> ~Michael >>>> >>>> On 6/21/13 5:46 PM, "Sam Nelson" <so...@orcon.net.nz> wrote: >>>> >>>> >Wow, that was some heavy reading (: >>>> > >>>> >This section raised some questions for me: >>>> > >>>> >*4) Routing p2p messages/events in a pure P2P system (5 parts)* >>>> >How to manage to route all wave-stuff if we want to completely get >>>>rid of >>>> >servers completely, and only use peers. >>>> >The closest way would be to use a DHT, but huge latency is an unsolved >>>> >problem, and makes it impossible to use for real-time waving. >>>> >No other solution has been proposed. >>>> > >>>> >My question is simply, perhaps naively: why pure p2p? >>>> >Originally when I heard of p2p OT I saw it as a way to collaboratively >>>> >work offline in a LAN environment, and to sync pairs that are almost >>>> >always offline, by means of a proxy peer that moves between WAN and >>>>the >>>> >offline LAN. The peers would talk in much the same way that two >>>> >federating servers would, using their offline caches as a datasource >>>> >instead I'm guessingthis is like the MESH network John refers to. >>>> > >>>> >When talking about P2P between peers /over the internet/ - could >>>> >somebody please explain to be the purpose of and vision for this? Why >>>> >not just use a server, it seems to simplify things alot? (Firewalls, >>>> >authentication - can do offline like Windows 8 does with Microsoft >>>> >Account) >>>> > >>>> >Is purep2p just for privacy? Or is it really for alternate uses of the >>>> >protocol - other than the the documents and conversation use cases we >>>> >saw in Google Wave? >>>> > >>>> >Just an idea, in order to "open the eyes" of those drawn to this >>>>mailing >>>> >list, might it be beneficial to build up a wiki page of accepted use >>>> >cases so that everyone can read them and take them into account when >>>> >considering different ideas? That'd facilitate discussions like "well >>>> >this works for all our use cases except #13.... <discussion ensues >>>>about >>>> >this case>" >>>> > >>>> >Sam >>>> > >>>> > >>>> >On 22/06/2013 01:06, John Blossom wrote: >>>> >> Bruno, >>>> >> >>>> >> Thanks, this is an excellent summary. It helps me to get the gist of >>>> >>things >>>> >> more clearly. >>>> >> >>>> >> On the P2P latency, I don't think that it would be unacceptable to >>>>draw >>>> >>a >>>> >> line and say that P2P provides limited, non-guaranteed realtime OT >>>>or >>>> >>that >>>> >> it's not realtime OT and more of a syncing mode than a conversation >>>> >>mode. >>>> >> That would probably be sufficient for what needs to be done, >>>>especially >>>> >> since in some instances P2P-enabled Wave sessions may be using MESH >>>> >> networks for transport - a key factor in how a lot of experimental >>>> >> communications services are being deployed in developing nations >>>>(not >>>> >>just >>>> >> the Project Loon concept). In the MESH model, you're likely to have >>>>one >>>> >> node within range of another temporarily, which may sync with it, >>>>and >>>> >>then >>>> >> pass along data to another node when it comes in range of it. >>>>That's the >>>> >> most probable scenario for P2P in many instances, I would think. The >>>> >>other >>>> >> potential scenario: two people in a remote location, for the sake of >>>> >> argument two movie script-writers who have holed themselves up in a >>>> >>remote >>>> >> location to collaborate on a common script. They're on two devices >>>>that >>>> >>are >>>> >> very proximate to one another, so perhaps the latency issues will >>>>not >>>> >>be so >>>> >> severe. >>>> >> >>>> >> Things to think about, I will look at this more carefully later >>>>today. >>>> >> >>>> >> All the best, >>>> >> >>>> >> John Blossom >>>> >> >>>> >> On Fri, Jun 21, 2013 at 8:05 AM, Bruno Gonzalez (aka stenyak) < >>>> >> sten...@gmail.com> wrote: >>>> >> >>>> >>> Following Joseph's "A Very Wavey Plan (P2P!)" thread, a couple of >>>> >>> discussions have taken place at the irc.freenode.net #wiab channel, >>>> all >>>> >>> related to P2P. >>>> >>> >>>> >>> I've taken the liberty to restructure the IRC logs, remove some >>>> >>>chitchat, >>>> >>> and divide it into sub-discussions. Feel free to reply to any part >>>>of >>>> >>>this >>>> >>> email to continue a discussion. >>>> >>> >>>> >>> >>>> >>> *Summary of discussions:* >>>> >>> *====================* >>>> >>> *1) Underlying protocol for P2P federation* >>>> >>> Currently XMPP is used. HTTP and raw TCP are two suggested >>>>candidates >>>> >>>(HTTP >>>> >>> allowing to much more easily reach restricted networks). >>>> >>> >>>> >>> *2) Message/event types needed for P2P federation to work* >>>> >>> We'd need something similar in concept to certain git operations >>>>(git >>>> >>> clone, git push...). All will be based on hashes (not incremental >>>> >>> integers). >>>> >>> >>>> >>> *3) Routing p2p messages/events in a server-aided network* >>>> >>> One option is to somehow detect server clusters, send data to one >>>>of >>>> >>>them, >>>> >>> and let the rest of the cluster servers synchronize to it >>>>(locally). >>>> >>> Alternatively, the originator server can naively send stuff to all >>>> >>>possible >>>> >>> destination servers, regardless of the cost. >>>> >>> >>>> >>> *4) Routing p2p messages/events in a pure P2P system (5 parts)* >>>> >>> How to manage to route all wave-stuff if we want to completely get >>>>rid >>>> >>>of >>>> >>> servers completely, and only use peers. >>>> >>> The closest way would be to use a DHT, but huge latency is an >>>>unsolved >>>> >>> problem, and makes it impossible to use for real-time waving. >>>> >>> No other solution has been proposed. >>>> >>> >>>> >>> *5) Implementing "undo": invertibility, tombstones, edge cases, >>>>TP2* >>>> >>> No server means no canonical order of commits, which means that >>>>undo is >>>> >>> hard to do correctly. >>>> >>> (uhm... not sure if that's a good summary, some stuff went over my >>>>head >>>> >>> :-D, please read the log instead) >>>> >>> >>>> >>> *6) Usability of a pure p2p system in Real Life (tm)* >>>> >>> Being pragmatic, pure P2P is probably only usable in peers with >>>>good >>>> >>> connectivity. Rest of peers will need to rely on a server/proxy >>>>that >>>> >>>*does* >>>> >>> have good connectivity. >>>> >>> >>>> >>> *7) Comparison with BitTorrent and P2P-TV technologies* >>>> >>> Both technologies are much less restricted than wave with regards >>>>to >>>> >>> real-time responsiveness. So none are really a good reference for >>>>our >>>> >>> purposes. >>>> >>> >>>> >>> *8) Identifying participants (3 parts)* >>>> >>> Pure p2p means many peers don't have a >>>>name@centralized-server.comuser >>>> >>> handle, so an alternative has to be used. >>>> >>> However, it's easy to provide a traditional friendly handle, if the >>>> >>>user >>>> >>> prefers the tradeoff of having to often rely on a permanent server. >>>> >>>This >>>> >>> tradeoff can be mitigated by using a sort of userhandle cache. >>>> >>> >>>> >>> *9) P2P anonymity (lurking in a wave) (2 parts)* >>>> >>> In a pure p2p wave network, anonymous peers may want to read a >>>>public >>>> >>>wave, >>>> >>> without other peers knowing. A solution could be to make private >>>>the >>>> >>> required wavelets (where the anonymous participants IDs are >>>>stored). >>>> >>> >>>> >>> *10) Encryption of waves* >>>> >>> It's been proposed to use an AES key to encrypt all the wave data, >>>>and >>>> >>>only >>>> >>> allow participants to decrypt it. >>>> >>> >>>> >>> *11) Addition and removal of participants, and their ability to >>>>read >>>> >>>past >>>> >>> and future wave versions/deltas* >>>> >>> The aforementioned AES key can change over time, allowing a >>>> >>>finer-grained >>>> >>> restriction of what deltas new/removed participants can read. >>>> >>> >>>> >>> >>>> >>> * >>>> >>> * >>>> >>> >>>> >>> *Actual conversations:* >>>> >>> *====================* >>>> >>> * >>>> >>> * >>>> >>> *1) Underlying protocol for P2P federation:* >>>> >>> [in response to Joseph's email] >>>> >>> [23:42] <alown> I [...] agree with option 2 (make every root a JSON >>>> >>>blob) >>>> >>> [23:43] <alown> You haven't really detailed (at all) how the P2P >>>> >>>federation >>>> >>> is actually going to work (beyond 'not like IRC') >>>> >>> [23:44] <josephg> Personally, I'd love some raw TCP action >>>> >>> [23:44] <alown> I agree using KISS principle. >>>> >>> [23:44] <josephg> a few years ago (not long after wave was >>>>cancelled) >>>> >>>there >>>> >>> was a 'wave summit' >>>> >>> [23:45] <josephg> - and a few of us chatted about how we could >>>>make the >>>> >>> federation protocol simpler >>>> >>> [23:45] <josephg> we ended up (somehow) deciding that doing it over >>>> >>>http >>>> >>> woul dbe a good idea >>>> >>> [23:45] <josephg> because then we could sneak it into companies >>>>past >>>> >>>their >>>> >>> corporate HTTP firewalls, etc >>>> >>> [23:45] <josephg> but in any case, I'd like to figure out the >>>>protocol >>>> >>>and >>>> >>> (at least) have a TCP version >>>> >>> [23:46] <josephg> it should be pretty easy to wrap the same >>>>messages in >>>> >>> websockets if we want >>>> >>> >>>> >>> >>>> >>> *2) **Message/event types needed for P2P federation to work:* >>>> >>> [23:46] <alown> Do we need anything more complicated than the >>>> >>> waveletSubmit/Commit messages used currently? >>>> >>> [23:46] <alown> (Replace wavelet with 'abstract p2p ot container >>>>name) >>>> >>> [23:46] <josephg> um, yeah. >>>> >>> [23:47] <josephg> we'll also be able to rip out all the code that >>>>deals >>>> >>> with managing the tree of servers per wave >>>> >>> [23:47] <josephg> but yeah - the protocol will get a bit more >>>> >>>complicated >>>> >>> [23:47] <josephg> ... because we'll lose our beautiful integer >>>>version >>>> >>> numbers >>>> >>> [23:47] <josephg> so we'll need a protocol for syncronizing ops >>>> >>> [23:48] <josephg> yeah - ops will each have a hash >>>> >>> [23:48] <josephg> and two servers could each have ops the other >>>>server >>>> >>> doesn't have >>>> >>> [23:48] <josephg> so we have to be able to deal with that >>>> >>> [23:47] <alown> What other 'events' are cared about by any >>>>particular >>>> >>> server? >>>> >>> [23:47] <alown> For a SHA hash? >>>> >>> [23:48] <josephg> -> we'll need something like git's sync protocol >>>> >>> [23:48] <alown> So, initial server contact is 'git clone', and then >>>> >>>some >>>> >>> form of 'git push' on changes? >>>> >>> [23:49] <josephg> yep. >>>> >>> [23:49] <josephg> push on changes is easy - its basically the same >>>> >>>thing we >>>> >>> have now >>>> >>> [23:49] <josephg> just instead of saying "This should be applied at >>>> >>>version >>>> >>> 10" we say "This op has parents [abc123, def456]" >>>> >>> >>>> >>> >>>> >>> *3) **Routing **p2p **messages/events in a server-aided network:* >>>> >>> [23:49] <alown> With P2P do we have to broadcast to all peers? How >>>>do >>>> >>>we >>>> >>> coordinate that between them? >>>> >>> [23:50] <josephg> between servers? I dunno. >>>> >>> [23:50] <alown> How does BT handle this? >>>> >>> [23:50] <josephg> should we just connect every server to every >>>>other >>>> >>> server? That'd work fine... >>>> >>> [23:50] <josephg> I guess every server can address every other >>>>server >>>> >>> [23:50] <josephg> beacuse the wave will have al...@a.com and >>>> >>> josephg@b.comand so on on it >>>> >>> [23:50] <alown> This feels very inefficent... >>>> >>> [23:51] <josephg> so if you submit an op to your server, your >>>>server >>>> >>>can go >>>> >>> "Oh, I need to tell b.com about this too" >>>> >>> [23:51] <josephg> well, if there's 10 servers, presumably all 10 >>>> >>>servers >>>> >>> need to find out about ops somehow. >>>> >>> [23:51] <josephg> - assuming we stick with the current model of >>>>having >>>> >>> servers store all your operations >>>> >>> [23:51] <josephg> .. and documents for all the users at their >>>>domain >>>> >>> [23:51] <alown> But server 'b' and 'c' might both be part of a >>>>wave, >>>> >>>but >>>> >>> also know each other, and know that they are 'closer' to each other >>>> >>>than >>>> >>> 'a' is. So, we would want a->b/c then b<->c >>>> >>> [23:52] <josephg> so actually, having the server which originates >>>>an >>>> >>> operation send it to all the other servers on that wave is actually >>>> >>>close >>>> >>> to ideal. >>>> >>> [23:52] <josephg> yeah maybe. >>>> >>> >>>> >>> >>>> >>> *4) **Routing **p2p **messages/events in a pure P2P system (part >>>>1):* >>>> >>> [23:54] <alown> BT uses DHT for its P2P stuff... >>>> >>> [23:54] <josephg> ...I guess we could use a DHT storing all the >>>>ops, >>>> >>>but >>>> >>> thats pretty slow >>>> >>> [23:55] <josephg> and you still need to notify all servers with >>>>users >>>> >>>on >>>> >>> the wave that the wave was updated. >>>> >>> [23:55] <alown> Maybe, or perhaps only notify those within a >>>>certain >>>> >>> 'distance', with each server doing that. (Though could mean some >>>> >>>servers >>>> >>> are never updated) >>>> >>> [23:58] <alown> Perhaps we could make the network setup >>>>'SuperWaves' >>>> >>>which >>>> >>> broadcast to all peers, and carry all information, but normal wave >>>> >>>servers >>>> >>> do not reach this status? >>>> >>> [23:58] <alown> By having it decide itself based on how >>>>'connected' a >>>> >>> server is, this could find the most efficent ways to route it. >>>> >>> [00:01] <josephg> Do you think it'll really be a problem? >>>> >>> [00:01] <josephg> I mean, thinking about it - how many servers >>>>will be >>>> >>>on a >>>> >>> given wave? >>>> >>> [00:01] <alown> Depends. >>>> >>> [00:01] <alown> No idea. >>>> >>> [00:01] <josephg> If it were a public wave, I can imagine clients >>>>just >>>> >>> connecting to one (or more) centralized servers >>>> >>> [00:01] * josephg nods >>>> >>> [00:02] <josephg> ... But say if we were having a conversation on >>>> >>> wave-dev@apache, there's like, at most 20 people in a discussion >>>>from >>>> >>>5 or >>>> >>> so domains >>>> >>> [00:03] <josephg> ... I think we can deal with that kind of load. >>>> >>> [00:04] <josephg> but if the protocol lets any server tell any >>>>other >>>> >>>server >>>> >>> about an operation, then it should be pretty easy to set up >>>>something >>>> >>>like >>>> >>> that. >>>> >>> [00:04] <josephg> maybe. >>>> >>> [00:04] * josephg thinks >>>> >>> [00:05] <josephg> hm - you're right. I think I've just gotten used >>>>to >>>> >>>the >>>> >>> crappy state of doing routing for broadcasting messages to a >>>>network >>>> >>> [00:05] <josephg> if you can find / think of a better solution, >>>>I'm in. >>>> >>> [00:12] <alown> Heh, anyway replacing the network layer code >>>>SHOULD be >>>> >>> easy, since it SHOULD be cleanly seperated. >>>> >>> [00:13] <alown> Getting an initial implementation up using >>>>broadcast is >>>> >>> fine. >>>> >>> [00:13] <alown> (I was thinking of Wave's use in other apps as a >>>> >>>reason you >>>> >>> could have a lot of different participant domains) >>>> >>> *...4) Routing **p2p **messages/events in a pure P2P system (part >>>>2):* >>>> >>> [08:53] <stenyak> as for the "how to *really* do p2p", i see two >>>> >>>options: >>>> >>> a) use a dht-like algorithm and/or b) use a helper server to route >>>> >>>stuff >>>> >>> for you >>>> >>> [08:54] <stenyak> a) can be pretty slow if you want all OPs to >>>>reach >>>> >>>all >>>> >>> peers (if I'm not mistaken) >>>> >>> [08:54] <stenyak> and b) is essentially makes it not-p2p >>>> >>> [08:55] <stenyak> additionally, using p2p, how are we going to deal >>>> >>>with >>>> >>> routing problems (such as firewalls on both sides, etc)? >>>> >>> [08:56] <stenyak> in my mind, the only universal solution is to >>>>have a >>>> >>> third party server available to go through if we want speed or if >>>>we >>>> >>>want >>>> >>> to work on all edge cases >>>> >>> [08:56] <stenyak> and wave being advertised as realtime, i don't >>>>see >>>> >>>how >>>> >>> something like dht can ever fly >>>> >>> [11:20] <alown> stenyak: This is why I was wondering about a DHT >>>>system >>>> >>> with 'Superwave' servers (to act as a first point of contact). >>>> >>> [11:59] <stenyak> that would be like skype dynamic supernode list? >>>> >>> [11:59] <alown> The original system, yes. >>>> >>> [12:02] <stenyak> so we would devise a method to identify >>>>candidates to >>>> >>> being a supernode, in order to prevent cellphone wave peers from >>>> >>>becoming >>>> >>> one, and in order to promot certain other nodes (like major peers >>>>that >>>> >>>have >>>> >>> 99% uptime, e.g. wave.google.com or whatever) to become one >>>> >>> [12:03] <stenyak> bandwidth, latency, open ports, uptime... >>>> >>> [12:04] <alown> Once a network has been bootstrapped using >>>>something, >>>> >>>it is >>>> >>> relatively easy to identify the hosts which are most densely >>>>connected >>>> >>>(and >>>> >>> would be good supernode candidates) >>>> >>> [12:05] <stenyak> what do you mean with "using something"? >>>> >>> [12:06] <alown> Somehow the network has to initially be able to >>>>make >>>> >>> contact with other nodes (before it knows anything about them) >>>> >>> [12:07] <alown> For a LAN you could get away with a broadcast >>>> >>>'announce', >>>> >>> but it is a bit less clear on an internet-sized scale. >>>> >>> [12:08] <stenyak> bittorrent sync uses a broadcast for LAN. for >>>> >>>internet it >>>> >>> uses a tracker server for fast discovery of peers, or you can >>>>disable >>>> >>>that >>>> >>> and force to use DHT (with the long wait that means) >>>> >>> [12:09] <stenyak> the tracker can also act as a meeting-point for >>>> >>> firewalled peer pairs (which in my experience is a lot of them) >>>> >>> [12:09] <alown> Precisely the problem, because we don't really want >>>> >>>long >>>> >>> waits or trackers. >>>> >>> *...4) Routing **p2p **messages/events in a pure P2P system (part >>>>3):* >>>> >>> [12:42] <stenyak> hmmm... i'm not sure how a peer gets a list of >>>>waves >>>> >>>in >>>> >>> which he's a participant of >>>> >>> [12:43] <alown> Having a canonical source makes it all so much >>>>easier. >>>> >>>:P >>>> >>> [12:44] <stenyak> for pure p2p peers to "receive" new waves, >>>>either the >>>> >>> FROM or the TO peer (or both) would need to try to find their way >>>>to >>>> >>>the >>>> >>> other >>>> >>> [12:44] <stenyak> and we're assumign here that each person only >>>>runs >>>> >>>one >>>> >>> peer >>>> >>> [12:45] <stenyak> e.g. my privatekey may be used by 5 wave peers >>>>at the >>>> >>> same time, and we must make sure the new wave reaches all of them >>>> >>> [12:46] <alown> Looks like we may need to have mulitple DHTs then >>>>(one >>>> >>>for >>>> >>> ops, one for waves) >>>> >>> [12:46] <stenyak> in BT, it's the receiver end who actively looks >>>>for >>>> >>>peers >>>> >>> to receive from. in wave, it's not like that.. >>>> >>> [12:46] <alown> Or could we have a pubkey->wave mapping in one? >>>> >>> [12:46] <stenyak> and in BT, you can assume *many* people has the >>>>data >>>> >>>you >>>> >>> want >>>> >>> [12:46] <stenyak> in wave, its possible and probably that only one >>>> >>>other >>>> >>> peer in the universe has the wave >>>> >>> [12:46] <stenyak> (because it's a personal wave sent to you) >>>> >>> [12:47] <alown> I would expect any long-running supernodes to be >>>> >>>implicitly >>>> >>> part of all waves they know about. >>>> >>> [12:47] <alown> Though on second thought, this seems like it would >>>>add >>>> >>>its >>>> >>> own problems to authentication, storage, promotion of supernodes >>>>etc. >>>> >>> *...4) Routing **p2p **messages/events in a pure P2P system (part >>>>4):* >>>> >>> [12:51] <alown> Does it make sense for a peer to have your privkey, >>>> >>>since >>>> >>> you could be logged in anywhere, so it would be down to the place >>>>you >>>> >>>are >>>> >>> logged in, to 'subscribe' to that wave on the network, and attempt >>>>to >>>> >>> retrieve all data from it... >>>> >>> [12:55] <alown> I was expecting the network as a whole to act like >>>>a >>>> >>> WaveBus pubsub system, whereby once 'logged in' at some server >>>>(which >>>> >>>means >>>> >>> it gets your privkey from the authentication system), that server >>>>then >>>> >>> 'subscribes' to your waves on the 'network'. If somebody else at >>>>some >>>> >>>other >>>> >>> server changes it, then that server would be announcing to the >>>>network >>>> >>>of a >>>> >>> change (doesn't necesserily have to be a broadcast), which your >>>>server >>>> >>> would 'hear'. >>>> >>> [12:56] <alown> You could do this from any server where you logged >>>>in >>>> >>> (hence the concept of a domain is lost). >>>> >>> [12:57] <stenyak> by "server" you mean supernodes? >>>> >>> [12:57] <alown> Not necessarily. >>>> >>> [12:59] <stenyak> this pubsub network must be aware of nodes that >>>>are >>>> >>>in >>>> >>> it, in order to directly route wave updates to them, correct? >>>> >>> [12:59] <stenyak> and also, this network wouldn't be very >>>>volatile, but >>>> >>> would rather ideally be long-lived peers? >>>> >>> [13:00] <alown> It has no reason to have to directly route updates, >>>> >>>(though >>>> >>> it would hopefully be able to identify the best routes >>>>automatically). >>>> >>> [13:00] <alown> Yes it would require a few long-lived peers (which >>>> >>>would be >>>> >>> part of the requirement to be a supernode). >>>> >>> [13:01] <stenyak> so let's say i connect my laptop wave peer to the >>>> >>> "server" in the living room, at my firewalled home. this "server" >>>> >>>would be >>>> >>> already subscribed to the pubsub network, and in this specific >>>>case it >>>> >>> would route all wave updates to me >>>> >>> [13:02] <stenyak> in other cases (let's say, ipv6-enabled nodes >>>> >>>everywhere, >>>> >>> no firewall at home), the living room server could simply notify >>>>the >>>> >>> original "FROM" peer to send stuff to my laptop ipv6 ip, right? >>>> >>> [13:03] <alown> That sounds right. Supernodes are really only >>>>needed >>>> >>>for >>>> >>> getting the routing right. >>>> >>> [13:05] <stenyak> ok. in both these theoretical cases, the "server" >>>> >>>hasn't >>>> >>> necessarily been a wave node per se (nor a supernode either), but >>>> >>>rather a >>>> >>> second type of wave node that helps get stuff quickly wherever it's >>>> >>>needed >>>> >>> [13:05] <alown> Yes. >>>> >>> [13:05] <alown> I am not even sure where OT should be happening in >>>>this >>>> >>> picture... >>>> >>> [13:05] <stenyak> if OT happens, the "server" is a blind proxy i >>>>think >>>> >>> [13:06] <stenyak> so does not need the privkey to work >>>> >>> [13:07] <stenyak> unless we're also using OT in the wavebus pubusb >>>> >>>network >>>> >>> for some reason? >>>> >>> [13:07] <alown> Supernodes can be blind (though they might also >>>>just be >>>> >>> normal well-connected wave servers). I would expect normal servers >>>>to >>>> >>>still >>>> >>> be doing OT. The question is whether the 'client' (whatever that >>>>means) >>>> >>> should be doing it also. >>>> >>> [13:08] <alown> The network shouldn't need OT. (Algorithms exist >>>>that >>>> >>>allow >>>> >>> the incoming ops to be arbitarily queued and only processed when >>>> >>>needed). >>>> >>> [...] >>>> >>> [21:21] <josephg> alown: the client always needs to do OT because >>>> >>>otherwise >>>> >>> they can't both edit a document live and receive operations from >>>> >>>people who >>>> >>> didn't have their ops. >>>> >>> [21:22] <josephg> the server doesn't need to do OT, although if it >>>> >>>doesn't >>>> >>> do OT, it'll punt the OT work to its clients - which will result >>>>in a >>>> >>> higher CPU utilization on mobile devices. >>>> >>> [...] >>>> >>> [13:08] <stenyak> i pictured this "server" as being an optional >>>>item >>>> >>>that >>>> >>> shortcuts the long waits of DHT, rather than something necessary >>>>for >>>> >>> "clients"? >>>> >>> [13:08] <alown> Hmm. >>>> >>> [13:08] <alown> I suppose we should define what a 'client' is >>>>then... >>>> >>> [13:09] <alown> We have at least 2 layers of stuff going on here: >>>>1) >>>> >>>Wave >>>> >>> OT/operation layer 2) Network routing/P2P layer >>>> >>> [13:13] <alown> But it is quite plausible something might be doing >>>> >>>both of >>>> >>> those >>>> >>> [13:10] <stenyak> with your pubsub net suggestion, i was picturing >>>>2 >>>> >>>kinds: >>>> >>> a regular pure p2p peer, and a helper kind of node to route stuff >>>> >>>quickly >>>> >>> when a peer is connected to it >>>> >>> [13:13] <stenyak> so with that picture in mind, layer 1 stuff >>>>could go >>>> >>> directly from peer to peer (if connectivity/firewalls allows), or >>>> >>>through >>>> >>> the "helper node" if available >>>> >>> [...] >>>> >>> [13:20] <stenyak> [...] all this discussion looks very similar to >>>> >>> discussing how to design internet+dns, i think the problems are the >>>> >>>same >>>> >>> really >>>> >>> [13:20] <stenyak> or at least we could take some inspiration from >>>>it >>>> >>>maybe >>>> >>> [13:20] <alown> This was my conclusion last night with josephg. >>>>('The >>>> >>> problmes should already be solved (see The Internet)') >>>> >>> [14:09] <stenyak> and The Internets solved the problem how? By >>>>having a >>>> >>> large set of supernodes (dns servers), that may take a whole day to >>>> >>> propagate updates. The alternative being having the actual IP >>>>address >>>> >>>in >>>> >>> the first place, or to centralize stuff >>>> >>> [14:10] <stenyak> (aka use servers everywhere) >>>> >>> [14:22] <alown> Maybe, but the internet's design is X (where X > >>>>20) >>>> >>>years >>>> >>> old, so may not represent the most modern thinking of how to make >>>> >>> distributed networks. >>>> >>> [14:59] <alown> (Don't forget that our aim for Wave is at the >>>> >>>cutting-edge >>>> >>> of academic research also). >>>> >>> [...] >>>> >>> [14:50] <stenyak> i just threw the question at some friends who >>>>should >>>> >>>be >>>> >>> more up-to-date with networking technologies than me... hopefully >>>>they >>>> >>> comeback with some revolutionary dns-2 design or something that we >>>>can >>>> >>>copy >>>> >>> [15:18] <stenyak> could give as some ideas: http://openpeer.org/ >>>> >>> [15:18] <stenyak> (it's not a solution, but maybe they did the same >>>> >>> reasoning we're going through) >>>> >>> [15:46] <stenyak> another response i got goes along the lines of... >>>> >>>hard as >>>> >>> fuck, but if you manage to do it, you are a hero >>>> >>> [...] >>>> >>> [15:02] <stenyak> looking at it from a wider perspective, what we >>>>want >>>> >>>is >>>> >>> similar to having each peer shout at the whole world "here i am, >>>> >>>anything >>>> >>> got something for meeee?" in some way that doesn't clog the >>>>internet >>>> >>>tubes, >>>> >>> and that is so fast as shouting would be. i start to think it's not >>>> >>> physically possible to do that... >>>> >>> [15:03] <stenyak> if publickeys were handed to people based on the >>>> >>> location, then we could have routing tables similar to how internet >>>> >>> currently works >>>> >>> [15:03] <stenyak> but pubkeys are... well, random. so that kind of >>>> >>>routing >>>> >>> that allows anyone to connect to an arbitrary IP in a matter of >>>> >>> milliseconds is impossible, i believe >>>> >>> [15:04] <alown> So, we end up with DNS for public keys? >>>> >>> [15:04] <stenyak> something like dns, but much faster [wrt. >>>>propagation >>>> >>> times] >>>> >>> [15:05] <stenyak> so in essence, a tree of servers or whatever >>>>(which >>>> >>>is >>>> >>> similar to how wave currently works, right?) >>>> >>> [15:05] <alown> Heh. But the whole point was to avoid the tree >>>>system >>>> >>> currently (since it is susceptible to netsplits) >>>> >>> [...] >>>> >>> [15:56] <stenyak> maybe the real question could be: how do we make >>>>DHT >>>> >>>much >>>> >>> faster? >>>> >>> [16:14] <stenyak> once the initial discovery process is finished, >>>>the >>>> >>> transmission of data will not have the lag associated with DHT, so >>>> >>>even if >>>> >>> DHT takes 10 seconds, that could be acceptable >>>> >>> [16:15] <stenyak> i.e. a new peer takes 10 seconds to be >>>>discovered by >>>> >>>the >>>> >>> rest of participants collaborating in a wave >>>> >>> [16:16] <stenyak> (or viceversa.. the new peer takes 10 seconds to >>>> >>>discover >>>> >>> the participants) >>>> >>> [...] >>>> >>> [16:25] <stenyak> this could shed some light: >>>> >>> >>>> >>> >>>> >>> >>>> >>>>http://en.wikipedia.org/wiki/Distributed_hash_table#Algorithms_for_overl >>>> >>>ay_networks >>>> >>> [19:06] <stenyak> http://dsn.tm.kit.edu/english/2936.php >>>> >>> *...4) Routing **p2p **messages/events in a pure P2P system (part >>>>5):* >>>> >>> [21:03] <josephg> [...] For now, I want wave to be p2p in the same >>>>way >>>> >>>that >>>> >>> git is p2p. >>>> >>> [21:04] <josephg> that is, I want the core algorithms & data >>>> >>>structures to >>>> >>> use P2P-capable algorithms, and probably the wave servers will do >>>>p2p >>>> >>> between themselves (this is easy because they'll all be both named >>>>and >>>> >>> accessable) >>>> >>> [21:06] <josephg> as for client-to-client p2p, there's a few >>>>options >>>> >>> depending on what kind of use cases we want to support - but I >>>>want to >>>> >>> worry about getting the algorithms p2p-capable first. If you're >>>>keen >>>> >>>to set >>>> >>> up an anonymous, distributed wave system over a DHT - well, I want >>>>to >>>> >>>first >>>> >>> make that possible >>>> >>> [21:15] <josephg> .... and as for ipv6, network admins _love_ NAT >>>>now >>>> >>>that >>>> >>> we have it >>>> >>> >>>> >>> >>>> >>> *5) Implementing "undo": invertibility, tombstones, edge cases, >>>>TP2:* >>>> >>> [00:17] <alown> I am not sure how an 'undo stack' is going to work >>>>(at >>>> >>>all) >>>> >>> with federation... >>>> >>> [00:18] <josephg> well, you just do undo at the application level >>>> >>> [00:19] <josephg> "submit op which inserts text" ... later "submit >>>>op >>>> >>>which >>>> >>> removes text" >>>> >>> [00:19] <josephg> you don't need OT for that. >>>> >>> [00:20] <josephg> I imagine like, a semantic undo. In the client >>>>you >>>> >>>can >>>> >>> imagine making an undo op (which might not necessarily rollback an >>>> >>> operation (because of tombstones and all that)) >>>> >>> [00:20] <josephg> ... but would seem that way as far as the user is >>>> >>> concerned >>>> >>> [00:21] <josephg> then if the user hits ctrl+z, you can transform >>>>that >>>> >>> operation up to the current version and apply it >>>> >>> [00:21] <josephg> - the fact that its an undo isn't really >>>>relevant. >>>> >>> [00:21] <josephg> the bad thing about losing invertability is doing >>>> >>> playback >>>> >>> [00:21] <josephg> - because you can't scrub back through time >>>> >>> [00:21] <alown> But you have all the operations since the start, so >>>> >>>you can >>>> >>> play forward at least? >>>> >>> [00:23] <josephg> yeah exactly. >>>> >>> [00:23] <josephg> ... and make like, keyframes of the document >>>> >>> [00:23] <josephg> - and play forward from them or something. >>>> >>> [00:23] <alown> Hmm, so you can do the step-back without >>>>recalculating >>>> >>>the >>>> >>> entire document? >>>> >>> [00:24] <alown> I don't really like the idea of then having another >>>> >>> datastructure to have to pass around... >>>> >>> [00:24] <josephg> right - if you have a snapshot at version 1000, >>>>and >>>> >>>the >>>> >>> user is looking at 1010 and they try to step back to 1009, you can >>>>just >>>> >>> replay ops 1001-1009 on that version 1000 snapshot >>>> >>> [00:24] <alown> What was the problem with invertible operations (I >>>> >>>don't >>>> >>> understand OT enough yet to be able to properly comment on that >>>>side). >>>> >>> [00:25] <alown> (Other than it confuses people?) >>>> >>> [00:25] <josephg> hahaha actually people seem to love invertability >>>> >>> [00:25] <josephg> I don't know why. >>>> >>> [00:25] <josephg> I've been trying to remove it from sharejs, and >>>> >>>everyone >>>> >>> gets sad. >>>> >>> [00:26] <josephg> the problem is that if I make an op which >>>>deletes the >>>> >>> whole document (version 100, say) then I undo that operation >>>> >>> [00:26] <josephg> and you insert in the middle of the document at >>>> >>>version >>>> >>> 100, then your op gets transformed to do that insert at the start >>>>of >>>> >>>the >>>> >>> document instead at version 101 (because the content has >>>>disappeared) >>>> >>> [00:26] <josephg> and it never goes back to the middle of the >>>>document. >>>> >>> [00:27] <josephg> so, with tombstones you can get around that by >>>> >>>having a >>>> >>> 'resurrect' operation >>>> >>> [00:27] <josephg> (so deleting the whole document turns the whole >>>> >>>document >>>> >>> into tombstones, then we can resurrect them all again in the >>>>inverse) >>>> >>> [00:28] <josephg> but you can't invert an insert - because deleting >>>> >>>leaves >>>> >>> the tombstone there >>>> >>> [00:28] <josephg> and if you have a 'real delete' operation, then >>>>yeah, >>>> >>> you're back in the hole >>>> >>> [00:28] <josephg> also, with wave in particular, inverting is >>>>really >>>> >>> complicated >>>> >>> [00:29] <josephg> - see, if the wave says "<annotation >>>>bold:true>blah >>>> >>> blah<annotation bold:false> not bolded" >>>> >>> [00:29] <josephg> then if you insert at the end of the "blah blah", >>>> >>>it'll >>>> >>> automatically get bolded. >>>> >>> [00:30] <josephg> ... so if the text isn't bolded, and then you >>>>bold it >>>> >>> while I insert at the end of the text, you need to make sure my >>>>text >>>> >>> _isn't_ bolded or something >>>> >>> [00:31] <josephg> .... and yeah, I can't remember - but there's >>>>these >>>> >>> horror cases that I remember kept me from sleeping when I tried to >>>> >>> reimplement wave's OT code in C >>>> >>> [00:31] <alown> hmm >>>> >>> [00:31] <josephg> and it would have been fine if it wasn't >>>>invertible. >>>> >>> Well, at least it would have been tollerable. >>>> >>> [00:33] <josephg> So yeah. Conclusion: You can make invertability >>>> >>>work, but >>>> >>> its kind of a bitch, and you can't make it work for TP2 >>>> >>> [00:33] <josephg> which means it won't work if we're federating >>>> >>> [00:33] <alown> How are we hacking around that currently then? >>>> >>> [00:33] <josephg> well, we don't do TP2 >>>> >>> [00:34] <josephg> remember, federation just uses a bad version of >>>>the >>>> >>> current client-server protocol >>>> >>> [00:34] <josephg> - arranged in a tree of servers >>>> >>> [00:34] * alown goes and looks up which one TP2 was again >>>> >>> [00:35] <josephg> ... its the one that says you don't need a >>>>canonical >>>> >>> ordering of operations >>>> >>> [00:35] <josephg> sharejs and wave both use the server to pick the >>>> >>>order of >>>> >>> operations (based on which order they reach the server) >>>> >>> [00:35] <josephg> and then they use incrementing version numbers >>>>based >>>> >>>on >>>> >>> that order >>>> >>> [00:35] <alown> ah yep. >>>> >>> [00:35] <josephg> -> for p2p, that doesn't work because you don't >>>>have >>>> >>>a >>>> >>> centralized server, and anyone can send messages to anyone >>>> >>> [00:36] <josephg> and yeah, you need TP2 for that (which sort of >>>>says >>>> >>>you >>>> >>> can apply ops from 3 different sites in any order and it still >>>>works) >>>> >>> [00:37] <josephg> - and apparently someone proved that if you make >>>>it >>>> >>>work >>>> >>> for 3 sites, it works for any number of sites >>>> >>> [00:43] <alown> Anyhow, I can see leaving inversion out for >>>> >>>simplicity, but >>>> >>> don't yet understand why it can't be made to work with TP2. >>>> >>> [00:59] <alown> Hmm. Seen 'A Sequence Transformation Algorithm for >>>> >>> Supporting Cooperative work on Mobile Devices'? >>>> >>> [01:02] <josephg> >>>> >>> >>>> >>> >>>> >>> >>>> >>>>http://research.microsoft.com/en-us/um/redmond/groups/connect/cscw_10/do >>>> >>>cs/p159.pdf >>>> >>> ? >>>> >>> [01:15] <alown> The main feature is its use of storing local/remote >>>> >>> operations and processing them much later than receipt time. >>>> >>> [01:17] <alown> ABT satisfies TP1+2, so looks like this should(?) >>>> >>> [01:19] <josephg> need to read it >>>> >>> [01:19] <josephg> ... I'll go through it later >>>> >>> >>>> >>> >>>> >>> *6) Usability of a pure p2p system in Real Life (tm):* >>>> >>> [12:13] <alown> We also don't know if storing ops in a DHT is >>>>efficent >>>> >>> enough for our use case... >>>> >>> [12:14] <stenyak> in any case, let's say i fire up my wavep2p >>>>android >>>> >>> client and want to check for any new waves >>>> >>> [12:14] <stenyak> i definitely won't put up with a wait of 30 >>>>seconds >>>> >>>when >>>> >>> i have "this damn fast 4g connection!" in my cellphone >>>> >>> [12:14] <stenyak> i mean, that's the point of view of six pack joe >>>> >>> [12:14] <stenyak> and joe is definitely right.. >>>> >>> [12:15] * alown thinks of the hours it took to download the bitcoin >>>> >>> blockchain from the p2p system >>>> >>> [12:15] <stenyak> or browse through freenet, or whatever... its >>>>painly >>>> >>>slow >>>> >>> [12:16] <stenyak> in the end, i think that most users won't be >>>>running >>>> >>>a >>>> >>> full blown peer, but will be relying on an external server instead >>>> >>> [12:16] <stenyak> i.e. nobody runs their own email servers nowadays >>>> >>> [12:16] <stenyak> and the same can happen with wave >>>> >>> [12:16] <alown> Should a mobile client be doing the full p2p >>>> >>>federation, or >>>> >>> simply talking to a server which does it... >>>> >>> [12:16] <stenyak> the few who decide to run a full-blown wave peer, >>>> >>>should >>>> >>> be aware of the problems >>>> >>> [12:17] <alown> So, this should be less of a problem since the only >>>> >>>nodes >>>> >>> doing p2p will be proper full-time connected servers? >>>> >>> [12:17] <stenyak> the thing is, we can assume most people wont >>>>fire up >>>> >>> their own xmpp server, but go for jabber.org account >>>> >>> [12:17] <stenyak> and the same thing will presumably happen for >>>>wave, >>>> >>> simply because it's easier to do >>>> >>> [12:18] <stenyak> which doesn't pervent me from running my own >>>> >>>full-blown >>>> >>> wave server >>>> >>> [12:18] <stenyak> but that's a use case in which the user knows the >>>> >>> limitations >>>> >>> [12:19] <stenyak> [...] you and i will run several full-blown wave >>>> >>>peers at >>>> >>> home, at our parent's house, or whatever, but we'll know and >>>>accept the >>>> >>> problems >>>> >>> [12:19] <stenyak> i think that's the way to think about the problem >>>> >>> [12:19] <stenyak> heck, most people use github for permanent [git] >>>> >>> connectivity ;-) >>>> >>> [12:19] <stenyak> instead of opening ports to their laptop in >>>>their lan >>>> >>> [12:19] <stenyak> and those are the tech-savvy people... >>>> >>> [12:20] <alown> So, we have a p2p system between wave servers and >>>> >>>superwave >>>> >>> servers, with clients connecting to the server rather than doing >>>>the >>>> >>>p2p >>>> >>> itself... >>>> >>> [12:20] <stenyak> i'm not saying it's the way we should do it. i'm >>>> >>>saying >>>> >>> that's the way it most probably will pan out, because it's already >>>> >>> hapennign in 100% of the existing p2p protocols i know of >>>> >>> [12:20] <alown> Hmm... >>>> >>> [12:21] <stenyak> so we should plan for that instead of a >>>>theoretical >>>> >>>pure >>>> >>> p2p world >>>> >>> [12:21] <stenyak> if we assume there's servers like github, >>>>bitbucket >>>> >>>and >>>> >>> sourceforge, then suddently most of the problems go away, while >>>>still >>>> >>>not >>>> >>> preventing from people to run fully p2p if they want >>>> >>> >>>> >>> >>>> >>> *7) Comparison with BitTorrent and P2P-TV technologies:* >>>> >>> [12:21] <alown> BT doesn't have huge servers (and with magnet has >>>> >>>actually >>>> >>> move in the opposite direction). >>>> >>> [12:21] <stenyak> BT has no real-time needs >>>> >>> [12:22] <stenyak> that's why they can afford DHT >>>> >>> [12:22] <stenyak> dht could be used for simulating a forum-like >>>> >>>discussion >>>> >>> in wave. but we can't force that restriction from the server >>>> >>> [12:22] <stenyak> (i say forum-like, because people don't expect >>>> >>>reaction >>>> >>> within seconds there) >>>> >>> [12:23] <alown> How did iplayer do its live p2p broadcastinºg? >>>> >>> [12:23] * stenyak googles what iplayer is >>>> >>> [12:23] <alown> Sorry, BBC iPlayer is their TV-over-the-internet >>>> >>>system. >>>> >>> [12:24] <alown> Originally it used a p2p system, but got lots of >>>> >>>negative >>>> >>> press (because of assosciation with BT since it used p2p), so it >>>>now >>>> >>>uses a >>>> >>> centralized system instead. (And their bandwidth costs are much >>>> >>>higher). >>>> >>> [...] >>>> >>> [12:25] <stenyak> i seem to recall other [p2p] tv clients >>>> >>> [12:25] <stenyak> >>>> >>> >>>> >>> >>>> >>> >>>> >>>>http://wiki.xbmc.org/index.php?title=HOW-TO:Play_free_P2P_(peer-to-peer) >>>> >>>_online_streaming_TV >>>> >>> [...] >>>> >>> [12:26] <alown> Found a paper titled "RT-P2P: A Scalable Real-Time >>>> >>> Peer-to-Peer System with Probabilistic Timing Assurances" (google >>>>for >>>> >>>it) >>>> >>> [12:28] <alown> Lookt at the paper I mentioned. It relies on 'super >>>> >>>nodes' >>>> >>> to enable it to keep low latencies... >>>> >>> [...] >>>> >>> [12:27] <stenyak> but i'd be wary of using this (p2p tv) as an >>>> >>>inspiration. >>>> >>> i know there's delay of 10-30 seconds from my TV Formula1 image to >>>>the >>>> >>> telemetry that comes through HTTP from formula1.com website. this >>>>is >>>> >>> regular TV, and they don't care about 30 seconds of lag >>>> >>> [12:27] <stenyak> the only real problem of p2p tv is avoiding much >>>> >>>jitter >>>> >>> [12:27] <stenyak> as long as the stream arrives and is viewable, a >>>> >>>delay of >>>> >>> a minute doesn't matter that much >>>> >>> [12:28] <alown> True. >>>> >>> >>>> >>> >>>> >>> *8) Identifying participants (part 1):* >>>> >>> [12:09] <alown> I am also no longer sure what an 'account' should >>>>look >>>> >>> like, since it has no reason to be stuck to a domain... >>>> >>> [12:10] <stenyak> current wave discovery works by using the domain >>>> >>>name of >>>> >>> the email-address-like list of participants >>>> >>> [12:10] <stenyak> but here we're talking about hashes, public keys >>>>or >>>> >>> whatever >>>> >>> [12:10] <stenyak> which do not (necessarily) point to an particular >>>> >>>IP:PORT >>>> >>> or whatever >>>> >>> [12:10] <alown> Exactly the problem... >>>> >>> *...8) Identifying participants (part 2):* >>>> >>> [12:33] <stenyak> would it make sense that, while some >>>>participants are >>>> >>> identified by a pubkey (or whatever), many of them could be >>>>identified >>>> >>>by a >>>> >>> user@domain address, with which any peer can quickly locate >>>> supernodes? >>>> >>> [12:33] <stenyak> i mean some kind of dual "pubkey and optional >>>>domain >>>> >>> email-like addr" for the participants list >>>> >>> [12:34] <stenyak> the optional part being essential in the broader >>>> >>>internet >>>> >>> [12:34] <alown> Isn't that exactly what using Mozilla Persona >>>>would do >>>> >>>(map >>>> >>> user@domain to some public-key we can use) >>>> >>> [12:34] <alown> Removing the need for us to have to roll >>>>yet-another >>>> >>> authentication system. >>>> >>> [...] >>>> >>> [12:38] <stenyak> the idea would be that, for a person to be a >>>> >>>participant >>>> >>> in a wave, you *require* his pubkey. optionally, you may have >>>>acquired >>>> >>>ths >>>> >>> pubkey by asking "wave.google.com" about the user "joe", getting >>>>his >>>> >>> pubkey >>>> >>> as a result. >>>> >>> [12:39] <stenyak> and now that you have the pubkey and one of many >>>> >>>possible >>>> >>> email-like addresses (in this case j...@wave.google.com), then you >>>>can >>>> >>>use >>>> >>> the email-like address for displaying in the UI >>>> >>> [12:39] <stenyak> this means that, whoever wants to run pure p2p >>>>peers, >>>> >>> will have to give his pubkey >>>> >>> [12:39] <stenyak> and whoever uses the more traditional style, can >>>> >>>simply >>>> >>> give his email-like addr >>>> >>> [12:39] <stenyak> and the participants list will show a simple >>>> >>>email-like >>>> >>> address most of the time >>>> >>> [12:40] <alown> Do we then allow anyone to 'log in' to any wave >>>>server >>>> >>> running at any domain, since it should no-longer make any >>>>difference >>>> >>>where >>>> >>> they are in the network... >>>> >>> [12:41] <stenyak> yes, that's needed for world-wide-public waves, >>>> >>>which is >>>> >>> equivalent to a read-only forum on the net >>>> >>> [12:41] <stenyak> then there could be server-public waves, which is >>>> >>> equivalent to requiring sign-in to view a forum (and >>>>coincidentally the >>>> >>> current implementation of public waves in WiaB, right?) >>>> >>> [12:43] * alown has never tested what happens with public waves in >>>>the >>>> >>> current federation system >>>> >>> *...8) Identifying participants (part 3): >>>> >>> * >>>> >>> [21:35] <josephg> - Who is a user? If a user is >>>>sten...@example.com, >>>> >>>then >>>> >>> we can put a server at example.com and it can hold operations for >>>>you >>>> >>> [21:36] <josephg> ie, if I add you to a wave, my computer (or my >>>>wave >>>> >>> server or something) can send a message to example.com to say "Yo, >>>> >>>here's >>>> >>> some ops you should know about" >>>> >>> [21:36] <josephg> that would be similar to a mailbox >>>> >>> [21:37] <josephg> ... and it would work pretty well. Bear in mind >>>>that >>>> >>> there's no reason operations have to go through the wave server at >>>> >>> example.com - if we're both on a LAN together, we could discover >>>>one >>>> >>> another through DNS service discovery and send ops directly >>>> >>> [21:37] <josephg> .. without going through our respective wave >>>>servers >>>> >>> [21:38] <josephg> However - if our identities aren't tied to a >>>>domain >>>> >>>(eg >>>> >>> bitcoin), then we'll need to use a dht or something. >>>> >>> [21:42] <stenyak> the conclussion i've arrived at is that "users" >>>> >>> ultimately are a publickey (for which they have the privatekey). >>>>this >>>> >>>is >>>> >>> inconvenient for people to "add you to a wave", so a possibility >>>>would >>>> >>>be >>>> >>> to have a friendlyname=>pubkey server converter. this way people >>>>can >>>> >>>add " >>>> >>> sten...@example.com", by first finding out what the pubkey for >>>> >>> sten...@example.com really is >>>> >>> [21:43] <stenyak> the friendlyname would be optional, and in LAN >>>> >>> environments you could directly use the pubkey (instead of the >>>>friendly >>>> >>> name) >>>> >>> [21:43] <josephg> I think people will be more than happy to use a >>>> >>>frienly >>>> >>> name in a lan environment too >>>> >>> [21:43] <stenyak> discovery in a local network could be done with >>>> >>>bonjour >>>> >>> or something too (not just dns) >>>> >>> [21:44] <josephg> I <3 dns-sd >>>> >>> [21:44] <stenyak> [...] maybe they already have a contact list >>>>(read, >>>> >>>list >>>> >>> of friendlyname<>pubkey equivalences) they can use in the UI (even >>>>if >>>> >>>the >>>> >>> underlying system will use pubkeys anyway) >>>> >>> [21:44] <stenyak> and by contact list, i really mean a cache of >>>>some >>>> >>>sort >>>> >>> [21:45] <stenyak> (not some specific, complex roster system) >>>> >>> [21:45] <josephg> and you can do friendlyname -> pubkey really >>>>easily >>>> >>>by >>>> >>> just storing the pubkey on the user's domain >>>> >>> [21:45] <josephg> so, have the example.com webserver host >>>> >>> https://example.com/.wellknown/stenyak >>>> >>> [21:46] <josephg> = your public key. >>>> >>> >>>> >>> >>>> >>> *9) P2P anonymity (peers that want to anonymously lurk in a wave) >>>>(part >>>> >>> 1):* >>>> >>> [12:48] <stenyak> by the way, what about non-participants that >>>>simply >>>> >>>want >>>> >>> to lurk a wave? >>>> >>> [12:49] <stenyak> e.g. i'm given a wave uri >>>> >>> (wave://look_at_these_kittens_wave), and want to view it >>>> >>> [12:49] <alown> Whilst a wave is public, as soon as they 'read' >>>>the >>>> >>>wave, >>>> >>> they would have a metadata wavelet created, so would become a >>>> >>>participant >>>> >>> (if read-only). >>>> >>> [12:50] <stenyak> and from then on, whenever the wave changes, >>>>someone >>>> >>>will >>>> >>> try to make the change reach the peers with my privkey >>>> >>> [12:50] <stenyak> supposedly.. >>>> >>> *...9) P2P anonymity (peers that want to anonymously lurk in a >>>>wave) >>>> >>>(part >>>> >>> 2):* >>>> >>> [21:18] <josephg> stenyak: interesting point about people who want >>>>to >>>> >>>not >>>> >>> participate but follow a wave anyway - its really bad if other >>>>people >>>> >>>can >>>> >>> tell that they're there (assuming the wave is public). >>>> >>> [21:18] <josephg> I guess we just need to make sure that the >>>>metadata >>>> >>>wave >>>> >>> is invisible, and then its ok.. >>>> >>> [21:21] <stenyak> invisible.. to what peer/s? surely those that are >>>> >>> transmitting deltas to the lurkers will need to know they exist? >>>> >>> [21:21] <stenyak> (maybe some of the algorithms behind freenet can >>>>help >>>> >>> with this) >>>> >>> [21:21] <stenyak> (or even TOR) >>>> >>> >>>> >>> >>>> >>> *10) Encryption of waves:* >>>> >>> [21:47] <josephg> for waves themselves, I'm imagining giving each >>>>wave >>>> >>>an >>>> >>> AES key >>>> >>> [21:47] <josephg> then storing an encrypted version of the key for >>>>each >>>> >>> participant on the wave >>>> >>> [21:48] <josephg> .... anyway, that way anyone who has the AES key >>>>can >>>> >>>read >>>> >>> all ops on the wave >>>> >>> [21:48] <josephg> and can participate (because they can encrypt ops >>>> >>>for the >>>> >>> wave) >>>> >>> >>>> >>> >>>> >>> *11) Addition and removal of participants, and their ability to >>>>read >>>> >>>past >>>> >>> and future wave versions/deltas:* >>>> >>> [21:48] <stenyak> what about removing a user from a wave? >>>> >>> [21:49] <josephg> worst case, we can just make a new key and re-add >>>> >>> everyone using the new key >>>> >>> [21:49] <josephg> and keep around the old key too >>>> >>> [21:49] <josephg> so people can still read the old ops as well >>>> >>> [21:49] <stenyak> the user can access their browser cache for all >>>>we >>>> >>>care.. >>>> >>> if you ever read it, there will be ways to do it. "download now >>>> >>>wave-spy to >>>> >>> read waves you were removed from!" >>>> >>> [21:49] <stenyak> so providing an official way sounds better >>>> >>> [21:50] <stenyak> the AES key could change at any point in time, >>>>e.g. >>>> >>> whenever a new users is added (to prevent them accessing the >>>>history), >>>> >>>or >>>> >>> deleting them (to prevent them from reading future history) >>>> >>> [22:32] <josephg> um - in wave, we let new users see the whole >>>>history >>>> >>> [22:40] <stenyak> but that use case could be desirable, right? and >>>>if >>>> >>>we >>>> >>> support modification/versioning of the AES key, we might as well >>>>allow >>>> >>>that >>>> >>> too? the equivalent in email world would be to forward an email, >>>> >>>removing >>>> >>> the existing quotes >>>> >>> [23:17] <josephg> Yep definitely. >>>> >>> >>>> >>> >>>> >>> -- >>>> >>> Saludos, >>>> >>> Bruno González >>>> >>> >>>> >>> _______________________________________________ >>>> >>> Jabber: stenyak AT gmail.com >>>> >>> http://www.stenyak.com >>>> >>> >>>> > >>>> >>>> >>>> > >