
chainHead_storage: Support more resilient pagination of descendants? #85

Open
jsdw opened this issue Aug 29, 2023 · 9 comments

Comments

@jsdw
Collaborator

jsdw commented Aug 29, 2023

In the legacy RPC methods, a user could call state_getKeysPaged with some prefix, a starting key, and the number of keys to fetch. They could then paginate over the child keys/values by tracking the last key they had seen and using it to request the next batch of keys (and obtain values for them). This is resilient to network interruptions and the like; the client can always pick up from the last key it is interested in.
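The legacy pagination loop described above might be sketched like this (`rpc` here is a placeholder for whatever JSON-RPC client you use; only the method name and parameter order come from the legacy API):

```python
# Sketch of the legacy pagination loop. `rpc` is a stand-in for a
# JSON-RPC client call; state_getKeysPaged takes (prefix, count, startKey).
def fetch_all_keys(rpc, prefix, page_size=100):
    """Collect every key under `prefix`, resuming from the last seen key."""
    keys = []
    start_key = None
    while True:
        page = rpc("state_getKeysPaged", [prefix, page_size, start_key])
        keys.extend(page)
        if len(page) < page_size:
            return keys  # short page => no more keys
        start_key = page[-1]  # resume after the last key we saw
```

If the connection drops, the loop can simply be restarted with the last `start_key` it recorded.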

Using the new chainHead_storage API, a couple of things make iterating over descendant keys/values harder:

  • There is no way to ask for descendant keys only (I guess I'd ask for descendantsHashes and then just ignore the hashes and look at the keys, but this is less than ideal).
  • There is no way to pick up from where I left off in case the chainHead subscription I'm relying on stops for any reason (eg a network interruption, or pinned blocks getting too old).

Proposals

Being able to provide a startKey to the chainHead_storage items of type descendantsValues and descendantsHashes would probably go most of the way here (I'm not super sure it's useful to be able to fetch keys without fetching values in the usual case of paginating over some descendant results).

I also wondered about supporting a new type like descendantsKeys in storage items. This would return only the keys, and no values (so that you can eg download all of the keys and then ask for values later as a user paginates through some interface or whatever). If the startKey idea is acceptable, then this type would also accept a startKey, aligning with the other descendants* types and allowing reliable pagination/resuming when fetching keys as well as values or hashes.
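For illustration, a request carrying the proposed field might look something like this (the method name, parameter layout, and the startKey field itself are all sketches of the idea, not an agreed spec):

```python
# Hypothetical request shape for resuming a descendants scan.
# "startKey" is the proposed addition; the method name and the rest of
# the layout approximate the existing chainHead_storage shape.
import json

request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "chainHead_unstable_storage",
    "params": {
        "followSubscription": "sub-id",
        "hash": "0x...block-hash...",
        "items": [{
            "key": "0x...prefix...",
            "type": "descendantsValues",
            "startKey": "0x...last-key-seen...",  # proposed: resume after this key
        }],
    },
}
print(json.dumps(request))
```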

What do you guys reckon?

/cc @josepot @tomaka

@jsdw jsdw changed the title chainHead_storage: Support user pagination of descendents? chainHead_storage: Support more resilient pagination of descendants? Aug 29, 2023
@josepot
Contributor

josepot commented Aug 29, 2023

I think that this section of the post on the polkadot forum that @tomaka wrote partially covers this:

Many of the functions assume that the implementation is a full node, and light clients weren’t taken into account when the legacy JSON-RPC API has been designed. A good example of this is maybe the state_getKeysPaged function. On a full node, this function is implemented by simply reading from the database and returning the result. On a light client, this function is implemented by querying all the keys of the given prefix from the networking, then filtering out the keys that aren’t in the requested page. If a JSON-RPC client calls this function multiple times, like you are supposed to do, the light client downloads all the keys multiple times, which is incredibly wasteful. Several other functions such as state_queryStorage simply can’t be implemented by a light client at all.

@jsdw
Collaborator Author

jsdw commented Aug 29, 2023

Ooh that is interesting indeed, thanks for referencing it!

@josepot
Contributor

josepot commented Oct 4, 2023

kinda related: smol-dot/smoldot#692

@bkchr
Member

bkchr commented Oct 10, 2023

On a light client, this function is implemented by querying all the keys of the given prefix from the networking, then filtering out the keys that aren’t in the requested page. If a JSON-RPC client calls this function multiple times, like you are supposed to do, the light client downloads all the keys multiple times, which is incredibly wasteful.

This statement is true. However, if we take into account what @jsdw proposed, we can have pagination on the light client as well. Using the startKey plus some upper bound on the number of items we are interested in, the light client can request this data from a full node. The next startKey would then be, for example, the last key returned by the first call to this RPC method. This probably requires a new light client network message, but that shouldn't be a problem.

@tomaka WDYT?

@tomaka
Contributor

tomaka commented Oct 10, 2023

Yeah this would require polkadot-fellows/RFCs#9

The problem in play here is that with polkadot-fellows/RFCs#9 the light client already has to do pagination with the networking requests. The responding node can cut the response at any point so that it fits in the maximum response size.

If on top of this the JSON-RPC client also does its own pagination (trying to be smart and all), then you end up doing many more networking requests than if the JSON-RPC client didn't do pagination.

It's all complicated and I have many other things to focus on so I haven't given this problem a lot of attention.

@bkchr
Member

bkchr commented Oct 10, 2023

Instead of letting the RPC client control the pagination, which is really sub-optimal as you explained, the strategy should be to let the server/packet size decide. If the server is chunking the answers anyway, could we not send these chunks to the JSON-RPC client as we get them? (For sure, some packets may only contain trie nodes, which we would skip.)
We could even do this "pagination" by having the RPC client request the next chunk once it has "consumed" (i.e. received) all the data of the current message, like some kind of back pressure.
Or we do it even more simply: send back the results as they fit into one network packet, and then the RPC client needs to call the function again with the next startKey.
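The back-pressure variant above could be sketched roughly like this (`fetch_chunk`, `send_to_client`, and `wait_for_ack` are illustrative stand-ins, not real APIs):

```python
# Sketch of the back-pressure idea: the server forwards each
# network-sized chunk as it arrives and only fetches the next chunk
# from its peer after the JSON-RPC client has acknowledged the
# previous one. All three callables are hypothetical.
def serve_storage_chunks(fetch_chunk, send_to_client, wait_for_ack):
    start_key = None
    while True:
        chunk, start_key = fetch_chunk(start_key)  # one network packet's worth
        if not chunk:  # peer has no more entries
            send_to_client({"event": "done"})
            return
        send_to_client({"event": "items", "items": chunk})
        wait_for_ack()  # back pressure: client pulls the next chunk
```

The page size is entirely server-driven here; the client never guesses a count.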

@tomaka
Contributor

tomaka commented Oct 10, 2023

But then there's also the possibility for the light client to split the request itself.
For example if the light client is connected to 3 peers, and you request 3 keys, it could ask for a proof from each peer in parallel.

If the JSON-RPC API doesn't support pagination (as is the case right now), then the light client is free to do that and this can be left as an implementation detail. But if the JSON-RPC API is aware of pagination, then it becomes impossible.

Again this strikes me as a really complicated problem and to me it seems that there are many more ramifications than I can think of.

A good way to solve this problem overall would be to make the size of a Merkle proof more or less predictable, which could be done by merkleizing the storage values, but that would be a huge change and we still haven't even finished the previous trie migration from a few years ago.

@bkchr
Member

bkchr commented Oct 10, 2023

A good way to solve this problem overall would be to make the size of a Merkle proof more or less predictable, which could be done by merkleizing the storage values,

What do you mean by this? Splitting up every storage value into chunks of the same size and putting these chunks into the trie as well? That sounds quite costly.

But then there's also the possibility for the light client to split the request itself. For example if the light client is connected to 3 peers, and you request 3 keys, it could ask for a proof from each peer in parallel.

If the JSON-RPC API doesn't support pagination (as is the case right now), then the light client is free to do that and this can be left as an implementation detail. But if the JSON-RPC API is aware of pagination, then it becomes impossible.

I don't really see why this would be a problem? Let's assume we make the JSON-RPC API "page aware". This means the RPC server would return, alongside the values, a finished key/value that tells the client whether this was the last piece of the request. The page size would be opaque to the client and dependent on the server implementation, though defining an upper maximum would probably be a good idea. The client could then request the next page by calling the RPC again. (Similar to what we already have for these long running subscriptions.)
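The client side of such a "page aware" API could be as simple as the following sketch (`rpc` and the response shape with `items`/`finished` are assumptions for illustration):

```python
# Client loop for a hypothetical "page aware" API: the server returns
# a `finished` flag alongside each page, the page size is opaque to
# the client, and the client resumes by passing the last key it saw.
def collect_pages(rpc, prefix):
    items, start_key = [], None
    while True:
        resp = rpc(prefix, start_key)  # server picks the page size
        items.extend(resp["items"])
        if resp["finished"]:
            return items
        start_key = resp["items"][-1]["key"]  # resume after the last key
```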

@tomaka
Contributor

tomaka commented Oct 11, 2023

What do you mean by this? Splitting up every storage value into chunks of the same size and putting these chunks into the trie as well? That sounds quite costly.

I mean that every storage value would be a mini trie when it comes to how it's stored.
I have the impression that most storage values are very small (a public key, an account balance, etc.), in which case they would fit completely into one node. This change would only be relevant for larger values.
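As a rough illustration of the idea (this is not the actual Substrate trie format, and the chunk size and hash choice are arbitrary): a large value is split into fixed-size chunks that are merkleized, so small values fit in a single leaf and a proof of any one chunk only touches a logarithmic number of hashes.

```python
# Illustrative mini-trie over a single storage value: split the value
# into fixed-size chunks and hash them into a binary Merkle tree.
# CHUNK and blake2b are arbitrary choices, not the real on-chain format.
import hashlib

CHUNK = 32  # illustrative chunk size in bytes

def value_root(value: bytes) -> bytes:
    chunks = [value[i:i + CHUNK] for i in range(0, len(value), CHUNK)] or [b""]
    layer = [hashlib.blake2b(c, digest_size=32).digest() for c in chunks]
    while len(layer) > 1:
        if len(layer) % 2:
            layer.append(layer[-1])  # pad odd layers by repeating the last node
        layer = [hashlib.blake2b(layer[i] + layer[i + 1], digest_size=32).digest()
                 for i in range(0, len(layer), 2)]
    return layer[0]
```

A value of at most 32 bytes hashes to a single leaf, which matches the point that the change would mostly matter for larger values.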

I don't really see why this would be a problem? Let's assume we make the JSON-RPC API "page aware". This means the RPC server would return, alongside the values, a finished key/value that tells the client whether this was the last piece of the request. The page size would be opaque to the client and dependent on the server implementation, though defining an upper maximum would probably be a good idea. The client could then request the next page by calling the RPC again. (Similar to what we already have for these long running subscriptions.)

Let's say you want to download all the keys of the empty prefix (i.e. all the keys in the entire trie).

If you have for example 4 peers, since you want to avoid networking round trips and since you don't know which peer is slower (or even malicious) and which is faster, the best solution is to ask peer 1 for all the keys from [0] to [3ffff...], peer 2 for all the keys from [4] to [7ffff...], peer 3 for all the keys from [8] to [bffff...], and peer 4 for all the keys from [c] to [fffff...].

None of the four requests will return all the keys you asked for in a single response, but then you continue doing requests four by four until you have all of them. When you're done with one of the peers, you can split the remaining requests even more and continue using that peer for another list of keys.
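The splitting strategy above amounts to dividing the top-level key space among the available peers, roughly like this (a sketch; the real scheme would split on full key prefixes, not just the first hex nibble):

```python
# Sketch of the split: divide the top-level key space (first hex
# nibble, 0x0..0xf) into contiguous ranges, one per peer, so all
# peers can be queried in parallel.
def split_key_space(n_peers: int):
    """Return (start_nibble, end_nibble) ranges covering 0x0..0xf."""
    base, extra = divmod(16, n_peers)
    ranges, start = [], 0
    for i in range(n_peers):
        width = base + (1 if i < extra else 0)
        ranges.append((start, start + width - 1))
        start += width
    return ranges
```

When one peer finishes its range early, its share of the remaining ranges can be subdivided again and handed to it.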

I have no idea how that algorithm would translate with a JSON-RPC API that has pagination.

Though, defining an upper maximum would probably be a good idea.

Again, the JSON-RPC client has no way to know which upper maximum is appropriate. What will most likely happen is that it will in total require more networking requests (typically twice as many requests) to download everything, compared to not providing a maximum.
