Hi Chris,

Is there a basis for calculating why there should only be 40 requests for authorized participants at the end? Also, is the Query_String limited to some maximum size?

When I benchmark it, even with the maximum utilization of 150 ASN numbers in the query list for a large IX like DE-CIX, I see about ten queries with ASN_LIST, including the IX and NetIX queries. With that, we would have already exhausted 25% of the volume.

My general suggestion would be that we leave a bit more headroom for requests in the same period without a self-throttling penalty. The target value should conclude at 10% of the queries for the largest IX as a variable; therefore, in 2022, at least 100 ~ 120 requests per minute shall be allowed.

I wrote https://github.com/ipcjk/ixgen half a decade ago (god, how time flies). I patched in the API keys yesterday; ASN_LIST will also be included in the next release. However, there is another significant advantage; the thing works with a local cache of the JSON files from PeeringDB. It can be used as a simple API server directly as a binary with compatible queries. So you can quickly get rid of 1000+ queries in a few seconds without SQL, other dependencies, and bugging the original peeringDB-source.

BR Jörg

On 31 May 2022, at 21:31, Chris Caputo wrote:

After the initial introduction of PeeringDB API throttling, some software
both open source and private, has been identified and updated. (open
source details are below; please upgrade and encourage others to do so)

This API throttling is being implemented to control costs by encouraging
efficient software design while making sure the PeeringDB resource is
shared well. The use of API keys is being encouraged so that admins can reach out to users/orgs with runaway or inefficient software, and because it is more secure than user/pass. In addition, org API keys ease employee
transitions.

Some tips for coders is below.

API throttling in place today:

- repeated anonymous identical requests with a response size above 100k
    are being limited to 1/hour
- repeated anonymous identical requests of any size are being limited to
    2/minute
  - anonymous queries are being limited to 400/minute per IP address
  - authenticated queries are being limited to 500/minute per user/org

Here is the current schedule of throttling changes. The schedule may
adjust as needed as new packages that need update are discovered, so as to
minimize disruption to the community...

On June 27th, adjust and watch for feedback from the community:

  - anonymous queries limited to 300/minute per IP address
  - authenticated queries limited to 400/minute per user/org

On July 11th, adjust and watch for feedback from the community:

  - anonymous queries limited to 200/minute per IP address
  - authenticated queries limited to 300/minute per user/org

On July 18th, adjust and watch for feedback from the community:

  - anonymous queries limited to 100/minute per IP address
  - authenticated queries limited to 200/minute per user/org

On July 25th, adjust and watch for feedback from the community:

  - anonymous queries limited to 50/minute per IP address
  - authenticated queries limited to 100/minute per user/org

On August 1st, adjust and watch for feedback from the community:

  - anonymous queries limited to 30/minute per IP address
  - authenticated queries limited to 80/minute per user/org

On August 8th, adjust and watch for feedback from the community:

  - anonymous queries limited to 20/minute per IP address
  - authenticated queries limited to 60/minute per user/org

On August 15th, adjust and watch for feedback from the community:

  - anonymous queries limited to 10/minute per IP address
  - authenticated queries limited to 40/minute per user/org

Feedback/questions/concerns welcome.

Thanks,
Chris

Software:

- arouteserver v1.16.0: has many updates including API key support along
  with more efficient querying.

- PeerFinder: API key & efficient querying patches at
  https://github.com/rucarrol/PeerFinder/pull/17 will hopefully be
  integrated.

Coding tips:

- Begin using a PeeringDB API key for all requests:

    https://docs.peeringdb.com/howto/api_keys/

- Begin performing actual caching, such as by using peeringdb-py.

    http://peeringdb.github.io/peeringdb-py/

- If unable to use a caching agent such as peeringdb-py:

   - Use an API key.

   - Set a User-Agent: header.

   - Use bulk queries (asn__in=$list_of_ASN_separated_by_comma) by
     querying 30 to 150 ASNs at a time (tune as appropriate).

   - Add a delay in between queries that is randomly between 2 and 2.5
     seconds, to reduce thundering herd.
_______________________________________________
Pdb-tech mailing list
[email protected]
https://lists.peeringdb.com/cgi-bin/mailman/listinfo/pdb-tech
_______________________________________________
Pdb-tech mailing list
[email protected]
https://lists.peeringdb.com/cgi-bin/mailman/listinfo/pdb-tech

Reply via email to