Description Usage Arguments Examples
Query a CDX index endpoint
1 2 3 4 |
cdx_api_endpoint |
the API endpoint to query. If using the Common Crawl CDX
index (or a CDX server that mimics the CC metadata services), it's a good
idea to use |
url |
host, url, wildcard to search for. e.g. |
include |
which fields to include in the output. The standard available
fields are usually: |
page |
page is the current page number, and defaults to 0 if omitted. If the page exceeds the number of available pages from the page count query, a 400 error will be returned. |
from, to |
Setting |
match_type |
Optional. If supplied, one of
As a shortcut, instead of specifying a separate |
limit |
limit the number of index lines returned. Limit must be set to a positive integer. If no limit is provided, all the matching lines are returned, which may be slow. |
sort |
Options. If supplied, one of:
Both options may be combined with |
1 2 | cidx <- fetch_collections_index()
rprj <- cdx_query(cidx$cdx_api[1], "*.r-project.org")
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.