To Curate
SPARQL queries and material to curate
This note lists the remaining query bundles and scratch materials that still need curation before they can be added to the cleaned query index.
Curation instructions
For each item below:
- Decide the endpoint first: `Wikidata Query Service`, `Qlever Wikidata`, `Qlever Metrin KG`, `IDSM/Sachem`, or `other`.
- Give the query a stable title in sentence case.
- Write a one-line description that states the input, the filter, and the main output.
- Keep the short query link if one exists. Remove raw embedded `?query=` URLs from the curated note unless the full text must be preserved.
- Record whether the item is a `query`, `query family`, `endpoint note`, `draft SPARQL block`, or `presentation/scratch material`.
- Deduplicate variants. If multiple links differ only by output format or a small filter, keep one parent title and list the variants under it.
- If the item is still exploratory or broken, keep it in this note and add a short reason.
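The checklist above can be captured as a small record with a validation pass. The following is a minimal sketch, not part of the original workflow: the class name `CurationItem`, its field names, and the helpers `problems` and `unique_variants` are hypothetical, while the allowed endpoint and item-type values are copied from the list above.

```python
from dataclasses import dataclass, field
from typing import List

# Allowed values copied from the curation instructions.
ENDPOINTS = {
    "Wikidata Query Service",
    "Qlever Wikidata",
    "Qlever Metrin KG",
    "IDSM/Sachem",
    "other",
}
KINDS = {
    "query",
    "query family",
    "endpoint note",
    "draft SPARQL block",
    "presentation/scratch material",
}


@dataclass
class CurationItem:
    """Hypothetical record for one item awaiting curation."""

    title: str
    endpoint: str
    kind: str
    description: str
    short_link: str = ""
    variants: List[str] = field(default_factory=list)
    reason_kept: str = ""  # why an exploratory or broken item stays in this note

    def problems(self) -> List[str]:
        """Return a list of checklist violations (empty if none were found)."""
        issues = []
        if self.endpoint not in ENDPOINTS:
            issues.append(f"unknown endpoint: {self.endpoint!r}")
        if self.kind not in KINDS:
            issues.append(f"unknown item type: {self.kind!r}")
        if not self.title.strip():
            issues.append("missing title")
        if "\n" in self.description.strip():
            issues.append("description must fit on one line")
        for link in [self.short_link, *self.variants]:
            if "?query=" in link:
                issues.append("raw ?query= URL should be replaced by a short link")
        return issues

    def unique_variants(self) -> List[str]:
        """Drop repeated variant links and any copy of the parent link, keeping order."""
        kept = (v for v in self.variants if v != self.short_link)
        return list(dict.fromkeys(kept))
```

This only catches exact duplicate links. Variants that differ by output format or a small filter still need a manual decision, as described in the deduplication step.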
Moved to cleaned for manual review
- IDSM-assisted compound occurrence lookup
- Referenced structure-organism pairs on Wikidata
- ChEBI substructure search through Sachem
Queries and material requiring curation
IDSM and Sachem material
- Sachem non-permanent endpoint note
  - Source material: https://idsm.elixir-czech.cz:2443/sachem/#/search.
  - Curation instruction: keep only if the endpoint is still relevant as a manual search tool; otherwise move it to a tools/endpoints note instead of the query index.
- Sachem grand parents
  - Source material: https://w.wiki/32Wz.
  - Curation instruction: determine whether this is a structure-search query, a taxonomic summarisation query, or both; rename accordingly and note required inputs.
Generic LOTUS query bundle
- Unnamed LOTUS query family
  - Source material: https://w.wiki/$Q$, https://w.wiki/$R3, https://w.wiki/$RD, https://w.wiki/$RR, https://w.wiki/$RU, and https://w.wiki/$SF.
  - Curation instruction: open each link, identify what changes between the variants, group them under one parent title if they are the same query family, and keep only the variants that add a distinct result view or filter.
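To identify what changes between two variants once their SPARQL text has been copied out of the query editor, a line diff is usually enough. This is a minimal sketch using the standard-library `difflib`; the function name `variant_diff` and the sample queries are illustrative assumptions, not queries taken from the links above.

```python
import difflib
from typing import List


def variant_diff(parent: str, variant: str) -> List[str]:
    """Return only the lines removed from ("- ") or added to ("+ ") the parent query."""
    a = [line.strip() for line in parent.strip().splitlines()]
    b = [line.strip() for line in variant.strip().splitlines()]
    # ndiff also emits unchanged ("  ") and hint ("? ") lines; those are skipped.
    return [d for d in difflib.ndiff(a, b) if d.startswith(("- ", "+ "))]


# Illustrative queries only.
parent_query = """
SELECT ?compound ?taxon WHERE {
  ?compound wdt:P703 ?taxon .
}
"""
variant_query = parent_query + "LIMIT 100\n"

for change in variant_diff(parent_query, variant_query):
    print(change)
```

If the diff shows only a `LIMIT`, an output column, or a single filter, the variant most likely belongs under the parent title rather than as its own entry.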
Taxon and occurrence curation prototypes
- Quest / Daniel Mietchen prototype family
  - Source material: https://w.wiki/5a7K, https://w.wiki/5a8b, https://w.wiki/5a8g, https://w.wiki/5a8t, https://w.wiki/5a9B, https://w.wiki/5a9T, and https://w.wiki/5a9g, plus the raw SPARQL blocks in `wikidata.sparql.md`.
  - Curation instruction: split this family into clearly named subgroups such as `main-subject curation`, `QuickStatements generation`, and `taxon-compound literature matching`; mark which variants are working, experimental, or superseded.
- Not working / no results prototypes
  - Source material: the `quest query mod from Mietchen to compounds (not working)` and `No error but no results neither` blocks in `wikidata.sparql.md`.
  - Curation instruction: do not add these to the cleaned query list yet; instead summarise why they fail, what was tried, and which working variant replaced them.
- Curating query
  - Source material: the raw SPARQL block under `Curating query` and links https://w.wiki/5a9T and https://w.wiki/5a9g.
  - Curation instruction: title it based on outcome rather than process, for example by the property being curated (P703 or main subject), and state the QuickStatements output behaviour explicitly in the description.
Scholarly-article graph split notes
- Wikidata scholarly article split reminders
  - Source material: https://w.wiki/Eexo, https://w.wiki/Eexs, https://w.wiki/Eext, and https://w.wiki/Eexz, plus the discussion excerpt in `wikidata.sparql.md`.
  - Curation instruction: rewrite this as a short reference note explaining the graph split, when to use Qlever instead of WDQS, and which example query demonstrates the fix.
Embedded full query URLs
- Embedded query.wikidata.org URLs
  - Source material: long `query.wikidata.org/index.html#...` and `query.wikidata.org/embed.html#...` URLs in `wikidata.sparql.md`.
  - Curation instruction: replace each embedded URL with a short title and, if possible, a `w.wiki` short link; keep the embedded form only in a dedicated archive note if the exact serialized query text matters.
- Embedded qlever ?query= URLs
  - Source material: long `https://qlever.../?query=...` URLs in the Qlever section of `wikidata.sparql.md`.
  - Curation instruction: keep the stable short Qlever permalink as the curated link; archive the expanded URL only when it contains annotations or a version not represented by the short permalink.
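Before archiving or discarding an embedded URL, it helps to recover the readable SPARQL text it carries. The sketch below assumes the two URL shapes named above: the query is percent-encoded either in the `#` fragment (the `query.wikidata.org` forms) or in a `query` parameter (the `?query=` forms). The function name `extract_sparql` and the example hosts are hypothetical.

```python
from typing import Optional
from urllib.parse import parse_qs, unquote, urlsplit


def extract_sparql(url: str) -> Optional[str]:
    """Return the decoded SPARQL text embedded in a long query URL, if any."""
    parts = urlsplit(url)
    # Form 1: ...?query=<encoded SPARQL>
    params = parse_qs(parts.query)
    if "query" in params:
        return params["query"][0]
    # Form 2: .../index.html#<encoded SPARQL> or .../embed.html#<encoded SPARQL>
    if parts.fragment:
        return unquote(parts.fragment)
    return None


# Illustrative URLs only; neither is taken from wikidata.sparql.md.
embedded = (
    "https://query.wikidata.org/embed.html"
    "#SELECT%20%3Fitem%20WHERE%20%7B%20%3Fitem%20wdt%3AP703%20%3Ftaxon%20%7D"
)
expanded = "https://qlever.example.org/wikidata/?query=SELECT+%2A+WHERE+%7B+%3Fs+%3Fp+%3Fo+%7D"

print(extract_sparql(embedded))
print(extract_sparql(expanded))
```

The decoded text can then go into the dedicated archive note, while the cleaned index keeps only the short link.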
Raw inline SPARQL blocks
- Drug-protein interaction draft
  - Source material: inline SPARQL block under `## drug-prot interaction`.
  - Curation instruction: convert this into a titled query entry only after confirming scope, expected output columns, and whether the cancer restriction is intentional or example-only.
- VIT / JLW federated query blocks
  - Source material: the two raw SPARQL blocks joining Wikidata with `sinergiawolfender.org` namespaces in `wikidata.sparql.md`.
  - Curation instruction: decide whether these belong in a separate federated-query note; if kept, document the external dataset dependency, required taxon input, and matching logic on molecular formula.
- InChIKey occurrence lookup drafts
  - Source material: the `containing taxa + ref for a given IK` section with two raw SPARQL blocks and embedded URLs.
  - Curation instruction: merge duplicate variants, name them by purpose (`all taxa for an InChIKey prefix`, `taxon-restricted InChIKey lookup`), and keep only one canonical version per use case.
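One way to keep a single canonical version per use case is to generate both lookups from one template. The sketch below is an assumption about what such a query could look like, not a reconstruction of the drafts in `wikidata.sparql.md`: it uses the Wikidata properties P235 (InChIKey), P703 (found in taxon), and P248 (stated in), and the function name `build_inchikey_query` is hypothetical. Prefixes are declared explicitly because not every endpoint predefines them.

```python
import re
from typing import Optional

PREFIXES = """\
PREFIX wd: <http://www.wikidata.org/entity/>
PREFIX wdt: <http://www.wikidata.org/prop/direct/>
PREFIX p: <http://www.wikidata.org/prop/>
PREFIX ps: <http://www.wikidata.org/prop/statement/>
PREFIX pr: <http://www.wikidata.org/prop/reference/>
PREFIX prov: <http://www.w3.org/ns/prov#>
"""


def build_inchikey_query(ik_prefix: str, taxon_qid: Optional[str] = None) -> str:
    """Build a lookup of taxa and references for compounds matching an InChIKey prefix.

    With taxon_qid set, results are restricted to that exact taxon (an assumption;
    the original drafts may restrict by parent taxon instead).
    """
    # Character whitelist only: guards against query injection, not a validity check.
    if not re.fullmatch(r"[A-Z-]{1,27}", ik_prefix):
        raise ValueError(f"unexpected InChIKey prefix: {ik_prefix!r}")
    if taxon_qid is not None and not re.fullmatch(r"Q[1-9][0-9]*", taxon_qid):
        raise ValueError(f"unexpected taxon QID: {taxon_qid!r}")
    taxon_clause = f"  VALUES ?taxon {{ wd:{taxon_qid} }}\n" if taxon_qid else ""
    return (
        PREFIXES
        + "SELECT ?compound ?inchikey ?taxon ?reference WHERE {\n"
        + taxon_clause
        + "  ?compound wdt:P235 ?inchikey ;\n"
        + "            p:P703 ?statement .\n"
        + "  ?statement ps:P703 ?taxon ;\n"
        + "             prov:wasDerivedFrom/pr:P248 ?reference .\n"
        + f'  FILTER(STRSTARTS(?inchikey, "{ik_prefix}"))\n'
        + "}"
    )
```

Calling `build_inchikey_query("BSYNRYMUTXBXSQ")` yields the unrestricted lookup, and passing a taxon QID as the second argument yields the taxon-restricted one, so both use cases share one maintained template.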
Endpoint and reference material
- Endpoint inventory
  - Source material: https://www.wikidata.org/wiki/Wikidata:Lists/SPARQL_endpoints.
  - Curation instruction: move this out of the query index unless it is needed as a reference note; it is endpoint documentation rather than a query.
Presentation and scratch material
- To tweet
  - Source material: the `# to tweet` section in `wikidata.sparql.md`.
  - Curation instruction: harvest any genuinely useful query links from the section and merge them into stable query families; discard the presentation framing.
- Presentation example: UniFr
  - Source material: `# example for prsenattion UniFr` with links https://w.wiki/4EMf and https://w.wiki/4EMo.
  - Curation instruction: keep the underlying queries only if they are not already covered by an existing curated family; otherwise remove the presentation-specific label.
- Manager
  - Source material: the standalone `Manager` marker in `wikidata.sparql.md`.
  - Curation instruction: remove unless it refers to a recoverable query context elsewhere in the source note.
- mail_jakub
  - Source material: `[[mail_jakub|scratch.2021.02.02.150258.mail_jakub]]`.
  - Curation instruction: treat this as provenance for the indole-substructure idea, not as a query entry; move citation context to a note if useful.
Suggested output format after curation
Use this template when promoting an item into the cleaned note:
- `Title`: one-sentence description. Query: `<short link>`. Variant: `<second link if genuinely distinct>`.
If a full SPARQL block must be preserved:
- Create a dedicated note for the block.
- Give it the same stable title.
- In the cleaned index, link only the short query URL and mention that the full SPARQL is archived separately.
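The template above can be filled in mechanically once an item has a title, a description, and a short link. This is a minimal sketch; the function name `render_entry` and the placeholder link are hypothetical, and the output follows the `Title: description. Query: link. Variant: link.` pattern from the template.

```python
from typing import Iterable


def render_entry(
    title: str,
    description: str,
    short_link: str,
    variants: Iterable[str] = (),
) -> str:
    """Render one cleaned-index line from the promotion template."""
    links = [short_link, *variants]
    for link in links:
        # The cleaned index should only carry short links, never expanded URLs.
        if "?query=" in link:
            raise ValueError(f"expanded query URL not allowed in the index: {link[:40]}")
    # Collapse whitespace so the description stays on one line, then normalise the full stop.
    one_line = " ".join(description.split()).rstrip(".")
    parts = [f"{title}: {one_line}.", f"Query: {short_link}."]
    parts += [f"Variant: {v}." for v in variants]
    return " ".join(parts)


# Placeholder link for illustration; not a real short URL from this note.
print(
    render_entry(
        "Taxa for an InChIKey prefix",
        "lists taxa and references for compounds whose InChIKey starts with a given prefix",
        "https://w.wiki/EXAMPLE",
    )
)
```

Rejecting `?query=` links at render time enforces the rule that full SPARQL text is archived in its own note and only the short URL appears in the cleaned index.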