@@ -15,23 +15,27 @@ pub_date: 2020-08-03
15
15
body:
16
16
## Introduction
17
17
The Creative Commons (CC) Catalog project collects and stores CC licensed images scattered across the internet, such
18
- that they can be made accessible to the general public via the [ CC Search] ( https://ccsearch.creativecommons.org/ )
19
- and [ CC Catalog API ] ( https://api.creativecommons.engineering/v1/ ) tools. Numerous information associated with each
20
- image, which help in the image search and categorisation process are stored via CC Catalog in the CC database.
18
+ that they can be made accessible to the general public via the [ CC Search] [ cc_search ] and [ CC Catalog API ] [ cc_api ]
19
+ tools. Numerous information associated with each image, which help in the image search and categorisation process are
20
+ stored via CC Catalog in the CC database.
21
21
22
- In my [ previous blog post] ( https://opensource.creativecommons.org/blog/entries/flickr-sub-provider-retrieval/ )
23
- of this series entitled 'Flickr Sub-provider Retrieval', I discussed how the images from a
24
- certain provider (such as Flickr) can be categorised based on the sub-provider values (which reflects the underlying
25
- organisation or entity that published the images through the provider). We have similarly implemented the sub-provider
26
- retrieval logic for Europeana and Smithsonian providers. Unlike in Flickr and Europeana, every single image from
27
- Smithsonian is categorised under some sub-provider value where the sub-providers are identified based on a unit code
28
- value as contained in the API response (for more information please refer to the pull request
29
- https://github.com/creativecommons/cccatalog/pull/455 ). The unit code values and the corresponding sub provider values
30
- are maintained in the dictionary * SMITHSONIAN_SUB_PROVIDERS* . However, there is the possibility of the * unit code*
31
- values being updated at the Smithsonian API level, and it is important that we have a mechanism of reflecting those
32
- updates in the * SMITHSONIAN_SUB_PROVIDERS* dictionary as well. In this blog post, we discuss how we learn the potential
22
+ In my [ previous blog post] [ flickr_blog_post ] of this series entitled 'Flickr Sub-provider Retrieval', I discussed how
23
+ the images from a certain provider (such as Flickr) can be categorised based on the sub-provider values (which reflects
24
+ the underlying organisation or entity that published the images through the provider). We have similarly implemented
25
+ the sub-provider retrieval logic for Europeana and Smithsonian providers. Unlike in Flickr and Europeana, every single
26
+ image from Smithsonian is categorised under some sub-provider value where the sub-providers are identified based on a
27
+ unit code value as contained in the API response (for more information please refer to the pull request [ #455 ] [ pr_455 ] ).
28
+ The unit code values and the corresponding sub provider values are maintained in the dictionary
29
+ * SMITHSONIAN_SUB_PROVIDERS* . However, there is the possibility of the * unit code* values being updated at the
30
+ Smithsonian API level, and it is important that we have a mechanism of reflecting those updates in the
31
+ * SMITHSONIAN_SUB_PROVIDERS* dictionary as well. In this blog post, we discuss how we learn the potential
33
32
changes to the * unit code* values and keep the * SMITHSONIAN_SUB_PROVIDERS* dictionary up-to-date.
34
33
34
+ [ cc_search ] : https://ccsearch.creativecommons.org/
35
+ [ cc_api ] : https://api.creativecommons.engineering/v1/
36
+ [ flickr_blog_post ] : ../flickr-sub-provider-retrieval/
37
+ [ pr_455 ] : https://github.com/creativecommons/cccatalog/pull/455
38
+
35
39
## Implementation
36
40
### Retrieving the latest unit codes
37
41
We are required to obtain the latest * unit codes* supported by the Smithsonian API to achieve this task. Furthermore,
0 commit comments