quantify the size and diversity of the commons--the collection of works that are openly licensed or in the public domain

creativecommons/quantifying

quantifying

Quantifying the Commons

Overview

This project seeks to quantify the size and diversity of the commons--the collection of works that are openly licensed or in the public domain.

Code of Conduct

CODE_OF_CONDUCT.md:

The Creative Commons team is committed to fostering a welcoming community. This project and all other Creative Commons open source projects are governed by our Code of Conduct. Please report unacceptable behavior to conduct@creativecommons.org per our reporting guidelines.

Contributing

See CONTRIBUTING.md.

Development

Prerequisites

This repository uses pipenv to manage the required Python modules.

Tooling

Data Sources

CC Legal Tools

Flickr

  • The Flickr API exposes identifiers for users, photos, photosets and other uniquely identifiable objects.
  • The Flickr API consists of a set of callable methods and some API endpoints.
  • For a more detailed description, see: API documentation - Flickr Services.
  • The hs.csv file is a sample CSV of pulled data. Ideally, the script will generate the final data CSVs.
  • Each license has its own CSV file for saving the data.
  • Because of size limits, the license CSVs are not pushed to GitHub.
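
The per-license CSV idea above could be sketched as follows. This is a minimal illustration, not the repository's actual script: the helper names `build_search_url` and `write_license_csv` and the chosen CSV columns are assumptions made for the example. The REST endpoint and the `flickr.photos.search` method with its `license` parameter come from the public Flickr API.

```python
import csv
from urllib.parse import urlencode

FLICKR_REST_URL = "https://www.flickr.com/services/rest/"


def build_search_url(api_key, license_id, page=1):
    """Return a flickr.photos.search URL restricted to one license ID.

    Hypothetical helper: the name and signature are illustrative only.
    """
    params = {
        "method": "flickr.photos.search",
        "api_key": api_key,
        "license": license_id,
        "page": page,
        "format": "json",
        "nojsoncallback": 1,  # ask for plain JSON instead of JSONP
    }
    return f"{FLICKR_REST_URL}?{urlencode(params)}"


def write_license_csv(handle, photos, fields=("id", "owner", "title")):
    """Write one CSV row per photo record to an open text handle.

    The column selection is an assumption; keys not listed in
    `fields` are ignored rather than raising an error.
    """
    writer = csv.DictWriter(handle, fieldnames=fields, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(photos)
```

Calling `build_search_url` once per license ID and passing each response's photo records to `write_license_csv` with a separate file handle would yield one CSV per license.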

Google Custom Search JSON API

Internet Archive Python Interface

A Python interface to archive.org for making API requests to the Internet Archive.

The Metropolitan Museum of Art Collection API

An API endpoint for retrieving the CC-licensed works in the Metropolitan Museum of Art's collection.

Latest Updates | The Metropolitan Museum of Art Collection API:

The Metropolitan Museum of Art provides select datasets of information on more than 470,000 artworks in its Collection for unrestricted commercial and noncommercial use. To the extent possible under law, The Metropolitan Museum of Art has waived all copyright and related or neighboring rights to this dataset using the Creative Commons Zero license.
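
As a rough sketch of how such records might be fetched and tallied, the snippet below builds an object URL against the public collection endpoint and counts records flagged as public domain. The helper names are hypothetical and not taken from this repository. The base URL and the `isPublicDomain` field come from the Met's public API.

```python
import json
from urllib.request import urlopen

MET_BASE_URL = "https://collectionapi.metmuseum.org/public/collection/v1"


def object_url(object_id):
    """Return the URL of a single object record (hypothetical helper)."""
    return f"{MET_BASE_URL}/objects/{object_id}"


def fetch_object(object_id, timeout=10):
    """Download and decode one object record; requires network access."""
    with urlopen(object_url(object_id), timeout=timeout) as response:
        return json.load(response)


def count_public_domain(records):
    """Count records whose isPublicDomain flag is true.

    Records that lack the flag are treated as not public domain.
    """
    return sum(1 for record in records if record.get("isPublicDomain"))
```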

Vimeo API

The Vimeo API allows users to perform filtered, advanced search on Vimeo videos.

MediaWiki API

  • The MediaWiki Action API is a web service that allows access to some wiki features like authentication, page operations, and search. It can provide meta information about the wiki and the logged-in user.
  • language-codes_csv.csv
    • A list of language codes in ISO 639-1 format, used to access the statistics of each Wikipedia main page across different languages. The script refers to this file as language-codes_csv so that it can run without manual changes when given the same language-code file. To use a different list, rename the header and file name of the .csv ISO code list to match the current file on GitHub.
    • The file used by the script can be downloaded from: https://datahub.io/core/language-codes
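
A minimal sketch of how the language-code file could drive per-language statistics requests is shown below. The function names are hypothetical, and the `alpha2` column name is an assumption about the CSV header, so adjust it to match the actual file. The query parameters (`action=query`, `meta=siteinfo`, `siprop=statistics`) are standard MediaWiki Action API options.

```python
import csv
from urllib.parse import urlencode


def read_language_codes(handle, column="alpha2"):
    """Return the non-empty language codes from an open CSV handle.

    `column` is the assumed header of the ISO 639-1 code column.
    """
    reader = csv.DictReader(handle)
    codes = []
    for row in reader:
        code = (row.get(column) or "").strip()
        if code:
            codes.append(code)
    return codes


def statistics_url(language_code):
    """Return the site-statistics query URL for one Wikipedia language edition."""
    params = {
        "action": "query",
        "meta": "siteinfo",
        "siprop": "statistics",
        "format": "json",
    }
    return f"https://{language_code}.wikipedia.org/w/api.php?{urlencode(params)}"
```

Opening the CSV with `open("language-codes_csv.csv", newline="", encoding="utf-8")` and mapping `statistics_url` over the returned codes would give one request URL per language edition.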

YouTube Data API

An API from YouTube for platform users to upload videos, adjust video parameters, and obtain search results.

History

For information on past efforts, see history.md.

Copying & License

Code

LICENSE: the code within this repository is licensed under the Expat/MIT license.

Data

The data within this repository is dedicated to the public domain under the CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.

Documentation

The documentation within the project is licensed under a Creative Commons Attribution 4.0 International License.
