-
-
Notifications
You must be signed in to change notification settings - Fork 184
/
Copy pathsearch_roadmap.json
174 lines (174 loc) · 10.3 KB
/
search_roadmap.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
{
"quarters": [
{
"name": "Q3 2020",
"tasks": [
{
"description": "Make changes to the search algorithm that incorporate image popularity data gathered from sources that provide it. ",
"gid": "1149385618454696",
"name": "Improve Search Algorithm with Popularity Data Integration"
},
{
"description": "Move our data cleaning code from the ingestion step of the API to the initial data processing step of the Catalog to eliminate unnecessary repetitive data cleaning.",
"gid": "1167425798148805",
"name": "Move data cleaning pipeline from API to Catalog"
},
{
"description": "Update Catalog schema to include new metadata generated through AWS Rekognition.",
"gid": "1154270978154717",
"name": "Implement architecture for schema for new metadata [AWS Grant]"
},
{
"description": "Plan out search algorithm changes to incorporate image metadata generated via AWS Rekognition.",
"gid": "1154270978154720",
"name": "Plan search algorithm changes for new metadata [AWS Grant]"
},
{
"description": "Improve how and where we explain licenses, and consider ways to make it easier for reusers to understand and comply with license requirements.",
"gid": "1147666754358269",
"name": "License Explanation/Compliance Improvements"
},
{
"description": "Offline Old Search (oldsearch.creativecommons.org) and redirect traffic to CC Search. Prior to this, build in messaging on Old Search, and support similar functionality on CC Search. See \"Meta Search Integration\" for related work.",
"gid": "1149456632174214",
"name": "Offline old CC Search"
},
{
"description": "Research and test potential integrations for Web Monetization into CC Search and other CC web properties.",
"gid": "1153114910798067",
"name": "Web Monetization: Phase 1"
},
{
"description": "Support rendering of 3D objects on frontend of CC Search.",
"gid": "1186693612765819",
"name": "3D Support in UI"
},
{
"description": "Improve the support pages on CC Search, which includes the Collections page, for a better experience. Add explanation text for collections, improve flow.",
"gid": "1149385618454685",
"name": "Improved Support Pages"
},
{
"description": "Make accessibility improvements to the UI.",
"gid": "1171784517466065",
"name": "Accessibility Improvements"
},
{
"description": "Build infrastructure necessary for internationalization, to allow CC Search to be accessible in other languages.",
"gid": "1149456632174198",
"name": "Internationalization Infrastructure"
},
{
"description": "Update our Common Crawl provider infrastructure to:\n(1) use Apache Airflow instead of AWS tools like Data Pipeline & Glue for processing data\n(2) unify provider processing to use the same base classes as API providers",
"gid": "1167425798148813",
"name": "Improve Common Crawl Infrastructure"
},
{
"description": "Designing and prototyping an upcoming user interface for searching for audio on CC Search.",
"gid": "1163392248010945",
"name": "Design Sprint: Audio UI for CC Search"
},
{
"description": "Design and user test UIs for audio. Ingest a pilot collection of audio to the Catalog, build support in the API. Integrate design to frontend to allow users to search for CC licensed audio.",
"gid": "1171015130050099",
"name": "Audio Support and Integration"
},
{
"description": "Generate metadata via machine learning (using AWS Rekognition) on a set of ~100 million high quality images from the CC Catalog.",
"gid": "1154270978154727",
"name": "Run Rekognition on 100m images [AWS Grant]"
},
{
"description": "For all possible providers, use their APIs to ingest data into the CC Catalog instead of scraping websites via Common Crawl data.",
"gid": "1149385618454708",
"name": "Switch from Common Crawl to API"
}
]
},
{
"name": "Q4 2020",
"tasks": [
{
"description": null,
"gid": "1186693612765822",
"name": "Search Relevance Improvements: Language Analysis, Quality Metrics, Minimums"
},
{
"description": "Design updates to the CC Search UI in response to new metadata available as a result of applying machine learning to selected images in the Catalog. At a minimum, we expect new filters will be an option. Integration of design will take place subsequently.",
"gid": "1154270978154729",
"name": "Plan UI Updates in Response to Metadata [AWS Grant]"
},
{
"description": "Automate the process of finding new providers of CC-licensed content to index into the CC Catalog.",
"gid": "1167425798148819",
"name": "Provider Review Automation"
},
{
"description": "Build an analytics UI that is fed by Google Analytics and our internal analytics database.",
"gid": "1149385618454692",
"name": "Usage/Reuse Metrics Dashboard"
},
{
"description": "Once the Rekognition crawl finishes, we want to crawl the rest of the catalog (but not feed them to rekognition). This will give us useful metadata like dimensions and quality.",
"gid": "1186693612765814",
"name": "Scrape all images and set up feed for new ones"
},
{
"description": "Create better documentation for community contributors by consolidating internal and public documentation and making it available for everyone.",
"gid": "1167425798148815",
"name": "Improve Documentation for Community Contributors"
},
{
"description": "Manage Catalog deployment and provisioning entirely through infrastructure as code.",
"gid": "1167425798148811",
"name": "Improve Catalog Deployment and Provisioning"
},
{
"description": "Make CC Catalog API documentation more accessible to CC Search users, and improve user experience.",
"gid": "1164969092703369",
"name": "API documentation improvements"
},
{
"description": "Design and build an embed of CC Search that can be placed on any website, as a starting point to discover objects in CC Search. Components from Design Library must be used, with the goal of simplicity.",
"gid": "1168725971351188",
"name": "CC Search HTML Embed"
},
{
"description": "Plan out the usage of scraping ccREL metadata from the internet to index new content into the CC Catalog.",
"gid": "1164969092703371",
"name": "Plan use of ccREL for easily adding content to cccatalog"
},
{
"description": "Update CC Search user personas based on user research during 2020.",
"gid": "1149385618454694",
"name": "User Persona Redevelopment"
},
{
"description": "Design and implement seamless support for multiple languages in CC Search, as content in languages becomes available. This is preceded by Internationalization Infrastructure work.",
"gid": "1146971105237778",
"name": "Support multiple languages in CC Search\n"
},
{
"description": "Implement design updates to the CC Search UI. Designs will be created in response to new metadata available as a result of applying machine learning to selected images in the Catalog. At a minimum, we expect new filters to be rolled out.",
"gid": "1154270978154731",
"name": "Implement UI Updates for new Metadata [AWS Grant]"
},
{
"description": "Update our search algorithm to use metadata gathered using machine learning analysis (using AWS Rekognition).",
"gid": "1154270978154736",
"name": "Implement Search Algorithm Changes [AWS Grant]"
},
{
"description": "Release the infrastructure code used to power the CC Catalog, API, and CC Search projects publicly.",
"gid": "1154346889867710",
"name": "Ensure Infrastructure Code is Open Source [AWS Grant]"
},
{
"description": "Enrich CC Catalog and data found in the wild using Common Crawl, for example, to track where CC-licensed content is reused.",
"gid": "1167425798148821",
"name": "Enrich CC Catalog data with data from Common Crawl"
}
]
}
]
}