You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+21Lines changed: 21 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,3 +37,24 @@ The [Makefile](./Makefile) contains targets to apply a consistent formatting to
37
37
## Citations from Google Scholar Alerts
38
38
39
39
As an initial step and to get a higher coverage, citations are extracted from Google Scholar Alert e-mails received April 2016 to date. See [gscholar_alerts](./gscholar_alerts/).
40
+
41
+
## Updating the awesome graph that everyone loves
42
+
43
+
## Uploading the raw data to Hugging Face
44
+
45
+
### Google Scholar
46
+
47
+
This data is split by year to make it easier to explore.
48
+
49
+
- pull the updated repo
50
+
-`make gscholar-bib`
51
+
- look in tmp for 2024.jsonl etc.
52
+
- upload at https://huggingface.co/datasets/commoncrawl/citations/tree/main
53
+
54
+
### Annotated Citations
55
+
56
+
This much smaller dataset has the extra fields mentioned above.
0 commit comments