Skip to content

Commit 23a4aed

Browse files
committed
fix: add missing item for new JSON importer from cc-pyspark
1 parent 7bde1a4 commit 23a4aed

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

src/script/hostgraph/build_hostgraph.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,10 +34,10 @@ SPARK_HADOOP_OPTS=""
3434
SPARK_EXTRA_OPTS=""
3535

3636
HOST_LINK_EXTRACTOR=./hostlinks_extract_fastwarc.py
37-
PYFILES_HOST_LINK_EXTRACTOR="sparkcc.py,sparkcc_fastwarc.py,wat_extract_links.py"
37+
PYFILES_HOST_LINK_EXTRACTOR="sparkcc.py,sparkcc_fastwarc.py,wat_extract_links.py,json_importer.py"
3838

3939
HOST_LINKS_TO_GRAPH=./hostlinks_to_graph.py
40-
PYFILES_HOST_LINKS_TO_GRAPH="sparkcc.py,iana_tld.py,wat_extract_links.py"
40+
PYFILES_HOST_LINKS_TO_GRAPH="sparkcc.py,iana_tld.py,wat_extract_links.py,json_importer.py"
4141
HOST_LINKS_TO_GRAPH_ARGS=(--validate_host_names) # --normalize_host_names
4242

4343

0 commit comments

Comments
 (0)