for data in DataLoader(ops.data_source.readthedocs('', include='*html')):
    ...  # process each crawled item here
# batch
for data in DataLoader(ops.data_source.readthedocs('', include='*html'), batch_size=10):
    ...  # process each batch here (batch_size=10)
***page_prefix:*** *str*
The root path of the pages. Crawled links are generally relative paths, so the complete URL is obtained by joining the root path and the relative path.
***index_page:*** *str*
The main page, which contains links to all other pages. If None, `page_prefix` is used.
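
The relationship between the two parameters can be illustrated with a minimal sketch. It is not the operator's actual implementation: the helper names `resolve_index_page` and `build_full_urls`, the simple slash-trimming join, and the `https://example.com/docs/` URL are all assumptions made for illustration.

```python
from typing import List, Optional


def resolve_index_page(page_prefix: str, index_page: Optional[str] = None) -> str:
    """Hypothetical helper: fall back to page_prefix when index_page is None."""
    return page_prefix if index_page is None else index_page


def build_full_urls(page_prefix: str, relative_links: List[str]) -> List[str]:
    """Hypothetical helper: join the root path with each relative path."""
    # Trim slashes on both sides so the join yields exactly one separator.
    root = page_prefix.rstrip('/')
    return [root + '/' + link.lstrip('/') for link in relative_links]


if __name__ == '__main__':
    prefix = 'https://example.com/docs/'  # assumed root path, for illustration only
    print(resolve_index_page(prefix))
    print(build_full_urls(prefix, ['index.html', '/guide/intro.html']))
```

In this sketch the index page defaults to the root path, and every relative link found on it is expanded to a complete URL before it would be fetched.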