Output subfolder, and port 1256 is unused on your machine, you might run: docker run -it -rm -v $(pwd)/output:/data -p 1256:80 kiwix/kiwix-tools kiwix-serve /data/ifixit_fr_selection_2022-04.zimĪnd then navigate to ( on your favorite browser. To test if the ZIM produced is OK, you should run kiwix-serve, once more with Docker.įor instance, if you produced a file named ifixit_fr_selection_2022-04.zim in the This will produce a ZIM in the output folder of your current directory. In French : docker run -it -v $(pwd)/output:/output -rm openzim/fixit:local ifixit2zim -language fr -output /output -tmp-dir /tmp -category Apple_PDA Then run the scraper with CLI arguments needed for your test (everything after ifixit2zim in the example below).įor instance, if you want to run a scrape of only the Apple_PDA category, including its guides, Test the scraper in a Docker containerįirst, build the Docker image (to be ran in the main folder of this repo): docker build -t openzim/ifixit:local. In doubt, execute the scraper in aĭocker container as explained below. If Kiwix still doesn’t work after adding zims, try rebooting or sorting modules in the admin interface and hitting save. So this shouldn't be a problem for quick tests. The main issue is that the full-text index is not working, NOTA : there is some limitations to the execution of the underlying libzim library on Ensure your code passed black formatting, isort and flake8 (88 chars).If your contribution is far from trivial, open an issue to discuss it first.Please open an issue on Github and/or submit a Pull-request. BitTorrentwhich downloads the file with a torrent software. Once you have Kiwix open, go to the Kiwix internal library to pick your content. Once Kiwix (the software)is installed you have many ways to download content (files): With Kiwixif your version of Kiwix allows it. See requirements.txt for the list of python dependencies. Kiwix (the software) The ZIM filewith the content of your choice. venvĬall deactivate to quit the virtual environment. In addition to Python3, you also need to have an up-to-date installation of pip, setuptools and wheel as recommanded here (wheel is important since you will have to build some dependencies). If you are not using the Docker image, you are advised to use it in a virtual environment to avoid installing software dependencies on your system. Ifixit2zim is a Python3 ( 3.6+) software. Docker docker run -v my_dir:/output openzim/ifixit ifixit2zim -help The list of supported languages is visible in the -help message. Ifixit2zim works off a language version that you must provide via the -language argument. org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump. ) and puts them in a ZIM file, a clean and user friendly format for storing content for offline usage. A mirror is an object that reflects an image. This scraper downloads the iFixit resources (categories, guides. Special thanks go to Emmanuel Engelhart who made this all possible.Ifixit2zim is an openZIM scraper to create offline versions of iFixit website, in all its supported languages. Vinita Parkash (who inspired the effort), and Dr. The full resolution images are available on Libre Pathology and WikiCommons.Ī big thanks go to Dr. The image size was limited to make the ZIM file more compact. We are looking at using Parsoid/MWoffliner for the HTML generation. Wikipedia, Wikiversity.Ĭurrently, we are using httrack to generate the static HTML pages from an offline copy of the site, and then feed that to zimwriterfs to make the ZIM file. The offline version is based on Kiwix, which was developed to create offline versions of WikiMedia Foundation projects, e.g. Open Kiwix and load the ZIM file - takes less than 2 minutes.ask our site admin for the link & password to get the pre-release. Download the Libre Pathology ZIM file - takes a while as it is ~700MB.Install Kiwix - takes less than 5 minutes.Setting up an offline version of Libre Pathology Kiwix has a strong built-in search feature, and navigation otherwise is similar to online.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |