Pipeline for ingesting data about housing near RIT.
- Configure your environment (instructions on the wiki).
- Choose an unassigned issue, and comment that you're working on it.
- Open a PR containing a new fetch, parse, or normalize script! (details on these stages)
Results are periodically committed to vaccine-feed-ingest-results. Once your PR is merged, you will be able to see the output of your work there!
See the wiki for instructions on how to run housing-data-ingest.
For more information on pipeline stages and how to contribute, see the wiki!
The details below on interacting with our production environment are intended for staff developers.
In production, all stages for all runners are run, and outputs are stored to the vaccine-feeds bucket on GCS.
If you are developing a feature that interacts with remote storage and need to test against GCS, install the gcloud SDK following the setup instructions and use the vaccine-feeds-dev bucket (you will need to be granted access).
Results are also periodically committed to vaccine-feed-ingest-results.
- Authenticate to gcloud with an account that has access to the vaccine-feeds-dev bucket.

      gcloud auth application-default login

- Run ingestion with a GCS --output-dir.

      poetry run housing-data-ingest all-stages --output-dir=gs://vaccine-feeds-dev/locations/
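
For orientation, the --output-dir value is a gs:// URI: the host part names the bucket and the path is the prefix under which results are written. A minimal Python sketch of that split, using only the standard library. The helper name split_gcs_uri is hypothetical and is not part of housing-data-ingest.

```python
from urllib.parse import urlsplit


def split_gcs_uri(uri: str) -> tuple[str, str]:
    """Split a gs:// URI into (bucket, prefix).

    Hypothetical helper for illustration; not part of the ingest CLI.
    """
    parts = urlsplit(uri)
    if parts.scheme != "gs" or not parts.netloc:
        raise ValueError(f"not a GCS URI: {uri!r}")
    # A leading "/" in the path is not part of the object prefix.
    return parts.netloc, parts.path.lstrip("/")


bucket, prefix = split_gcs_uri("gs://vaccine-feeds-dev/locations/")
print(bucket, prefix)
```

A local path such as /tmp/out has no gs scheme, so the helper raises ValueError rather than guessing a bucket.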
- Request an account on the VIAL staging server: https://vial-staging.calltheshots.us
- Create an API key for yourself at https://vial-staging.calltheshots.us/admin/api/apikey/
- Store the API key in the project .env file with the var VIAL_APIKEY.
- Load the SF.GOV source feed to VIAL:

      poetry run housing-data-ingest load-to-vial ca/sf_gov
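
Before loading, it can help to confirm that the .env file actually contains VIAL_APIKEY. How housing-data-ingest reads the file is not shown here, so the following is only a rough sketch that assumes plain KEY=value lines with optional surrounding quotes. The helper read_env_value is hypothetical.

```python
from pathlib import Path
from typing import Optional


def read_env_value(env_path: str, name: str) -> Optional[str]:
    """Return the value of `name` from a KEY=value file, or None if absent."""
    path = Path(env_path)
    if not path.is_file():
        return None
    for raw_line in path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == name:
            # Drop surrounding whitespace and one layer of matching quotes.
            value = value.strip()
            if len(value) >= 2 and value[0] == value[-1] and value[0] in "'\"":
                value = value[1:-1]
            return value
    return None


if __name__ == "__main__":
    # Assumes the script is run from the project root, next to .env.
    api_key = read_env_value(".env", "VIAL_APIKEY")
    print("VIAL_APIKEY is set" if api_key else "VIAL_APIKEY is missing from .env")
```

The check prints only whether the key is present, never the key itself, so it is safe to run in a shared terminal.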