-
Notifications
You must be signed in to change notification settings - Fork 21
Description
Description
Without changing any user-facing syntax, it would be helpful to make it possible to avoid re-querying the remote API's based on a local cache file.
Motivation
During exploratory data analysis on some larger datasets it's common to surface issues that can be corrected prior to geo-coding but it's helpful to just see which addresses fail in order to identify those issues. Another user has mentioned that this would be desirable previously for what it's worth but I think it's a different and simpler feature than the feature their comment is under. (#39 (comment))
Your Contribution
I can implement a first version of the feature based on hashing
- the packaged unique inputs as they will be sent to the geocoding service;
- the identity of the geocoding service; and
- the arguments that will be sent to the geocoding service
I think it would make sense to merged the cached results transparently. I think it would also make sense to add a function that the user can apply to find a hash for a given record and also a function that allows specific hashes to be dropped from the cache in case the user wants to force re-coding. The (opt-in) cache can likely just be a tibble or set of tibbles saved to a .rds file.
I've implemented some hacky versions of this locally but opened the issue to find out if you have opinions about how something like that should be implemented within the package.