Skip to content

Cache for previously coded addresses #187

@sakrejda-umich

Description

@sakrejda-umich

Description

Without changing any user-facing syntax, it would be helpful to make it possible to avoid re-querying the remote API's based on a local cache file.

Motivation

During exploratory data analysis on some larger datasets it's common to surface issues that can be corrected prior to geo-coding but it's helpful to just see which addresses fail in order to identify those issues. Another user has mentioned that this would be desirable previously for what it's worth but I think it's a different and simpler feature than the feature their comment is under. (#39 (comment))

Your Contribution

I can implement a first version of the feature based on hashing

  1. the packaged unique inputs as they will be sent to the geocoding service;
  2. the identity of the geocoding service; and
  3. the arguments that will be sent to the geocoding service

I think it would make sense to merged the cached results transparently. I think it would also make sense to add a function that the user can apply to find a hash for a given record and also a function that allows specific hashes to be dropped from the cache in case the user wants to force re-coding. The (opt-in) cache can likely just be a tibble or set of tibbles saved to a .rds file.

I've implemented some hacky versions of this locally but opened the issue to find out if you have opinions about how something like that should be implemented within the package.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions