-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
Summary
Please consider adding YARS-PG as a first-class serialization format in graphfaker (initially as an export target; optionally as an import/loader later). YARS-PG is a concise, text-based format designed specifically for property graphs, with a formal grammar and reference implementations. Supporting it would immediately improve interoperability, reproducibility, and downstream tooling integration for datasets generated by graphfaker.
Why YARS-PG would benefit graphfaker
-
Interoperability across PG tools and research
- YARS-PG was designed for publication and exchange of property graphs. Exporting
graphfakeroutputs to YARS-PG makes it straightforward to share synthetic benchmarks and teaching datasets across tools, languages, and databases that already speak or validate YARS-PG.
- YARS-PG was designed for publication and exchange of property graphs. Exporting
-
Bridging PG and RDF/Linked Data ecosystems*
- There are existing bridges that parse YARS-PG into RDF* datasets. This unlocks RDF/Linked-Data tooling (validation, transformation, indexing) on top of
graphfakeroutputs without bespoke converters.
- There are existing bridges that parse YARS-PG into RDF* datasets. This unlocks RDF/Linked-Data tooling (validation, transformation, indexing) on top of
-
Plain-text, diff-friendly artifacts (great for Git)
- YARS-PG files are human-readable and line-diff-friendly, which improves code review, dataset versioning, and reproducible experiments—key for academic and industrial workflows.
-
Feature coverage aligned with property-graph practice
- The format and grammar capture the core PG concepts (nodes, edges, properties, labels/types, edge properties/annotations), which map naturally from
graphfaker’s internal NetworkX graphs.
- The format and grammar capture the core PG concepts (nodes, edges, properties, labels/types, edge properties/annotations), which map naturally from
-
Ready-to-use grammar and parsers
- A formal ANTLR4 grammar with Python and Java parsers exists. This lowers implementation risk and provides a built-in way to validate exported files as part of CI.
-
Licensing compatibility
graphfakeris MIT-licensed; YARS-PG grammar/impl. is BSD-3-Clause and/or CC-BY for docs—compatible for optional dependency or dev-time validator.
What is YARS-PG (at a glance)
- A text-based serialization purpose-built for property graphs.
- Comes with a formal grammar (ANTLR4 + EBNF) and reference parsers (Python/Java).
- Designed to be simple, extensible, and platform-independent.
- Validated in a peer-reviewed venue and used in example datasets/benchmarks.
References
- Grammar & implementation (ANTLR4, EBNF, examples, Python/Java parsers): https://github.com/lszeremeta/yarspg
- Academic paper (IEEE Access, 2024): https://ieeexplore.ieee.org/document/10536116/
- Release record (v5.0.0, DOI): https://zenodo.org/records/10676695
- YARS-PG → RDF* bridge (example implementation): https://github.com/gkellogg/yarspg
Metadata
Metadata
Assignees
Labels
No labels