Parser Stats<br>automatically and incrementally reported with each parse
See AboutUs Getting Started blog post announcing the open-sourcing of this technology. github
This was a web based experiment manager that could prepare, monitor, and refine task specific grammars turned into engines with a slightly enhanced version of Ian Piumarta's pegleg. site
Developed sediment inspired uniform sampling algorithm. Add elements at period p until sediment reaches limit. Then decimate the sediment by 1/2 and 2x the period. github
See related reservoir sampling which is happy to be uniformly random but not evenly spaced. wikipedia
My approach here was informed by my positive experience with XML Import in Ruby.