Websites I Run

Data Repositories I Run

Software Frameworks / Libraries

  • nsq.io - realtime distributed message processing at scale
  • gomrjob - Go framework for running MapReduce jobs on Hadoop or Dataproc
  • git-open-pull - convert an issue to a pull request from the CLI
  • private_s3_httpd - HTTP server for private Amazon S3 content
  • lru - Go library for caching arbitrary data with a least-recently-used (LRU) eviction strategy
  • sortdb - HTTP API for querying data in a sorted CSV file
  • data_hacks - command line tools for data analysis
  • urlnorm - Python library for URL normalization
  • json2csv - convert a stream of JSON messages to CSV
  • little_bigtable - emulator for Google Bigtable with sqlite3 persistence

Archived Projects

Jehiah Czebotar