The Harvester Project

Description

With this project I want to draw attention to the amount of information that people share publicly on the Internet.

The idea is to have many Pitchfork implementations, each of them taking into account the TOC of its site. Basically, a Pitchfork is a crawler implementation for one social network.

To obtain the information, we need a starting point. This starting point is called a Seed, and it can be any value from the common dictionary.

Seeds

  • Email
  • Company
  • GithubUser
  • TwitterUser
  • PersonalSite
  • Avatar
  • Name
  • Location
  • Timezone
  • Description
  • Language
  • KeybaseUser
  • FacebookUser
  • HackernewsUser
  • RedditUser
  • LinkedinUser
  • BitcoinAddress
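
As an illustration only, a Seed could be modeled as a kind taken from the dictionary above plus a value. The names `SeedKind` and `Seed` and the sample value `octocat` are assumptions made for this sketch; they are not taken from the project's code.

```python
from dataclasses import dataclass
from enum import Enum


class SeedKind(Enum):
    """The common dictionary: one member per Seed listed in this README."""
    EMAIL = "Email"
    COMPANY = "Company"
    GITHUB_USER = "GithubUser"
    TWITTER_USER = "TwitterUser"
    PERSONAL_SITE = "PersonalSite"
    AVATAR = "Avatar"
    NAME = "Name"
    LOCATION = "Location"
    TIMEZONE = "Timezone"
    DESCRIPTION = "Description"
    LANGUAGE = "Language"
    KEYBASE_USER = "KeybaseUser"
    FACEBOOK_USER = "FacebookUser"
    HACKERNEWS_USER = "HackernewsUser"
    REDDIT_USER = "RedditUser"
    LINKEDIN_USER = "LinkedinUser"
    BITCOIN_ADDRESS = "BitcoinAddress"


@dataclass(frozen=True)
class Seed:
    """A starting point for a crawl: a kind from the dictionary and its value."""
    kind: SeedKind
    value: str


# Hypothetical starting point; the user name is only an example.
start = Seed(SeedKind.GITHUB_USER, "octocat")
```

Making the dataclass frozen keeps Seeds hashable, so they can be stored in sets to avoid visiting the same value twice.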

Pitchforks

  • Web crawler
  • Github
  • Keybase
  • Twitter

TODOs

  • Seed weight implementation. Not all the results have the same importance or reliability.
  • More Pitchforks (Facebook, Flickr, Quora, StackOverflow, Tumblr, etc.)
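
One possible shape for the Seed weight item is sketched below. It is not the project's design: the `WeightedResult` type, the 0.0 to 1.0 weight scale, and the noisy-OR rule used to combine weights from several sources are all assumptions chosen for illustration.

```python
from dataclasses import dataclass
from math import prod


@dataclass(frozen=True)
class WeightedResult:
    kind: str
    value: str
    weight: float  # assumed scale: 0.0 = unreliable, 1.0 = fully trusted


def combine(weights):
    """Noisy-OR: a value confirmed by several sources scores higher than by one."""
    return 1.0 - prod(1.0 - w for w in weights)


def rank(results):
    """Group identical (kind, value) pairs and sort them by combined weight."""
    grouped = {}
    for r in results:
        grouped.setdefault((r.kind, r.value), []).append(r.weight)
    scored = [(combine(ws), kind, value) for (kind, value), ws in grouped.items()]
    return sorted(scored, reverse=True)


ranked = rank([
    WeightedResult("Email", "a@example.com", 0.5),
    WeightedResult("Email", "a@example.com", 0.5),
    WeightedResult("Location", "Madrid", 0.6),
])
```

Here the email address is reported twice with weight 0.5, so its combined score (0.75) ranks it above the location that was reported once with weight 0.6.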

Icons made by Freepik from www.flaticon.com are licensed under CC 3.0 BY