Great to hear you are working on this problem! I think these problem will get a lot easier with just more data. You can check @Pip the WoT guy's it works amazingly well for finding real people who have a decent amount of connections. With more data, algorithms will be able to figure thinks out pretty reliably just doing some kind of "unsupervised learning".
Home