Here's a password data set (20 MB) with 2 million entries, from dazzlepond.com. I discovered this Malaysian website when investigating new subscriber email addresses on Analyticbridge (to decide whether they were associated with spam or other malicious activity). This Malaysian website also claims to have the full list of 450,000 Yahoo email accounts that were recently hijacked - you can indeed download all these email addresses from their website (and possibly check whether hijacked email addresses share patterns that make them vulnerable).
Anyway, the reason for sharing the password data set with you is for you to test your data science skills: try to answer the following questions:
Thank you for sharing!
Here's what I found: http://parasdoshi.com/2012/08/14/what-can-a-dataset-of-hacked-passw...
I was surfing the web for my research on a data problem, happen to found this ...very few of the companies apply data science starting with a true definition of business problem and solving using data science techniques.. this looks promising ...do check
The link to the "Offical salary..." data did not work.
here is what I have done my analysis. Any input is welcome. Thanks