Skip to content

This a scrapy project collecting millions of personal information from LinkedIn

Notifications You must be signed in to change notification settings

nlyu/Projects_Scrapy

Repository files navigation

scrapy_uiuc

Intro:

This is a multithread/process web scrapy that can get information for UIUC aluminai from LinkedIn.   这是一个多线程/进程网络爬虫,可以抓取领英上面uiuc校友的信息。
Information include: (Name, Connection, Companys, Titles, School, Degree, Major)   我们统计的信息包括:(姓名,好友数,工作公司,职位,学校,文凭,专业)
This project is made by three UIUC students(three contributor), and this is made as an tool for our research symposium 2017. We are going to demo our report as a research poster in campus on April 17th.   这个是由三位uiuc同学做的(见贡献者信息),我们做这个是为了参加uiuc 2017研究展览会。在四月17号我们会吧我们爬虫得到的数据分析成结果展示在我们的研究海报上。

How to use:  

You need... a labtop(Mac/Linux) + any linkedin account...and now you are all set! This program is runing on python3 + selenium so you probably need to install the some library.
Either enter pip3 install XXX / brew install XXX (BTW you need to install pip3 and brew first)
你需要一台笔记本电脑(Mac/Linux) + 任何领英账号,就可以用我们的这个软件啦   这个程序基于python3 + selenium 所以你可能还要装一些库才可以使用,至于怎么装库,聪明的你肯定没有问题。   看见提示缺什么,你就装什么,用pip3 和 brew就可以轻松安装了
We have a gui interface so no worry. The output result is in result_final.csv   我们有个图形界面所以不用担心,结果在result_final.csv 中

You have to enter your linkedin account and password on the 20th & 21st line of linkedin_ver2.0.py code. To launch the scrapy, type python3 linkedin_ver2.0.py on your command line 你需要在linkedin_ver2.0.py的第 20 和 21行中输入自己的linkedin 账号和密码才可以使用, 启动命令 python3 linkedin_ver2.0.py

About

This a scrapy project collecting millions of personal information from LinkedIn

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages