Commit ee35ccbd, authored Sep 21, 2018 by [zhangzhiwei]
Add Zhihu answer crawling
parent 144dcd3b
Showing 1 changed file with 45 additions and 15 deletions
src/main/java/com/zhiwei/media_data_crawler/data/DataCrawler.java
 package com.zhiwei.media_data_crawler.data;
 import java.net.Proxy;
+import java.util.ArrayList;
+import java.util.Date;
 import java.util.List;
+import java.util.Map;
-import com.zhiwei.media_data_crawler.crawler.BaiduNewsCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.BaiduTiebaCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.DoubanCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.SoCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.SoNewsCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.SougouNewsCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.SougouZhihuCrawlerParse;
-import com.zhiwei.media_data_crawler.crawler.TianYaCrawlerParse;
-import com.zhiwei.media_data_crawler.entity.DouBanData;
-import com.zhiwei.media_data_crawler.entity.LunTanData;
-import com.zhiwei.media_data_crawler.entity.NewsData;
-import com.zhiwei.media_data_crawler.entity.TiebaData;
-import com.zhiwei.media_data_crawler.entity.ZhiHuData;
+import com.zhiwei.media_data_crawler.crawler.*;
+import com.zhiwei.media_data_crawler.entity.*;
+import com.zhiwei.tools.tools.ZhiWeiTools;
+import org.jsoup.Jsoup;
+import org.jsoup.nodes.Document;
+import org.jsoup.select.Elements;
 public class DataCrawler {
@@ -342,6 +338,40 @@ public class DataCrawler {
             return null;
         }
     }
+    /**
+     * Fetch Zhihu answer data.
+     * @param url
+     * @param endDate
+     * @param proxy
+     * @return
+     * @throws Exception
+     */
+    public static List<ZhihuAnswer> getAnswerList(String url, Date endDate, Proxy proxy) throws Exception {
+        try {
+            return ZhihuAnwserCrawlerParse.getAnswerList(url, endDate, proxy);
+        } catch (Exception e) {
+            throw e;
+        }
+    }
+    /**
+     * Fetch a single page of Zhihu answer data.
+     * @param url
+     * @param page
+     * @param endDate
+     * @param proxy
+     * @return
+     * @throws Exception
+     */
+    public static Map<String, Object> getAnswerList(String url, int page, Date endDate, Proxy proxy) throws Exception {
+        try {
+            return ZhihuAnwserCrawlerParse.getAnswerList(url, page, endDate, proxy);
+        } catch (Exception e) {
+            throw e;
+        }
+    }
 }
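Both added methods catch Exception only to rethrow it unchanged. The sketch below suggests that this wrapper behaves the same as delegating directly. It is a minimal illustration, not the project's code: `StubParser` is a hypothetical stand-in for `ZhihuAnwserCrawlerParse` (whose source is not part of this commit), `String` stands in for `ZhihuAnswer`, and the names `DelegationSketch`, `wrapped`, `direct`, and `outcome` are invented for the example.

```java
import java.net.Proxy;
import java.util.ArrayList;
import java.util.Date;
import java.util.List;

class DelegationSketch {

    /** Hypothetical stand-in for ZhihuAnwserCrawlerParse; the real class is not shown in this commit. */
    static class StubParser {
        static List<String> getAnswerList(String url, Date endDate, Proxy proxy) throws Exception {
            if (url == null || url.isEmpty()) {
                throw new IllegalArgumentException("url must not be empty");
            }
            List<String> answers = new ArrayList<>();
            answers.add("answer from " + url);
            return answers;
        }
    }

    /** Same shape as the methods in the commit: catch, then rethrow the same exception. */
    static List<String> wrapped(String url, Date endDate, Proxy proxy) throws Exception {
        try {
            return StubParser.getAnswerList(url, endDate, proxy);
        } catch (Exception e) {
            throw e;
        }
    }

    /** Direct delegation: exceptions propagate to the caller without a catch block. */
    static List<String> direct(String url, Date endDate, Proxy proxy) throws Exception {
        return StubParser.getAnswerList(url, endDate, proxy);
    }

    /** Runs one variant and summarizes the result or the exception as a string. */
    static String outcome(boolean useWrapper, String url) {
        try {
            List<String> answers = useWrapper
                    ? wrapped(url, new Date(0L), Proxy.NO_PROXY)
                    : direct(url, new Date(0L), Proxy.NO_PROXY);
            return "ok:" + answers;
        } catch (Exception e) {
            return "error:" + e.getClass().getSimpleName() + ":" + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Success path and failure path, each through both variants.
        System.out.println(outcome(true, "https://example.com/question/1"));
        System.out.println(outcome(false, "https://example.com/question/1"));
        System.out.println(outcome(true, ""));
        System.out.println(outcome(false, ""));
    }
}
```

If the wrapper is kept, one plausible reason is to leave a place for logging or translating exceptions later. Otherwise the direct form is shorter and reports the same failures to callers.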