Package 'dicecrawler'

3 downloads 0 Views 72KB Size Report
Jul 19, 2017 - getjobs(title, phrase, direct, areacode, country, state, skill, cityzip, ip, age, diceid, ... Specifies the United States Post Office state code for jobs.
Package ‘dicecrawler’ July 19, 2017 Type Package Title Downloads Job Descriptions from Dice.com Version 0.1.0 Description A Web crawler for . The function getjobs() automatically crawls Dice.com and downloads job descriptions based on the supplied parameters. The job data is returned via a data frame. JobSearch API supplied by Dice.com is used for retrieving job data. Depends jsonlite, rvest, curl, xml2 License GPL-3 Encoding UTF-8 LazyData true RoxygenNote 6.0.1 NeedsCompilation no Author Vlad Krotov [aut, cre, cph] Maintainer Vlad Krotov Repository CRAN Date/Publication 2017-07-19 16:55:26 UTC

R topics documented: getjobs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Index

2 4

1

2

getjobs

getjobs

Function for downloading job data from Dice.com

Description The function automatically crawls Dice.com and downloads job descriptions based on the supplied parameters. JobSearch API supplied by Dice.com is used to search for jobs. The job data is returned via a dataframe. Usage getjobs(title, phrase, direct, areacode, country, state, skill, cityzip, ip, age, diceid, sortcode, sortdirection) Arguments title

Required parameter. Specifies search text for the entire job description. This parameter is a string, value should be supplied in quotes.

phrase

Optional parameter. This parameteris used to specify whether the search should be for an exact phrase. The value "1" should be supplied to treat a job title as a phrase. If job title is a phrase (e.g. "data scientist"), then the title will be encoded as a phrase into the search URL. Otherwise, jobs containing both words ("data" and "scientist", in no particular order) will be retreived. This parameter is a string, value should be supplied in quotes.

direct

Optional parameter. If the value of this parameter is "1" then jobs returned will be for direct hires. This parameter is a string, value should be supplied in quotes.

areacode

Optional parameter. Specifies the area code for jobs to be retreived. This parameter is a string, value should be supplied in quotes.

country

Optional parameter. Specifies the ISO 3166 country code for for jobs to be retreived. This parameter is a string, value should be supplied in quotes.

state

Optional parameter. Specifies the United States Post Office state code for jobs to be retreived. This parameter is a string, value should be supplied in quotes.

skill

Optional parameter. Specifies the search text for the skill property of each job listing. This parameter is a string, value should be supplied in quotes.

cityzip

Optional parameter. Specifies the city are where the jobs to be retreived are listed. A zip code is required for this parameter. Jobs within 40 mile radius of the zip code will be retreived. This parameter is a string, value should be supplied in quotes.

ip

Optional parameter. Specifies an IP address that will be used to look up a geocode which will be used in the search. This parameter is a string, value should be supplied in quotes.

age

Optional parameter. Specifies the age of the posting in days This parameter is a string, value should be supplied in quotes.

getjobs

3

diceid

Optional parameter. Specifies Dice ID of a company posting jobs. Only jobs from that company will be retreived. This parameter is a string, value should be supplied in quotes.

sortcode

Optional parameter. Specifies how retreived jobs will be sorted: sortcode = "1" sorts by posted age; sortcode= "2" sorts by job title; sortcode = "3" sorts by company; sortcode = "4" sorts by location. This parameter is a string, value should be supplied in quotes.

sortdirection

Optional parameter. Specifies sort direction of the jobs retreived. If sortdirection="a", then sort order is ascending. If sortdirection="d", then sort order is descending. This parameter is a string, value should be supplied in quotes.

Details This function makes use of the following packages: jsonlite, rvest, curl. Only one parameter is required: ’title’. It takes approximately 1 second to download one job. Value The function returns a data frame named job_table. The dataframe contains data about jobs retreived. The data frame contains the following columns: JobURL, JobTitle, Company, JobLocation, JobDate, JobDescription Author(s) Vlad Krotov ([email protected]) Examples ## Download data scientist jobs in Kentucky. An exact match for "data scientist" is needed library(dicecrawler) jobsTX