I'm always looking for ways to download data from the internet into R. Though I prefer to host and access plain-text data sets (CSV is my personal favourite) from GitHub (see my short paper on the topic) sometimes it's convenient to get data stored on Dropbox.
There has been a change in the way Dropbox URLs work and I just added some functionality to the repmis R package. So I though that I'ld write a quick post on how to directly download data from Dropbox into R.
The download method is different depending on whether or not your plain-text data is in a Dropbox Public folder or not.
Dropbox Public Folder
Dropbox is trying to do away with its public folders. New users need to actively create a Public folder. Regardless, sometimes you may want to download data from one. It used to be that files in Public folders were accessible through non-secure (http) URLs. It's easy to download these into R, just use the read.table
command, where the URL is the file name. Dropbox recently changed Public links to be secure (https) URLs. These cannot be accessed with read.table
.
Instead you need can use the source_data
command from repmis:
FinURL <-"https://dl.dropbox.com/u/12581470/code/Replicability_code/Fin_Trans_Replication_Journal/Data/public.fin.msm.model.csv"
# Download data
FinRegulatorData <- repmis::source_data(FinURL,
sep = ",",
header = TRUE)
Non-Public Dropbox Folders
Getting data from a non-Public folder into R was a trickier. When you click on a Dropbox-based file's Share Link button you are taken to a secure URL, but not for the file itself. The Dropbox webpage you're taken to is filled with lots of other Dropbox information. I used to think that accessing a plain-text data file embedded in one of these webpages would require some tricky web scrapping. Luckily, today I ran across this blog post by Kay Cichini.
With some modifications I was able to easily create a function that could download data from non-Public Dropbox folders. The source_DropboxData
command is in the most recent version of repmis (v0.2.4) is the result. All you need to know is the name of the file you want to download and its Dropbox key. You can find both of these things in the URL for the webpage that appears when you click on Share Link
. Here is an example:
https://www.dropbox.com/s/exh4iobbm2p5p1v/fin_research_note.csv
The file name is at the very end (fin_research_note.csv
) and the key is the string of letters and numbers in the middle (exh4iobbm2p5p1v
). Now we have all of the information we need for source_DropboxData
:
FinDataFull <- repmis::source_DropboxData("fin_research_note.csv",
"exh4iobbm2p5p1v",
sep = ",",
header = TRUE)
Comments
Correct me if I'm wrong, but can you use rDrop db.read.csv to access data if you don't have the user's credentials?
Cheers,
Bob
Magnificent blog I visit this blog it's extremely wonderful. Interestingly, in this blog content composed plainly and reasonable. The substance of data is useful.
Oracle Fusion HCM Online Training
Oracle Fusion SCM Online Training
Oracle Fusion Financials Online Training
Big Data and Hadoop Training In Hyderabad
oracle fusion financials classroom training
Oracle Fusion HCM Classroom Training
oracle cpq online training / Oracle CPQ Class Room Training
Oracle Taleo Online Training
Best Website Development service In Noida
Website Designing service In Noida
Best digital marketing service In Noida
Best digital marketing Company in Noida
Indian Bookmarking list
Indian Bookmarking list
India Classified Submission List
Indian Classified List
Data Analytics Course Online 360DigiTMG
Cyber Security Training in Bangalore
Data Analytics Course in Bangalore
Data Science Course
vé máy bay đi Mỹ giá bao nhiêu
chuyến bay hồi hương từ mỹ về việt nam
vé đà nẵng giá rẻ
vé máy bay đi đà lạt vietnam airline
giá vé máy bay đi phú quốc vietnam airline
Digital Marketing Course in Janakpuri.
business analytics course
best data science institute in hyderabad
Digital Marketing Course in Hyderabad
best data science institute in hyderabad
DevOps Training in Hyderabad
DevOps Course in Hyderabad
AWS Training in Hyderabad
Become a Data Science Expert with us.We provide Classroom training on IBM Certified Data Science at Hyderabad for the individuals who believe hand-held training. We teach as per the Indian Standard Time (IST) with In-depth practical Knowledge on each topic in classroom training, 80 – 90 Hrs of Real-time practical training classes.
Data Science Training in Hyderabad
Data Science Course in Hyderabad
AWS Training in Hyderabad
AWS Course in Hyderabad
Python Certification in Hyderabad
Machine Learning Certification in Hyderabad
Oracle Fusion HCM Training
Workday Training
Okta Training
Palo Alto Training
Adobe Analytics Training
Informatica MDM Training in Chennai
Informatica MDM Training Institute In Chennai
data scientist course
digital marketing training in hyderabad
Data Engineering Course in India
Data Scientist Training in Bangalore
ai course in hyderabad
Data Science Training in Jaipur
Scrivener Crack
EarthView Crack
ProgDVB Professional Crack
Dropbox Crack
Data Science Course in Durgapur
UnoGeeks Offers the best Oracle Fusion Financials Training in the market today. If you want to become Expert Fusion Financials Consultant, Enrol in the Oracle Fusion Financials Online Training offered by UnoGeeks.
I have bookmarked your website because this site contains valuable information in it. I am really happy with articles quality and presentation. Thanks a lot for keeping great stuff. I am very much thankful for this site.data science training in jabalpur
릴게임사이트
ai course
Thanks for sharing an amazing and informative post. The information shared by you is really useful for me. Keep it up to do great work and hope to see more of your posts in the near future.
read more :semi frameless shower door
Thanks & Regards,
BroadMind - Best Study abroad consultant in Chennai