I would like to include an example dataset (of Twitter tweets and metadata) in an R Package I'm writing.
I downloaded an example data.frame using the Twitter API and saved it as .RData (with the corresponding .R data description file) in my package.
When I run R CMD Check, I get the following NOTE,
* checking data for non-ASCII characters ... NOTE
Note: found 287 marked UTF-8 strings
I tried saving the data.frame with ASCII=TRUE, hoping this would fix the problem. But it persists. Any idea on how I can get R CMD CHECK to run without notes?
(also, I would be open to removing all UTF-8 marked strings from the example data if that's the solution). Thank you!
example row from data.frame:
First time in SF (@ San Francisco International Airport (SFO) - @flysfo in San Francisco, CA) https://t.co/1245xqxtwesr
favorited favoriteCount replyToSN created truncated replyToSID id replyToUID
1 FALSE 0 <NA> 2015-03-13 23:30:35 FALSE <NA> 576525795927179264 <NA>
statusSource screenName retweetCount isRetweet retweeted
1 <a href="http://foursquare.com" rel="nofollow">Foursquare</a> my_name93 0 FALSE FALSE
longitude latitude
1 -122.38100052 37.61865062