tag:blogger.com,1999:blog-7583720.post2986321075851124250..comments2024-03-05T03:17:02.289-08:00Comments on Salmon Run: MapReduce with Python and mrjob on Amazon EMRSujit Palhttp://www.blogger.com/profile/06835223352394332155noreply@blogger.comBlogger3125tag:blogger.com,1999:blog-7583720.post-33796873835176803002014-06-10T16:45:14.731-07:002014-06-10T16:45:14.731-07:00Just came across this when looking for something e...Just came across this when looking for something else. Here is a script to install nltk (and its dependencies) on AWS's S3, maybe consider replacing the bootstrap_cmds with this one if the original does not work (it did for me, but Shreyas FNU indicated that it didn't for him):<br /><br />s3://awsdocs/gettingstarted/latest/sentiment/config-nltk.sh<br /><br />Inside the script, are a bunchSujit Palhttps://www.blogger.com/profile/06835223352394332155noreply@blogger.comtag:blogger.com,1999:blog-7583720.post-68748097759867574122014-05-03T07:40:42.520-07:002014-05-03T07:40:42.520-07:00Your configuration may be a bit off, as someone ob...Your configuration may be a bit off, as someone observed on your SO question. The configuration I show worked for me, although I remember that it barfed on the stopwords list because it expected the stopwords corpus to be loaded. It had taken me a while to figure out how to bootstrap nltk, so instead of trying to load the corpus via bootstrap, I simply replaced it with an explicit list. I Sujit Palhttps://www.blogger.com/profile/06835223352394332155noreply@blogger.comtag:blogger.com,1999:blog-7583720.post-22189382389857045002014-05-02T22:23:21.063-07:002014-05-02T22:23:21.063-07:00Hi, I am trying to use nltk like you did, but alas...Hi, I am trying to use nltk like you did, but alas doesn't work for me. Here is my question: http://stackoverflow.com/questions/23440564/bootstrapping-libraries-on-emr-using-python-mrjob<br /><br />Any comments would be wonderfulAnonymoushttps://www.blogger.com/profile/12092587833682705277noreply@blogger.com