tag:blogger.com,1999:blog-7583720.post1980141423705900564..comments2024-03-17T13:30:18.387-07:00Comments on Salmon Run: Exploring Nutch-GORA with CassandraSujit Palhttp://www.blogger.com/profile/06835223352394332155noreply@blogger.comBlogger6125tag:blogger.com,1999:blog-7583720.post-87294231700422214892014-01-29T08:49:28.208-08:002014-01-29T08:49:28.208-08:00Hi Rajnikant, when I last looked at GORA, the MySQ...Hi Rajnikant, when I last looked at GORA, the MySQL module was not very stable and there were explicit warnings to not use it. Not sure if things have changed since. My use of MySQL is fairly limited (I use it as a repository for application data, almost never CLOB/BLOB) and hasn't exposed this kind of error, so I can't say for sure, but a quick Google search for "increase mysql Sujit Palhttps://www.blogger.com/profile/06835223352394332155noreply@blogger.comtag:blogger.com,1999:blog-7583720.post-25771602094205921742014-01-28T21:00:48.366-08:002014-01-28T21:00:48.366-08:00Hi
I am crawling a complete website. i have done...Hi <br /><br />I am crawling a complete website. i have done all configuration for nutch and solr.<br /><br />Crawling is start but after one day it stop and shows an exception <br /><br />com.mysql.jdbc.exceptions.jdbc4.MySQLDataException:<br /><br />mysql buffersize outofrange.<br />how i can increase mysqlcachebuffer size for crawling a complete(large) website<br /><br />Thanks<br />Rajni kantrajnikanthttps://www.blogger.com/profile/04880465522663331920noreply@blogger.comtag:blogger.com,1999:blog-7583720.post-20761590128096016872014-01-12T11:52:14.975-08:002014-01-12T11:52:14.975-08:00You are welcome, glad it helped.You are welcome, glad it helped.Sujit Palhttps://www.blogger.com/profile/06835223352394332155noreply@blogger.comtag:blogger.com,1999:blog-7583720.post-91116131076761909602014-01-11T16:28:55.850-08:002014-01-11T16:28:55.850-08:00Thank you very much for helping me learn so many n...Thank you very much for helping me learn so many new things.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-7583720.post-14607146691583160052012-01-13T23:42:42.796-08:002012-01-13T23:42:42.796-08:00Thanks for the pointers Julien. I did not know the...Thanks for the pointers Julien. I did not know the first property (now I know where to look for the others as well). I adapted some code based on the meta_url example in the Writing Plugins wiki page, and <a href="http://sujitpal.blogspot.com/2012/01/nutchgora-scoring-and-indexing-plugins.html" rel="nofollow">wrote about it here</a>.Sujit Palhttps://www.blogger.com/profile/06835223352394332155noreply@blogger.comtag:blogger.com,1999:blog-7583720.post-18888621635320984782012-01-09T08:33:37.234-08:002012-01-09T08:33:37.234-08:00Good to see people using Nutch-Gora!
These issues...Good to see people using Nutch-Gora!<br /><br />These issues are commonly asked on the mailing list<br /><br />1. See property db.ignore.external.links<br /><br />2&3 Google for the urlmeta plugin<br /><br />Cheers<br /><br />JulienJulien Niochehttps://www.blogger.com/profile/16499716503708780310noreply@blogger.com