Hello everyone,
I am currently working on a SAP HANA on AWS implementation, and am using Java with the twitter4j library, along with the ngdbc.jar file (obtained from the SAP HANA startup focus program). What I would like to do is to store the tweets of the live twitter stream onto the SAP HANA database. At present, I am obtaining tweets from the Twitter stream, then inserting these tweets with individual insert statements into the database when the number of tweets obtained crosses a threshold; then obtaining more tweets, then inserting these with individual insert statements; and so on. However, this seems to be taking an extremely long time, as a result of which quite a few tweets are being missed out on. Could someone please suggest a more efficient approach as to how to go about this, either by using a feature of SAP HANA that I may have missed, or even of Amazon AWS, or perhaps using an alternate library or method of doing things?
Thanks