I assume , I will have to use objectId timestamp to get incremental data. Still waiting to get access to new data source. So do not know if there is any timestamp field in JSON document body yet.
Did not find much info on MongoDB & Sqoop connectivity. Does sqoop work with MongoDB Hadoop Connector? If sqoop works with mongoDB Hadoop connector how should the query look like to pull incremental data?
Do you mean the document objectId timestamp or a timestamp field in the JSON document body?
I am trying to find any information on using Sqoop to import data from MongoDB. I am looking to do incremental import based on timestamp from MongoDB and want to import all document sets into Hadoop. If anyone is doing this already, any pointers on how to do it, sample code, any issues encountered will be helpful.
Thanks in advance,