sajeevaravind's comments

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Thanks Vaibhaw, glad you find it useful. Our public beta launch in couple weeks will only have the web version. But the following release will have a Android app. We are already working on the UX for the app. Let us know if you need any particular usecase to be solved in the mobile version. Thanks for the question. Happy to clarify if you need more details.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Hi Vikas,

That's a good question. Spark is an inbox assistant which helps to improve the email tasks productivity. Eventhough the categorization of emails makes it looks like it is in the same space as Vaultedge, I think it is not. Vaultedge is a personal document organizer, which helps you to find your documents quickly irrespective of whereever it is. So we work across emails, cloud storage, offline devices etc. So in that sense, the purpose of these two tools are different at least as of now.

Thanks for the great question. Happy to answer any follow up questions.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Hi Shanu,

That's a good question. Part of the problem is addressed by few startups like meta.sc and mohiomap, all in infancy like us. They both allow search for files across all storage. But they don't attempt to categorize the data nor allow data of offline devices to be kept within them.

We are complimentary to Dropbox and Googledrive. Users can continue to keep their data where ever they are storing now. We just make the discovery easier.

Thanks, those are some great questions. Happy to answer any further questions you may have.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Glad you liked it. Vaultedge will categorize a document into "some" category 80% of the times. Remaining 20% of the documents will be put into a General category. Within the 80% of the classified documents, we have seen that the classification is 60-80% accurate and the accuracy rate varies within that range based on the type of documents user has. If the user has large number of documents unseen by Vaultedge before, then the accuracy lowers. But we have a continuous feedback system which learns based on user actions. If user changes the category to a different one, we learn from that and will apply that categorization to new set of documents for that user.

The initial target market is IT/Finance professionals who use multiple email accounts, cloud drives etc.

Thanks for the question. Happy to answer any follow up question you may have.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Vaultedge uses a combination of Machine learning and parsing to categorise documents. First level ML will identify the file as belonging to certain category and then that category focussed ML training set will give us the sub-categories. Sometimes we also do plain parsing to arrive at the category. But all these are automated, there is no manual intervention involved. Thanks for your question. Please feel free to ask any follow up questions .

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Though Vaultedge is hosted on cloud, it can support offline data sitting in laptops or usb drives. When connecting a offline device to Vaultedge, Vaultedge will ask user to install a client program. This client program will help in categorization and creation of search index of the offline data. Once categorized and indexed, periodically the client program will redo the operations when there are changes to the data. When user clicks on such a offline file, Vaultedge has the ability to pull the file from the offline storage if the device is connected. Users also have the option of keeping a copy of such offline data within Vaultedge cloud for faster access. Thanks for the question. Please let me know if you have further questions.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

I would like to understand a little better your comment about "metadata is not linked to a profile". Do you say by not linked to a profile, there shouldn't be a way for a hacker to figure out the metadata of a user? Or you mean "metadata" should be associated to individual accounts(say dropbox), so incase of a breach, only the metadata for that account is compromised and not for all the accounts of a user? Thanks for your time.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

Short answer is Vaultedge will create search index based on the content of the bill and also has additional knowledge that it is a cell phone bill from its classification algorithm. Put together this will allow user to search for "last month's cell phone bill".

Long answer: Let's say your bill is in pdf or html format. For Vaultedge initially it is nothing but a document, it has no idea that it is a bill. Vaultedge will extract the contents of this document and test it against our training data set using Machine Learning(ML). In this process, the document will be identified as a "Bill". Then it will apply it on a different training set which will identify it as a "cell phone" or "cable bill". Then we do further analysis and extract info like this bill is "for the month of march 2016". Now all of these info is used to construct the search index. So you can search now for "march bill" or "last months bill" to get to that bill. Please don't hesitate to clarify if anything is not clear.

sajeevaravind | 10 years ago | on: Apply HN: Vaultedge – a private Google for your private data

User has to connect his services to Vaultedge. Services can be cloud storage like dropbox and google drive, emails like gmail and outlook and offline storage like laptops and usb disks. Vaultedge will periodically check for any new files (docs only now) in these services and classifies them using Machine learning and also builds a search index for these files. Other than giving oAuth read access to these services, user need not do anything.

Thanks for asking. Happy to answer any follow up question you may have.

page 1