Machine Learning Problems: The Easy Parts

Contify

So, we implemented de-duplication algorithms to significantly reduce the resources required to process the information in these documents. The features are based on the frequency and importance of entities among other things (discussed at a later point in this post). We therefore had to write our own version of hierarchical clustering that returns a large number of tight clusters or duplicates. ... Inverse document frequency ... which are not useful for de-duplication.
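As a rough illustration of what "hierarchical clustering that returns a large number of tight clusters" could look like, here is a minimal sketch. It is not Contify's implementation: the whitespace tokenizer, the smoothed TF-IDF weighting, the complete-linkage rule, and the `min_similarity` cutoff are all assumptions chosen for the example, and the function names are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF dict per document (whitespace tokens, an assumption)."""
    tokenized = [doc.lower().split() for doc in docs]
    n = len(tokenized)
    # Document frequency: in how many documents each token appears.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Smoothed IDF, so a token found in every document keeps a small weight.
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def tight_clusters(docs, min_similarity=0.8):
    """Complete-linkage agglomerative clustering that stops merging early.

    Two clusters merge only if their least similar pair of documents is
    still at least `min_similarity` alike, so the result is many small,
    tight groups of near-duplicates rather than a few broad topics.
    """
    vecs = tfidf_vectors(docs)
    clusters = [[i] for i in range(len(docs))]
    while True:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                sim = min(
                    cosine(vecs[i], vecs[j])
                    for i in clusters[x]
                    for j in clusters[y]
                )
                if sim >= min_similarity and (best is None or sim > best[0]):
                    best = (sim, x, y)
        if best is None:
            return clusters  # no pair of clusters is similar enough to merge
        _, x, y = best
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]


docs = [
    "acme corp acquires widget maker",
    "acme corp acquires widget maker",
    "central bank raises interest rates",
]
clusters = tight_clusters(docs)
```

Each returned cluster is a list of document indices; a cluster with more than one index is a candidate duplicate group. The pairwise search is quadratic per merge, which is fine for a sketch but would need blocking or an index at production scale.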

14 Quick Tips for Kick-Ass Lead Management

Hubspot

Determine the validity of a lead. We definitely recommend de-duplicating leads based on email address at the very least, but you should also verify information such as zip code, phone number, and email address when possible to keep lead records up to date, and thus, functional. Watch how leads respond to the type of content you deliver, the method of communication, and the frequency and timing of the communication to improve your lead nurturing and shorten the sales cycle.
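De-duplicating leads on email address can be sketched in a few lines. The record layout (plain dicts with `email`, `zip`, and `phone` keys), the normalization rule (trim and lowercase), and the function name `dedupe_leads` are assumptions made for this example, not part of any HubSpot API.

```python
def dedupe_leads(leads):
    """Collapse lead records that share a normalized email address.

    The first non-empty value seen for each field is kept. Leads with no
    email cannot be matched, so they are passed through unchanged.
    """
    merged = {}
    unmatched = []
    for lead in leads:
        key = (lead.get("email") or "").strip().lower()
        if not key:
            unmatched.append(dict(lead))
            continue
        record = merged.setdefault(key, {"email": key})
        for field, value in lead.items():
            # Fill in fields the merged record is still missing.
            if field != "email" and value and not record.get(field):
                record[field] = value
    return list(merged.values()) + unmatched


leads = [
    {"email": "Ann@Example.com ", "zip": "02139"},
    {"email": "ann@example.com", "phone": "555-0100"},
    {"email": "bob@example.com"},
]
unique_leads = dedupe_leads(leads)
```

Here the two records for Ann collapse into one that carries both the zip code and the phone number. Verifying those fields against an external source, as the tip suggests, would be a separate step.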