Data retrieval is the process of extracting specific, precisely defined pieces of information from the web or from data warehouses. An example is to "retrieve all the male clients of a certain company who are between 30 and 40 years old". This usually requires SQL-like queries and does not require any statistical operations. Data retrieval is a confirmatory (verification-driven) approach: the question is fully specified in advance.
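A minimal sketch of such a retrieval query, using Python's standard `sqlite3` module. The `clients` table, its columns, the sample rows, and the `retrieve_clients` helper are all hypothetical and only illustrate the idea: the criteria are fixed in advance and no statistics are involved.

```python
import sqlite3

def retrieve_clients(conn, company, gender, min_age, max_age):
    """Return the names of clients matching exact, pre-specified criteria."""
    query = (
        "SELECT name FROM clients "
        "WHERE company = ? AND gender = ? AND age BETWEEN ? AND ? "
        "ORDER BY name"
    )
    return [row[0] for row in conn.execute(query, (company, gender, min_age, max_age))]

# Hypothetical in-memory data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE clients (name TEXT, company TEXT, gender TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO clients VALUES (?, ?, ?, ?)",
    [
        ("Aldo", "Acme", "M", 35),
        ("Bruno", "Acme", "M", 52),
        ("Carla", "Acme", "F", 33),
        ("Dario", "Other", "M", 38),
        ("Elio", "Acme", "M", 30),
    ],
)

# BETWEEN is inclusive on both ends, so ages 30 and 40 both qualify.
matches = retrieve_clients(conn, "Acme", "M", 30, 40)
print(matches)
```

The query either returns the matching rows or it does not; nothing new about the data is discovered, which is what makes the approach confirmatory.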

In contrast, data mining refers to the process of extracting more general, statistical information from a dataset. Examples are to "describe the characteristics of the clients" or to find "which companies compete in the same market as this company". Data mining is an exploratory approach. The process often includes techniques from machine learning, statistics, pattern recognition, etc.
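As a contrast with a fixed query, here is a small exploratory sketch: a one-dimensional k-means clustering that looks for age groups among clients without being told what the groups are. K-means is chosen here only as one possible illustration of a data mining technique; the `kmeans_1d` helper, the sample ages, and the choice of two starting centers are assumptions made for the example.

```python
from statistics import mean

def kmeans_1d(values, centers, iterations=10):
    """Assign each value to its nearest center, then move each center to its group's mean."""
    centers = list(centers)
    groups = [[] for _ in centers]
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Keep the old center if a group ends up empty.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

# Hypothetical client ages, for illustration only.
ages = [22, 24, 25, 27, 58, 60, 61, 63]

# Start from the youngest and oldest client as initial guesses.
centers, groups = kmeans_1d(ages, centers=[min(ages), max(ages)])
print(centers)
print(groups)
```

Here the result (two age segments and their typical ages) is a description that emerges from the data rather than an answer to a pre-specified question.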


#math/statistics #ai/data-science

Resources:

  • Luca Salvati, Adele Sateriano - Analisi dei Dati p. 11