Kód: 06812870
Advances in technology have provided industry with an arrayof devices for collecting data. The frequency and scale of data collection means that there are now many large datasets being generated. To find patterns in these datasets ... celý popis
Nákupem získáte 151 bodů
Advances in technology have provided industry with an arrayof devices for collecting data. The frequency and scale of data collection means that there are now many large datasets being generated. To find patterns in these datasets it would be useful to be able to apply modern methods of classification such as support vector machines. Unfortunately these methods are computationally expensive, quadratic in the number of data points in fact, and socannot be applied directly. This book proposes a framework where by a variety of clustering methods can be used to summarise datasets, that is, reduce them to a smaller but still representative dataset so that these advanced methods can be applied. It compares the results of using this framework against using random selection on alarge number of classification and regression problems. Results show thatthe clustered datasets are on average fifty percent smaller than the original datasets without loss of classification accuracy which is significantly better than random selection. They also show that there is no free lunch, for each dataset it is important to choose a clustering method carefully.
Zařazení knihy Knihy v angličtině Mathematics & science Mathematics
1506 Kč
Osobní odběr Praha, Brno a 12903 dalších
Copyright ©2008-24 nejlevnejsi-knihy.cz Všechna práva vyhrazenaSoukromíCookies
Nákupní košík ( prázdný )