The real drawbacks of such techniques are that they can't cope well with noisy data, and they tend to be so slow as to be unusable on anything but small artificial datasets.
There are three types of attributes: numeric, ordinal, and nominal. Numeric attributes have a notion of distance; ordinal attributes have an order but no notion of distance.
Interval quantities have values that are not only ordered but also measured in fixed and equal units.
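To make the distinction between attribute types concrete, here is a minimal Python sketch of which operations are meaningful for each type. The attribute values and the function names are hypothetical and chosen only for illustration; they are not taken from any particular dataset or library.

```python
# Hypothetical ordinal scale: the order is meaningful, the spacing is not.
TEMPERATURE_ORDER = ["cool", "mild", "hot"]


def numeric_distance(a: float, b: float) -> float:
    """Numeric attributes support a meaningful distance."""
    return abs(a - b)


def ordinal_less_than(a: str, b: str) -> bool:
    """Ordinal attributes support ordering comparisons, but not distances."""
    return TEMPERATURE_ORDER.index(a) < TEMPERATURE_ORDER.index(b)


def nominal_equal(a: str, b: str) -> bool:
    """Nominal attributes support only equality tests."""
    return a == b
```

For example, `numeric_distance(64, 85)` is a sensible quantity, while for the ordinal scale we can only say that "cool" comes before "hot", not by how much.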
ARFF files represent datasets that consist of independent, unordered instances.
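As a rough illustration of what an ARFF file looks like, the sketch below embeds a tiny made-up dataset and reads it with a hand-written parser. The dataset contents and the function name `parse_simple_arff` are assumptions for this example, and the parser handles only a small subset of the format (no quoted values, no sparse rows, no missing values); it is not a replacement for a real ARFF reader.

```python
ARFF_TEXT = """\
% Hypothetical toy dataset for illustration only
@relation weather

@attribute outlook {sunny, overcast, rainy}
@attribute temperature numeric
@attribute play {yes, no}

@data
sunny,85,no
overcast,83,yes
rainy,70,yes
"""

NUMERIC_TYPES = ("numeric", "real", "integer")


def parse_simple_arff(text):
    """Parse a small ARFF subset into (attributes, rows)."""
    attributes = []  # list of (name, declared type) pairs, in file order
    rows = []        # one dict per instance
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and comments
        lower = line.lower()
        if in_data:
            values = [v.strip() for v in line.split(",")]
            row = {}
            for (name, kind), value in zip(attributes, values):
                row[name] = float(value) if kind == "numeric" else value
            rows.append(row)
        elif lower.startswith("@attribute"):
            _, name, kind = line.split(None, 2)
            kind = kind.strip()
            if kind.lower() in NUMERIC_TYPES:
                kind = "numeric"
            attributes.append((name, kind))
        elif lower.startswith("@data"):
            in_data = True
        # the @relation line is ignored in this sketch
    return attributes, rows


attributes, rows = parse_simple_arff(ARFF_TEXT)
```

Each row after `@data` is one instance. Because the instances are independent and unordered, shuffling `rows` would describe the same dataset.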
The problem is to decide which attributes to leave out without affecting the final decision.
Today I downloaded a copy of the GAUSS software, which aims to provide mathematical and statistical tools for rapid development.
“Social media” Web sites such as Flickr, del.icio.us, YouTube, MySpace, and Facebook contain various forms of user-generated content, from plaintext to rich multimedia. Such sites let users place tags, recommendations, and comments onto user-generated content and form tightly knit user communities based on shared interests.
• ranking of user-generated content;
• graph analysis of social corpora;
• social content recommendations;
• personalization of social search;
• fusion of social content with other sources; and
• social corporate analysis and mining.
His research interests lie in measurement, modeling, algorithms, and analytics for large heterogeneous data sets such as the Web.
