Annotation of /trunk/R/textmin/ChangeLog

Revision 23
Original Path: trunk/R/trunk/ChangeLog

1 : feinerer 21 2005-11-19 Ingo Feinerer <h0125130@wu-wien.ac.at>
2 :    
3 : feinerer 22 * R/textdoccol.R: Constructor of textdoccol allows import of CSV
4 :     files. See the questionnaire file data/Umfrage.csv for an example.
5 : feinerer 23 We are now able to import files in Reuters-21578 XML format.
6 : feinerer 22
7 : feinerer 21 * Changed class interfaces in various files. Weighting of the text
8 :     matrix is now possible.
9 :    
10 : feinerer 20 2005-11-08 Ingo Feinerer <h0125130@wu-wien.ac.at>
11 :    
12 :     * R/textdoccol.R: One can build term-document matrices if
13 :     necessary (with buildTDM(...)) and fill the field tdm from a text
14 :     document collection with it.
15 :    
16 :     * R/textmatrix.R: Wrote S4 class for term-document matrices.
17 :    
18 : feinerer 19 2005-11-06 Ingo Feinerer <h0125130@wu-wien.ac.at>
19 :    
20 :     * R/textdoccol.R: We can now read in a whole XML file with several
21 :     news items.
22 :    
23 : feinerer 17 2005-11-05 Ingo Feinerer <h0125130@wu-wien.ac.at>
24 :    
25 :     * R/textdoccol.R: Set up an S4 class for a collection of text
26 :     documents. A first attempt to read in XML input (like the RCV1
27 :     set) was made.
28 :    
29 :     * R/textdocument.R: Set up an S4 class for text documents. Wrote
30 :     some accessor functions.
31 :    
32 :     * data/newsitem.xml: Added this XML file for testing purposes. It
33 :     contains a single news item from the Reuters Corpus Volume 1
34 :     (RCV1) XML set.
35 :    
36 : feinerer 16 2005-10-07 Ingo Feinerer <h0125130@wu-wien.ac.at>
37 :    
38 :     * R/textmatrix.R (textmatrix): Removed the transpose of the original
39 :     textmatrix as k-means clustering provided by R (kmeans) now works on
40 :     this textmatrix. The result is a k-means text clustering with a
41 :     similarity measure based upon word frequencies.
42 :    
43 :     2005-10-05 Ingo Feinerer <h0125130@wu-wien.ac.at>
44 :    
45 :     * R/textmatrix.R: Adapted the preprocessing code from the R
46 :     package "lsa" written by Fridolin Wild to build a document text matrix.
47 :    
48 :     2005-10-02 Ingo Feinerer <h0125130@wu-wien.ac.at>
49 :    
50 :     * Set up the R Text Mining Package infrastructure.
