SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC trunk/tm/ChangeLog revision 876, Sat Dec 6 15:58:01 2008 UTC
# Line 1  Line 1 
1    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/source.R (DataframeSource): New source class for data frames.
4    
5            * R/source.R: Fixed non-standard call evaluation.
6    
7    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
8    
9            * R/source.R (URISource): New source class for a single document.
10    
11    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
12    
13            * R/source.R: Refactoring.
14    
15    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
16    
17            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
18            Rmpi installations more gracefully.
19    
20    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
21    
22            * R/source.R (Source): Add Length slot.
23    
24    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
25    
26            * R/AAA.R: Unify duplicated .onLoad function.
27    
28    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
29    
30            * DESCRIPTION (Suggests): Added Rmpi.
31    
32    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/source.R (getElem): Fix 'no visible binding' warning.
35    
36            * man/WeightFunction.Rd: Fix signature.
37    
38    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
39    
40            * R/weight.R: Introduce name abbreviations for weighting functions.
41    
42    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
43    
44            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
45    
46            * R/cluster.R: Provide convenience functions for using a MPI
47            cluster.
48    
49            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
50            available.
51    
52            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
53            available.
54    
55    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/textdoccol.R (lapply): Removed debug print out.
58    
59    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
60    
61            * R/reader.R (readRCV1): Improved meta data extraction from
62            Reuters Corpus Volume 1 documents.
63    
64    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
65    
66            * R/transform.R: Ensure that all mappings preserve multiline
67            structures.
68    
69    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
70    
71            * R/filter.R: Every filter has now an attribute indicating whether
72            it sould be applied to document level (doclevel).
73    
74            * R/textdoccol.R (tmFilter): Set searchFullText as new default
75            filter.
76    
77    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
78    
79            * R/transform.R (replacePatterns): Replaced removeWords by
80            replacePatterns. Suggested by Christian Buchta.
81    
82            * R/textdoccol.R (inspect): Improved formatting.
83    
84    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
85    
86            * inst/CITATION: Updated JSS article information.
87    
88            * R/textdoccol.R (setAs): Added coerce method from list to
89            corpus.
90    
91            * R/meta.R (meta): Improved meta data handling.
92    
93    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
94    
95            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
96            Christian Buchta.
97    
98            * inst/CITATION: Added template to include JSS article reference.
99    
100    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
101    
102            * R/textdoccol.R (tmMap): Introduced lazy mapping.
103    
104            * R/source.R: Added VectorSource.
105    
106    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * man/: Language codes should be in ISO 639-1 format.
109    
110            * R/textdoccol.R (asPlain): Preserve local meta data.
111    
112    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
113    
114            * R/textdoccol.R (writeCorpus): Function for writing a corpus
115            containing plain text documents to disk.
116    
117    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
118    
119            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
120            always set correctly.
121    
122            * R/textdoccol.R: Set load = TRUE as default for load on demand
123            since in most cases this is the wanted behaviour.
124    
125    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
126    
127            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
128    
129            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
130    
131    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
132    
133            * R/meta.R (meta): New function for consistent access to meta data
134            of document collections, repositories, and texts.
135    
136    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
137    
138            * R/: Better support for encodings.
139    
140    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
141    
142            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
143            selection when no reader argument is given.
144    
145    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
146    
147            * R/source.R (CSVSource): Now uses read.csv instead of scan
148            internally.
149    
150    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
151    
152            * R/reader.R (getReaders): Returns available reader functions.
153    
154            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
155            as default.
156    
157    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
158    
159            * R/stopwords.R (stopwords): Shortened code, removed codetools
160            variable warnings.
161    
162            * man/: Documentation for showMeta, added an example for tmMap.
163    
164            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
165            some minor typos fixed.
166    
167    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
168    
169            * R/aobjects.R (showMeta): Added method for pretty printing a
170            text document's meta data.
171    
172    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/textdoccol.R (TextDocCol): Better handling of empty
175            arguments.
176    
177            * NAMESPACE: Exported readDOC.
178    
179            * man/completeStems.Rd: Added an example.
180    
181    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/stopwords.R (stopwords): Look up .dat files at every
184            call. Allows users to modify stopword .dat files interactively.
185    
186    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
187    
188            * R/termdocmatrix.R (termFreq): Correct processing of empty
189            documents.
190    
191    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
192    
193            * man/: Updated documentation.
194    
195    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/complete.R (completeStems): Completes (heuristically) word
198            stems.
199    
200            * R/termdocmatrix.R (TermDocMatrix2): New modular
201            constructor.
202    
203            * NAMESPACE: Exported termFreq.
204    
205    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * R/reader.R (readDOC): Added MS Word reader (using antiword).
208    
209    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/weight.R: Weighting functions for TermDocMatrix.
212    
213    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
214    
215            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
216            functions for accessing dimension, column, and row names.
217    
218            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
219    
220    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
223    
224    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
227    
228    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * R/reader.R (readPDF): Removed manual checks for pdftotext and
231            pdfinfo. The system call gives a warning anyway.
232    
233    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/textdoccol.R (asPlain): Conversion from
236            StructuredTextDocuments to PlainTextDocuments.
237    
238    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
241            for accessing term-document matrices.
242    
243            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
244            are installed.
245    
246    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
247    
248            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
249            Christian Buchta.
250    
251    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
254    
255    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
256    
257            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
258    
259            * R/reader.R (readPDF): Added PDF reader.
260    
261    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
264    
265            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
266    
267            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
268    
269            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
270    
271    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/distmeasure.R (dissimilarity): Replaced dists call from
274            package cba by new dist call from package proxy.
275    
276    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
279    
280    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * R/termdocmatrix.R: require() uses the quietly option to suppress
283            loading messages.
284    
285    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * R/dictionary.R: Added dictionary support.
288    
289    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
292            documents. This simplifies some functions, e.g., asPlain.
293    
294    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
295    
296            * inst/doc/tm.Rnw: Fixed some typos in vignette.
297    
298    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * R/textdoccol.R (replaceWords): Added method to replace a set of
301            words by a single word. Useful for synonyms.
302    
303    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
304    
305            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
306    
307    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
308    
309            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
310            vectors. Thanks to Ariel Maguyon for his error report.
311            (removeSparseTerms): New function to remove columns from a
312            term-document matrix exceeding a sparse factor.
313    
314    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
317    
318    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * man/sFilter.Rd: Corrected documentation on statement format (use
321            '==' instead of '=').
322    
323    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * R/aobjects.R (StructuredTextDocument): Inherits from
326            TextDocument.
327    
328    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
329    
330            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
331            on sparse matrices as proposed by Martin Maechler.
332    
333    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
336            \pkg{filehash} version makes them deprecated.
337    
338    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * R/termdocmatrix.R (textvector): Stemming is now performed before
341            erasing stopwords.
342            (weightMatrix): Adapted to handle sparse matrices.
343            (TermDocMatrix): Sparse matrix is now efficiently built by
344            direct stepwise insertion of row values into it.
345    
346    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
347    
348            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
349            due to ongoing problems. For our purposes the latter is as useful
350            as the replaced package.
351    
352    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
355    
356            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
357    
358    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
359    
360            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
361            languages with available stopwords.
362    
363    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
364    
365            * inst/doc/tm.Rnw: Minor corrections in the vignette.
366    
367    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
368    
369            * DESCRIPTION: Update to version 0.2, since a lot of new features
370            have been integrated.
371    
372            * inst/stopwords: Updated existing stopwords and added stopwords
373            for various other languages.
374    
375    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
376    
377            * man/: Updated documentation.
378    
379            * Work/testDb.R: Script to test database stuff.
380    
381            * R/: Fixed various database related bugs. Seems to be rather
382            useable now, i.e., consider as alpha status for now.
383    
384    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
385    
386            * R/: Fixed some bugs related to database support.
387    
388    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
389    
390            * man/: Added a lot of examples to the manuals.
391    
392    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
393    
394            * man/: Updated parts of the documentation.
395    
396            * R/textdoccol.R (asPlain): Added conversion from newsgroup
397            documents to plain text documents.
398    
399    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * R/textdoccol.R: Finished experimental database support. Not yet
402            intensively tested.
403    
404            * R/source.R: Now each source has a default reader.
405    
406            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
407            class anymore.
408    
409            * R/plaintextdoc.R: Custom show method for plain text documents.
410    
411            * R/aobjects.R: Added a class for structured text documents.
412    
413            * R/reader.R: Replaced remaining \code{parser} occurrences with
414            \code{reader}.
415    
416            * R/textdoccol.R (summary): Indent tags.
417    
418            * R/textdoccol.R (removePunctuation): Transform method to remove
419            punctuation marks.
420    
421    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
422    
423            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
424            using prescindMeta().
425    
426    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
427    
428            * R/textdoccol.R: Improved database support.
429    
430    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
431    
432            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
433    
434            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
435            language code.
436    
437            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
438            into parserControl argument.
439    
440            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
441    
442    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * Work/tmDataSetup.R: The datasets acq and crude can now be
445            created on the fly.
446    
447            * R/stopwords.R: Introduced a function returning the stopwords for
448            a given language (English, German and French at the moment)
449    
450            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
451            otherwise falls back to Snowball package.
452    
453    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
454    
455            * man/dissimilarity-methods.Rd: Make clear that any method offered
456            by "dists" from package "cba" can be used.
457    
458    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
459    
460            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
461            to Kurt's latex suggestion. Removed points and underscores in
462            variable names for consistent naming.
463    
464            * DESCRIPTION: Update to version 0.1-2.
465    
466            * man/TextRepository.Rd: Fixed bug in documentation.
467    
468    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * DESCRIPTION: Update to version 0.1-1.
471    
472    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
473    
474            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
475            wordStem.
476    
477    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
478    
479            * R/: Changes due to Kurt's review.
480    
481    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
482    
483            * R/: Implemented improvements based upon comments by David
484            Meyer.
485    
486    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
487    
488            * inst/doc/: Rewrote vignette.
489    
490            * man/: Improved documentation.
491    
492    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
493    
494            * man/: Updated documentation.
495    
496            * DESCRIPTION: Changed package name to "tm". Updated version to
497            0.1 for first CRAN release.
498    
499            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
500            list archive example.
501    
502            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
503            archive example.
504    
505            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
506            from (several mails per box) mbox format to (single mail per file)
507            eml format.
508    
509    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
510    
511            * data/crude.rda: Rebuilt.
512    
513            * data/acq.rda: Rebuilt.
514    
515            * R/reader.R: Factored out reader and parser methods from
516            textdoccol.R.
517    
518            * R/source.R: Factored out Source methods from aobjects.R and
519            textdoccol.R.
520            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
521            feeds.
522    
523            * R/textdoccol.R (DirSource): Added support for recursive
524            traversal of directories.
525    
526    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * R/textdoccol.R ([[): Loads the document corpus automatically
529            into memory upon access.
530            (tm_transform, tm_filter): Removed several checks whether the
531            document is already loaded ([[ ensures this now).
532            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
533            mailing list archive.
534    
535    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
536    
537            * R/aobjects.R (TextDocument): Is now a virtual class.
538            (Source): Is now a virtual class.
539    
540    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
541    
542            * R/textdoccol.R (c): Support for an arbitrary number of document
543            collections.
544    
545    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
546    
547            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
548            append_meta and remove_meta.
549    
550            * R/textdoccol.R: Removed modify_metadata method.
551    
552            * R/textrepo.R: Removed modify_metadata method.
553    
554            * R/textdoccol.R (remove_meta): Supports removal of document
555            collection metadata and document (= in data frame) metadata.
556    
557    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
558    
559            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
560    
561            * data/crude.rda: Rebuilt.
562    
563            * data/acq.rda: Rebuilt.
564    
565            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
566    
567            * R/textdoccol.R ([): Bug fix for subsetting a document
568            collection's data frame.
569    
570    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
573            to s_filter.
574    
575            * R/textdoccol.R: Local text documents' metadata can now be copied
576            to a document collection's data frame with prescind_meta.
577    
578    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/: Text documents' slot metadata is now accessible in s_filter.
581    
582            * R/: Rewrote s_filter function (has still some restrictions).
583    
584    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
585    
586            * R/: Various fixes in handling metadata.
587    
588            * R/: Added update mechanism for text document collections.
589    
590    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
591    
592            * R/: Merging of document collections now creates a binary tree
593            for reconstructing merged document collections.
594    
595            * R/: Redesign of metadata for document collections.
596    
597    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
598    
599            * R/: Messages now use \code{ngettext}.
600    
601    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
602    
603            * R/: Added functions for modifying and removing metadata.
604    
605    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            * man/: Updated some documentation.
608    
609            * R/: Corrected some connection issues.
610    
611            * inst/doc: Worked on the vignette.
612    
613    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
614    
615            * inst/: Added texts and started vignette.
616    
617            * R/: Final changes based upon David's comments.
618    
619    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * NAMESPACE: Corrected exports (generic methods need exportMethods
622            directives!).
623    
624    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * R/: Modified the TextDocCol constructur and various parsers. It
627            is now modular and supports various file formats via plugins (see
628            the new "Source" class).
629    
630    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
631    
632            * man/: Revised documentation after previous code changes.
633    
634    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
635    
636            * R/: Remaining changes as discussed with David.
637    
638    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
639    
640            * R/: Some changes as suggested by David. The rest will follow
641            within the next days.
642    
643    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
644    
645            * man/: Finished documentation.
646    
647    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
648    
649            * man/: Wrote some documentation.
650    
651    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
652    
653            * R/: Further syntactic sugar in form of additional assignment and
654            accessor methods.
655    
656    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
657    
658            * R/: Syntactic sugar in form of "length", "show" and "summary"
659            operators.
660    
661    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
662    
663            * R/: Diverse updates. Mainly on default operators ("[" or "c")
664            and dissimilarities.
665    
666    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
667    
668            * R/: Added similarity functions.
669    
670            * data/: Added english stopwords.
671    
672    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
673    
674            * data/: Examples compiled for new features
675    
676            * R/: Changes due to new structure.
677    
678            * NAMESPACE: Corrected namespace to reflect new structure.
679    
680            * R/termdocmatrix.R: Adapted for new naming scheme.
681    
682    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
683    
684            * R/textdoccol.R: Adapted code for new class structure. Wrote
685            several transform and filter functions operating on text document
686            collections (alias text document databases).
687    
688            * R/aobjects.R: Adapted class structure with inheritance,
689            repositories and additional meta data. Loading files on demand is
690            now possible.
691    
692    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
693    
694            * R/: Some cosmetic cleanups.
695    
696            * inst/: Removed vignette on clustering. That and much more is now
697            described in the JSS paper on text mining. Based upon that
698            article an elaborated vignette will be incorporated in the future.
699    
700    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * R/: Updated generic S4 methods to comply with signature changes
703            in newer versions of R (> 2.3)
704    
705    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
706    
707            * ext/R/importRIS.R: Automatic RIS import is now possible.
708    
709    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * R/textdoccol.R: Added RIS HTML input format.
712    
713    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * R/textdoccol.R: Removed bug that caused invalid text document
716            collections when handling many input files.
717    
718    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/textdoccol.R: Restructured and extended file import
721            mechanism.
722    
723            * inst/doc/clustering.Rnw: Adapted vignette for use with
724            ReutNews.rda
725    
726            * man/ReutNews.Rd: Documentation for ReutNews.rda
727    
728            * data/ReutNews.rda: A tiny Reuters21578 example data set.
729    
730    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
731    
732            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
733            clustering facilities of this package.
734    
735    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/aobjects.R: Changed package document structure to avoid class
738            dependency problems.
739    
740    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
741    
742            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
743            data set.
744    
745            *  Finished documentation and reordered directory structure. Now "R
746            CMD check textmin" works without errors.
747    
748    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
749    
750            * src/: Various splits can now be easily created for the
751            Reuters21578 data set.
752    
753    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
754    
755            *  Updated documentation
756    
757    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
758    
759            *  Wrote R documentation for some classes and methods.
760    
761    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
764            files. See the questionnaire data/Umfrage.csv for such an example.
765            We are now able to import files in Reuters-21578 XML format.
766    
767            *  Changed class interfaces in various files. Weighting of the text
768            matrix is now possible.
769    
770    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
771    
772            * R/textdoccol.R: One can build term-document matrices if
773            nessecary (with buildTDM(...)) and fill the field tdm from a text
774            document collection with it.
775    
776            * R/textmatrix.R: Wrote S4 class for term-document matrices.
777    
778    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
779    
780            * R/textdoccol.R: We now can read in a whole XML file with several
781            news items.
782    
783  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
784    
785          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.876

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge