SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC trunk/tm/ChangeLog revision 875, Sat Dec 6 13:25:03 2008 UTC
# Line 1  Line 1 
1    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/source.R: Fixed non-standard call evaluation.
4    
5    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/source.R (URISource): New source class for a single document.
8    
9    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
10    
11            * R/source.R: Refactoring.
12    
13    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
14    
15            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
16            Rmpi installations more gracefully.
17    
18    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
19    
20            * R/source.R (Source): Add Length slot.
21    
22    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
23    
24            * R/AAA.R: Unify duplicated .onLoad function.
25    
26    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
27    
28            * DESCRIPTION (Suggests): Added Rmpi.
29    
30    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
31    
32            * R/source.R (getElem): Fix 'no visible binding' warning.
33    
34            * man/WeightFunction.Rd: Fix signature.
35    
36    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/weight.R: Introduce name abbreviations for weighting functions.
39    
40    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
43    
44            * R/cluster.R: Provide convenience functions for using a MPI
45            cluster.
46    
47            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
48            available.
49    
50            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
51            available.
52    
53    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
54    
55            * R/textdoccol.R (lapply): Removed debug print out.
56    
57    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
58    
59            * R/reader.R (readRCV1): Improved meta data extraction from
60            Reuters Corpus Volume 1 documents.
61    
62    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
63    
64            * R/transform.R: Ensure that all mappings preserve multiline
65            structures.
66    
67    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
68    
69            * R/filter.R: Every filter has now an attribute indicating whether
70            it sould be applied to document level (doclevel).
71    
72            * R/textdoccol.R (tmFilter): Set searchFullText as new default
73            filter.
74    
75    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
76    
77            * R/transform.R (replacePatterns): Replaced removeWords by
78            replacePatterns. Suggested by Christian Buchta.
79    
80            * R/textdoccol.R (inspect): Improved formatting.
81    
82    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
83    
84            * inst/CITATION: Updated JSS article information.
85    
86            * R/textdoccol.R (setAs): Added coerce method from list to
87            corpus.
88    
89            * R/meta.R (meta): Improved meta data handling.
90    
91    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
92    
93            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
94            Christian Buchta.
95    
96            * inst/CITATION: Added template to include JSS article reference.
97    
98    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
99    
100            * R/textdoccol.R (tmMap): Introduced lazy mapping.
101    
102            * R/source.R: Added VectorSource.
103    
104    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
105    
106            * man/: Language codes should be in ISO 639-1 format.
107    
108            * R/textdoccol.R (asPlain): Preserve local meta data.
109    
110    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
111    
112            * R/textdoccol.R (writeCorpus): Function for writing a corpus
113            containing plain text documents to disk.
114    
115    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
116    
117            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
118            always set correctly.
119    
120            * R/textdoccol.R: Set load = TRUE as default for load on demand
121            since in most cases this is the wanted behaviour.
122    
123    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
124    
125            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
126    
127            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
128    
129    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
130    
131            * R/meta.R (meta): New function for consistent access to meta data
132            of document collections, repositories, and texts.
133    
134    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
135    
136            * R/: Better support for encodings.
137    
138    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
139    
140            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
141            selection when no reader argument is given.
142    
143    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
144    
145            * R/source.R (CSVSource): Now uses read.csv instead of scan
146            internally.
147    
148    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
149    
150            * R/reader.R (getReaders): Returns available reader functions.
151    
152            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
153            as default.
154    
155    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * R/stopwords.R (stopwords): Shortened code, removed codetools
158            variable warnings.
159    
160            * man/: Documentation for showMeta, added an example for tmMap.
161    
162            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
163            some minor typos fixed.
164    
165    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
166    
167            * R/aobjects.R (showMeta): Added method for pretty printing a
168            text document's meta data.
169    
170    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/textdoccol.R (TextDocCol): Better handling of empty
173            arguments.
174    
175            * NAMESPACE: Exported readDOC.
176    
177            * man/completeStems.Rd: Added an example.
178    
179    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * R/stopwords.R (stopwords): Look up .dat files at every
182            call. Allows users to modify stopword .dat files interactively.
183    
184    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
185    
186            * R/termdocmatrix.R (termFreq): Correct processing of empty
187            documents.
188    
189    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * man/: Updated documentation.
192    
193    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * R/complete.R (completeStems): Completes (heuristically) word
196            stems.
197    
198            * R/termdocmatrix.R (TermDocMatrix2): New modular
199            constructor.
200    
201            * NAMESPACE: Exported termFreq.
202    
203    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * R/reader.R (readDOC): Added MS Word reader (using antiword).
206    
207    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/weight.R: Weighting functions for TermDocMatrix.
210    
211    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
212    
213            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
214            functions for accessing dimension, column, and row names.
215    
216            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
217    
218    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
221    
222    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
225    
226    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
227    
228            * R/reader.R (readPDF): Removed manual checks for pdftotext and
229            pdfinfo. The system call gives a warning anyway.
230    
231    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/textdoccol.R (asPlain): Conversion from
234            StructuredTextDocuments to PlainTextDocuments.
235    
236    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
237    
238            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
239            for accessing term-document matrices.
240    
241            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
242            are installed.
243    
244    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
247            Christian Buchta.
248    
249    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
252    
253    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
256    
257            * R/reader.R (readPDF): Added PDF reader.
258    
259    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
260    
261            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
262    
263            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
264    
265            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
266    
267            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
268    
269    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * R/distmeasure.R (dissimilarity): Replaced dists call from
272            package cba by new dist call from package proxy.
273    
274    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
277    
278    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/termdocmatrix.R: require() uses the quietly option to suppress
281            loading messages.
282    
283    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
284    
285            * R/dictionary.R: Added dictionary support.
286    
287    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
288    
289            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
290            documents. This simplifies some functions, e.g., asPlain.
291    
292    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
293    
294            * inst/doc/tm.Rnw: Fixed some typos in vignette.
295    
296    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
297    
298            * R/textdoccol.R (replaceWords): Added method to replace a set of
299            words by a single word. Useful for synonyms.
300    
301    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
302    
303            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
304    
305    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
306    
307            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
308            vectors. Thanks to Ariel Maguyon for his error report.
309            (removeSparseTerms): New function to remove columns from a
310            term-document matrix exceeding a sparse factor.
311    
312    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
315    
316    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
317    
318            * man/sFilter.Rd: Corrected documentation on statement format (use
319            '==' instead of '=').
320    
321    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * R/aobjects.R (StructuredTextDocument): Inherits from
324            TextDocument.
325    
326    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
329            on sparse matrices as proposed by Martin Maechler.
330    
331    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
332    
333            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
334            \pkg{filehash} version makes them deprecated.
335    
336    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
337    
338            * R/termdocmatrix.R (textvector): Stemming is now performed before
339            erasing stopwords.
340            (weightMatrix): Adapted to handle sparse matrices.
341            (TermDocMatrix): Sparse matrix is now efficiently built by
342            direct stepwise insertion of row values into it.
343    
344    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
345    
346            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
347            due to ongoing problems. For our purposes the latter is as useful
348            as the replaced package.
349    
350    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
353    
354            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
355    
356    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
359            languages with available stopwords.
360    
361    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * inst/doc/tm.Rnw: Minor corrections in the vignette.
364    
365    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * DESCRIPTION: Update to version 0.2, since a lot of new features
368            have been integrated.
369    
370            * inst/stopwords: Updated existing stopwords and added stopwords
371            for various other languages.
372    
373    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * man/: Updated documentation.
376    
377            * Work/testDb.R: Script to test database stuff.
378    
379            * R/: Fixed various database related bugs. Seems to be rather
380            useable now, i.e., consider as alpha status for now.
381    
382    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384            * R/: Fixed some bugs related to database support.
385    
386    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
387    
388            * man/: Added a lot of examples to the manuals.
389    
390    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
391    
392            * man/: Updated parts of the documentation.
393    
394            * R/textdoccol.R (asPlain): Added conversion from newsgroup
395            documents to plain text documents.
396    
397    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
398    
399            * R/textdoccol.R: Finished experimental database support. Not yet
400            intensively tested.
401    
402            * R/source.R: Now each source has a default reader.
403    
404            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
405            class anymore.
406    
407            * R/plaintextdoc.R: Custom show method for plain text documents.
408    
409            * R/aobjects.R: Added a class for structured text documents.
410    
411            * R/reader.R: Replaced remaining \code{parser} occurrences with
412            \code{reader}.
413    
414            * R/textdoccol.R (summary): Indent tags.
415    
416            * R/textdoccol.R (removePunctuation): Transform method to remove
417            punctuation marks.
418    
419    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
420    
421            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
422            using prescindMeta().
423    
424    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
425    
426            * R/textdoccol.R: Improved database support.
427    
428    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
429    
430            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
431    
432            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
433            language code.
434    
435            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
436            into parserControl argument.
437    
438            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
439    
440    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
441    
442            * Work/tmDataSetup.R: The datasets acq and crude can now be
443            created on the fly.
444    
445            * R/stopwords.R: Introduced a function returning the stopwords for
446            a given language (English, German and French at the moment)
447    
448            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
449            otherwise falls back to Snowball package.
450    
451    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
452    
453            * man/dissimilarity-methods.Rd: Make clear that any method offered
454            by "dists" from package "cba" can be used.
455    
456    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
457    
458            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
459            to Kurt's latex suggestion. Removed points and underscores in
460            variable names for consistent naming.
461    
462            * DESCRIPTION: Update to version 0.1-2.
463    
464            * man/TextRepository.Rd: Fixed bug in documentation.
465    
466    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
467    
468            * DESCRIPTION: Update to version 0.1-1.
469    
470    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
471    
472            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
473            wordStem.
474    
475    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
476    
477            * R/: Changes due to Kurt's review.
478    
479    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
480    
481            * R/: Implemented improvements based upon comments by David
482            Meyer.
483    
484    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * inst/doc/: Rewrote vignette.
487    
488            * man/: Improved documentation.
489    
490    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
491    
492            * man/: Updated documentation.
493    
494            * DESCRIPTION: Changed package name to "tm". Updated version to
495            0.1 for first CRAN release.
496    
497            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
498            list archive example.
499    
500            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
501            archive example.
502    
503            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
504            from (several mails per box) mbox format to (single mail per file)
505            eml format.
506    
507    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
508    
509            * data/crude.rda: Rebuilt.
510    
511            * data/acq.rda: Rebuilt.
512    
513            * R/reader.R: Factored out reader and parser methods from
514            textdoccol.R.
515    
516            * R/source.R: Factored out Source methods from aobjects.R and
517            textdoccol.R.
518            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
519            feeds.
520    
521            * R/textdoccol.R (DirSource): Added support for recursive
522            traversal of directories.
523    
524    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
525    
526            * R/textdoccol.R ([[): Loads the document corpus automatically
527            into memory upon access.
528            (tm_transform, tm_filter): Removed several checks whether the
529            document is already loaded ([[ ensures this now).
530            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
531            mailing list archive.
532    
533    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
534    
535            * R/aobjects.R (TextDocument): Is now a virtual class.
536            (Source): Is now a virtual class.
537    
538    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
539    
540            * R/textdoccol.R (c): Support for an arbitrary number of document
541            collections.
542    
543    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
546            append_meta and remove_meta.
547    
548            * R/textdoccol.R: Removed modify_metadata method.
549    
550            * R/textrepo.R: Removed modify_metadata method.
551    
552            * R/textdoccol.R (remove_meta): Supports removal of document
553            collection metadata and document (= in data frame) metadata.
554    
555    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
556    
557            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
558    
559            * data/crude.rda: Rebuilt.
560    
561            * data/acq.rda: Rebuilt.
562    
563            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
564    
565            * R/textdoccol.R ([): Bug fix for subsetting a document
566            collection's data frame.
567    
568    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
569    
570            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
571            to s_filter.
572    
573            * R/textdoccol.R: Local text documents' metadata can now be copied
574            to a document collection's data frame with prescind_meta.
575    
576    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
577    
578            * R/: Text documents' slot metadata is now accessible in s_filter.
579    
580            * R/: Rewrote s_filter function (has still some restrictions).
581    
582    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
583    
584            * R/: Various fixes in handling metadata.
585    
586            * R/: Added update mechanism for text document collections.
587    
588    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
589    
590            * R/: Merging of document collections now creates a binary tree
591            for reconstructing merged document collections.
592    
593            * R/: Redesign of metadata for document collections.
594    
595    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
596    
597            * R/: Messages now use \code{ngettext}.
598    
599    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
600    
601            * R/: Added functions for modifying and removing metadata.
602    
603    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
604    
605            * man/: Updated some documentation.
606    
607            * R/: Corrected some connection issues.
608    
609            * inst/doc: Worked on the vignette.
610    
611    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
612    
613            * inst/: Added texts and started vignette.
614    
615            * R/: Final changes based upon David's comments.
616    
617    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
618    
619            * NAMESPACE: Corrected exports (generic methods need exportMethods
620            directives!).
621    
622    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
623    
624            * R/: Modified the TextDocCol constructur and various parsers. It
625            is now modular and supports various file formats via plugins (see
626            the new "Source" class).
627    
628    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
629    
630            * man/: Revised documentation after previous code changes.
631    
632    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
633    
634            * R/: Remaining changes as discussed with David.
635    
636    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
637    
638            * R/: Some changes as suggested by David. The rest will follow
639            within the next days.
640    
641    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
642    
643            * man/: Finished documentation.
644    
645    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
646    
647            * man/: Wrote some documentation.
648    
649    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
650    
651            * R/: Further syntactic sugar in form of additional assignment and
652            accessor methods.
653    
654    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/: Syntactic sugar in form of "length", "show" and "summary"
657            operators.
658    
659    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/: Diverse updates. Mainly on default operators ("[" or "c")
662            and dissimilarities.
663    
664    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
665    
666            * R/: Added similarity functions.
667    
668            * data/: Added english stopwords.
669    
670    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
671    
672            * data/: Examples compiled for new features
673    
674            * R/: Changes due to new structure.
675    
676            * NAMESPACE: Corrected namespace to reflect new structure.
677    
678            * R/termdocmatrix.R: Adapted for new naming scheme.
679    
680    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
681    
682            * R/textdoccol.R: Adapted code for new class structure. Wrote
683            several transform and filter functions operating on text document
684            collections (alias text document databases).
685    
686            * R/aobjects.R: Adapted class structure with inheritance,
687            repositories and additional meta data. Loading files on demand is
688            now possible.
689    
690    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
691    
692            * R/: Some cosmetic cleanups.
693    
694            * inst/: Removed vignette on clustering. That and much more is now
695            described in the JSS paper on text mining. Based upon that
696            article an elaborated vignette will be incorporated in the future.
697    
698    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
699    
700            * R/: Updated generic S4 methods to comply with signature changes
701            in newer versions of R (> 2.3)
702    
703    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
704    
705            * ext/R/importRIS.R: Automatic RIS import is now possible.
706    
707    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
708    
709            * R/textdoccol.R: Added RIS HTML input format.
710    
711    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
712    
713            * R/textdoccol.R: Removed bug that caused invalid text document
714            collections when handling many input files.
715    
716    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            * R/textdoccol.R: Restructured and extended file import
719            mechanism.
720    
721            * inst/doc/clustering.Rnw: Adapted vignette for use with
722            ReutNews.rda
723    
724            * man/ReutNews.Rd: Documentation for ReutNews.rda
725    
726            * data/ReutNews.rda: A tiny Reuters21578 example data set.
727    
728    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
729    
730            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
731            clustering facilities of this package.
732    
733    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
734    
735            * R/aobjects.R: Changed package document structure to avoid class
736            dependency problems.
737    
738    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
741            data set.
742    
743            *  Finished documentation and reordered directory structure. Now "R
744            CMD check textmin" works without errors.
745    
746    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
747    
748            * src/: Various splits can now be easily created for the
749            Reuters21578 data set.
750    
751    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            *  Updated documentation
754    
755    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
756    
757            *  Wrote R documentation for some classes and methods.
758    
759    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
760    
761            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
762            files. See the questionnaire data/Umfrage.csv for such an example.
763            We are now able to import files in Reuters-21578 XML format.
764    
765            *  Changed class interfaces in various files. Weighting of the text
766            matrix is now possible.
767    
768    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
769    
770            * R/textdoccol.R: One can build term-document matrices if
771            nessecary (with buildTDM(...)) and fill the field tdm from a text
772            document collection with it.
773    
774            * R/textmatrix.R: Wrote S4 class for term-document matrices.
775    
776    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
777    
778            * R/textdoccol.R: We now can read in a whole XML file with several
779            news items.
780    
781  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
782    
783          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.875

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge