[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC
trunk/tm/ChangeLog revision 881, Sat Dec 20 09:06:13 2008 UTC
1    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
4    
5    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
8    
9    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
10    
11            * R/source.R (DataframeSource): New source class for data frames.
12    
13            * R/source.R: Fixed non-standard call evaluation.
14    
15    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
16    
17            * R/source.R (URISource): New source class for a single document.
18    
19    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
20    
21            * R/source.R: Refactoring.
22    
23    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
24    
25            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
26            Rmpi installations more gracefully.
27    
28    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
29    
30            * R/source.R (Source): Add Length slot.
31    
32    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/AAA.R: Unify duplicated .onLoad function.
35    
36    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
37    
38            * DESCRIPTION (Suggests): Added Rmpi.
39    
40    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/source.R (getElem): Fix 'no visible binding' warning.
43    
44            * man/WeightFunction.Rd: Fix signature.
45    
46    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/weight.R: Introduce name abbreviations for weighting functions.
49    
50    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
53    
54            * R/cluster.R: Provide convenience functions for using an MPI
55            cluster.
56    
57            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
58            available.
59    
60            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
61            available.
62    
63    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
64    
65            * R/textdoccol.R (lapply): Removed debug print out.
66    
67    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
68    
69            * R/reader.R (readRCV1): Improved meta data extraction from
70            Reuters Corpus Volume 1 documents.
71    
72    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
73    
74            * R/transform.R: Ensure that all mappings preserve multiline
75            structures.
76    
77    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
78    
79            * R/filter.R: Every filter now has an attribute indicating whether
80            it should be applied at document level (doclevel).
81    
82            * R/textdoccol.R (tmFilter): Set searchFullText as new default
83            filter.
84    
85    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
86    
87            * R/transform.R (replacePatterns): Replaced removeWords by
88            replacePatterns. Suggested by Christian Buchta.
89    
90            * R/textdoccol.R (inspect): Improved formatting.
91    
92    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
93    
94            * inst/CITATION: Updated JSS article information.
95    
96            * R/textdoccol.R (setAs): Added coerce method from list to
97            corpus.
98    
99            * R/meta.R (meta): Improved meta data handling.
100    
101    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
102    
103            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
104            Christian Buchta.
105    
106            * inst/CITATION: Added template to include JSS article reference.
107    
108    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
109    
110            * R/textdoccol.R (tmMap): Introduced lazy mapping.
111    
112            * R/source.R: Added VectorSource.
113    
114    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
115    
116            * man/: Language codes should be in ISO 639-1 format.
117    
118            * R/textdoccol.R (asPlain): Preserve local meta data.
119    
120    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
121    
122            * R/textdoccol.R (writeCorpus): Function for writing a corpus
123            containing plain text documents to disk.
124    
125    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
126    
127            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
128            always set correctly.
129    
130            * R/textdoccol.R: Set load = TRUE as default for load on demand
131            since in most cases this is the wanted behaviour.
132    
133    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
134    
135            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
136    
137            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
138    
139    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
140    
141            * R/meta.R (meta): New function for consistent access to meta data
142            of document collections, repositories, and texts.
143    
144    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
145    
146            * R/: Better support for encodings.
147    
148    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
149    
150            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
151            selection when no reader argument is given.
152    
153    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/source.R (CSVSource): Now uses read.csv instead of scan
156            internally.
157    
158    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
159    
160            * R/reader.R (getReaders): Returns available reader functions.
161    
162            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
163            as default.
164    
165    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
166    
167            * R/stopwords.R (stopwords): Shortened code, removed codetools
168            variable warnings.
169    
170            * man/: Documentation for showMeta, added an example for tmMap.
171    
172            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
173            some minor typos fixed.
174    
175    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * R/aobjects.R (showMeta): Added method for pretty printing a
178            text document's meta data.
179    
180    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
181    
182            * R/textdoccol.R (TextDocCol): Better handling of empty
183            arguments.
184    
185            * NAMESPACE: Exported readDOC.
186    
187            * man/completeStems.Rd: Added an example.
188    
189    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * R/stopwords.R (stopwords): Look up .dat files at every
192            call. Allows users to modify stopword .dat files interactively.
193    
194    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/termdocmatrix.R (termFreq): Correct processing of empty
197            documents.
198    
199    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * man/: Updated documentation.
202    
203    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * R/complete.R (completeStems): Completes (heuristically) word
206            stems.
207    
208            * R/termdocmatrix.R (TermDocMatrix2): New modular
209            constructor.
210    
211            * NAMESPACE: Exported termFreq.
212    
213    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
214    
215            * R/reader.R (readDOC): Added MS Word reader (using antiword).
216    
217    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
218    
219            * R/weight.R: Weighting functions for TermDocMatrix.
220    
221    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
222    
223            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
224            functions for accessing dimension, column, and row names.
225    
226            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
227    
228    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
231    
232    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
235    
236    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
237    
238            * R/reader.R (readPDF): Removed manual checks for pdftotext and
239            pdfinfo. The system call gives a warning anyway.
240    
241    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/textdoccol.R (asPlain): Conversion from
244            StructuredTextDocuments to PlainTextDocuments.
245    
246    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
247    
248            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
249            for accessing term-document matrices.
250    
251            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
252            are installed.
253    
254    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
257            Christian Buchta.
258    
259    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
260    
261            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
262    
263    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
264    
265            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
266    
267            * R/reader.R (readPDF): Added PDF reader.
268    
269    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
272    
273            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
274    
275            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
276    
277            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
278    
279    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/distmeasure.R (dissimilarity): Replaced dists call from
282            package cba by new dist call from package proxy.
283    
284    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
287    
288    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/termdocmatrix.R: require() uses the quietly option to suppress
291            loading messages.
292    
293    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/dictionary.R: Added dictionary support.
296    
297    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
298    
299            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
300            documents. This simplifies some functions, e.g., asPlain.
301    
302    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * inst/doc/tm.Rnw: Fixed some typos in vignette.
305    
306    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * R/textdoccol.R (replaceWords): Added method to replace a set of
309            words by a single word. Useful for synonyms.
310    
311    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
314    
315    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
316    
317            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
318            vectors. Thanks to Ariel Maguyon for his error report.
319            (removeSparseTerms): New function to remove columns from a
320            term-document matrix exceeding a sparse factor.
321    
322    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
325    
326    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * man/sFilter.Rd: Corrected documentation on statement format (use
329            '==' instead of '=').
330    
331    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
332    
333            * R/aobjects.R (StructuredTextDocument): Inherits from
334            TextDocument.
335    
336    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
337    
338            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
339            on sparse matrices as proposed by Martin Maechler.
340    
341    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
344            \pkg{filehash} version makes them deprecated.
345    
346    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
347    
348            * R/termdocmatrix.R (textvector): Stemming is now performed before
349            erasing stopwords.
350            (weightMatrix): Adapted to handle sparse matrices.
351            (TermDocMatrix): Sparse matrix is now efficiently built by
352            direct stepwise insertion of row values into it.
353    
354    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
355    
356            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
357            due to ongoing problems. For our purposes the latter is as useful
358            as the replaced package.
359    
360    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
363    
364            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
365    
366    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
367    
368            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
369            languages with available stopwords.
370    
371    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
372    
373            * inst/doc/tm.Rnw: Minor corrections in the vignette.
374    
375    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
376    
377            * DESCRIPTION: Update to version 0.2, since a lot of new features
378            have been integrated.
379    
380            * inst/stopwords: Updated existing stopwords and added stopwords
381            for various other languages.
382    
383    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * man/: Updated documentation.
386    
387            * Work/testDb.R: Script to test database stuff.
388    
389            * R/: Fixed various database related bugs. Seems to be rather
390            useable now, i.e., consider as alpha status for now.
391    
392    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
393    
394            * R/: Fixed some bugs related to database support.
395    
396    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
397    
398            * man/: Added a lot of examples to the manuals.
399    
400    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
401    
402            * man/: Updated parts of the documentation.
403    
404            * R/textdoccol.R (asPlain): Added conversion from newsgroup
405            documents to plain text documents.
406    
407    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/textdoccol.R: Finished experimental database support. Not yet
410            intensively tested.
411    
412            * R/source.R: Now each source has a default reader.
413    
414            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
415            class anymore.
416    
417            * R/plaintextdoc.R: Custom show method for plain text documents.
418    
419            * R/aobjects.R: Added a class for structured text documents.
420    
421            * R/reader.R: Replaced remaining \code{parser} occurrences with
422            \code{reader}.
423    
424            * R/textdoccol.R (summary): Indent tags.
425    
426            * R/textdoccol.R (removePunctuation): Transform method to remove
427            punctuation marks.
428    
429    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
430    
431            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
432            using prescindMeta().
433    
434    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
435    
436            * R/textdoccol.R: Improved database support.
437    
438    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
439    
440            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
441    
442            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
443            language code.
444    
445            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
446            into parserControl argument.
447    
448            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
449    
450    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * Work/tmDataSetup.R: The datasets acq and crude can now be
453            created on the fly.
454    
455            * R/stopwords.R: Introduced a function returning the stopwords for
456            a given language (English, German and French at the moment).
457    
458            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
459            otherwise falls back to Snowball package.
460    
461    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
462    
463            * man/dissimilarity-methods.Rd: Make clear that any method offered
464            by "dists" from package "cba" can be used.
465    
466    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
467    
468            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
469            to Kurt's latex suggestion. Removed points and underscores in
470            variable names for consistent naming.
471    
472            * DESCRIPTION: Update to version 0.1-2.
473    
474            * man/TextRepository.Rd: Fixed bug in documentation.
475    
476    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
477    
478            * DESCRIPTION: Update to version 0.1-1.
479    
480    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
481    
482            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
483            wordStem.
484    
485    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
486    
487            * R/: Changes due to Kurt's review.
488    
489    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
490    
491            * R/: Implemented improvements based upon comments by David
492            Meyer.
493    
494    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
495    
496            * inst/doc/: Rewrote vignette.
497    
498            * man/: Improved documentation.
499    
500    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
501    
502            * man/: Updated documentation.
503    
504            * DESCRIPTION: Changed package name to "tm". Updated version to
505            0.1 for first CRAN release.
506    
507            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
508            list archive example.
509    
510            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
511            archive example.
512    
513            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
514            from (several mails per box) mbox format to (single mail per file)
515            eml format.
516    
517    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
518    
519            * data/crude.rda: Rebuilt.
520    
521            * data/acq.rda: Rebuilt.
522    
523            * R/reader.R: Factored out reader and parser methods from
524            textdoccol.R.
525    
526            * R/source.R: Factored out Source methods from aobjects.R and
527            textdoccol.R.
528            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
529            feeds.
530    
531            * R/textdoccol.R (DirSource): Added support for recursive
532            traversal of directories.
533    
534    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
535    
536            * R/textdoccol.R ([[): Loads the document corpus automatically
537            into memory upon access.
538            (tm_transform, tm_filter): Removed several checks whether the
539            document is already loaded ([[ ensures this now).
540            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
541            mailing list archive.
542    
543    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/aobjects.R (TextDocument): Is now a virtual class.
546            (Source): Is now a virtual class.
547    
548    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
549    
550            * R/textdoccol.R (c): Support for an arbitrary number of document
551            collections.
552    
553    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
554    
555            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
556            append_meta and remove_meta.
557    
558            * R/textdoccol.R: Removed modify_metadata method.
559    
560            * R/textrepo.R: Removed modify_metadata method.
561    
562            * R/textdoccol.R (remove_meta): Supports removal of document
563            collection metadata and document (= in data frame) metadata.
564    
565    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
566    
567            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
568    
569            * data/crude.rda: Rebuilt.
570    
571            * data/acq.rda: Rebuilt.
572    
573            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
574    
575            * R/textdoccol.R ([): Bug fix for subsetting a document
576            collection's data frame.
577    
578    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
581            to s_filter.
582    
583            * R/textdoccol.R: Local text documents' metadata can now be copied
584            to a document collection's data frame with prescind_meta.
585    
586    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
587    
588            * R/: Text documents' slot metadata is now accessible in s_filter.
589    
590            * R/: Rewrote s_filter function (has still some restrictions).
591    
592    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
593    
594            * R/: Various fixes in handling metadata.
595    
596            * R/: Added update mechanism for text document collections.
597    
598    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
599    
600            * R/: Merging of document collections now creates a binary tree
601            for reconstructing merged document collections.
602    
603            * R/: Redesign of metadata for document collections.
604    
605    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            * R/: Messages now use \code{ngettext}.
608    
609    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
610    
611            * R/: Added functions for modifying and removing metadata.
612    
613    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
614    
615            * man/: Updated some documentation.
616    
617            * R/: Corrected some connection issues.
618    
619            * inst/doc: Worked on the vignette.
620    
621    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
622    
623            * inst/: Added texts and started vignette.
624    
625            * R/: Final changes based upon David's comments.
626    
627    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
628    
629            * NAMESPACE: Corrected exports (generic methods need exportMethods
630            directives!).
631    
632    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
633    
634            * R/: Modified the TextDocCol constructor and various parsers. It
635            is now modular and supports various file formats via plugins (see
636            the new "Source" class).
637    
638    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
639    
640            * man/: Revised documentation after previous code changes.
641    
642    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
643    
644            * R/: Remaining changes as discussed with David.
645    
646    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
647    
648            * R/: Some changes as suggested by David. The rest will follow
649            within the next days.
650    
651    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
652    
653            * man/: Finished documentation.
654    
655    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
656    
657            * man/: Wrote some documentation.
658    
659    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/: Further syntactic sugar in form of additional assignment and
662            accessor methods.
663    
664    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
665    
666            * R/: Syntactic sugar in form of "length", "show" and "summary"
667            operators.
668    
669    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
670    
671            * R/: Diverse updates. Mainly on default operators ("[" or "c")
672            and dissimilarities.
673    
674    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
675    
676            * R/: Added similarity functions.
677    
678            * data/: Added English stopwords.
679    
680    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
681    
682            * data/: Examples compiled for new features.
683    
684            * R/: Changes due to new structure.
685    
686            * NAMESPACE: Corrected namespace to reflect new structure.
687    
688            * R/termdocmatrix.R: Adapted for new naming scheme.
689    
690    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
691    
692            * R/textdoccol.R: Adapted code for new class structure. Wrote
693            several transform and filter functions operating on text document
694            collections (alias text document databases).
695    
696            * R/aobjects.R: Adapted class structure with inheritance,
697            repositories and additional meta data. Loading files on demand is
698            now possible.
699    
700    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * R/: Some cosmetic cleanups.
703    
704            * inst/: Removed vignette on clustering. That and much more is now
705            described in the JSS paper on text mining. Based upon that
706            article an elaborated vignette will be incorporated in the future.
707    
708    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * R/: Updated generic S4 methods to comply with signature changes
711            in newer versions of R (> 2.3)
712    
713    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * ext/R/importRIS.R: Automatic RIS import is now possible.
716    
717    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
718    
719            * R/textdoccol.R: Added RIS HTML input format.
720    
721    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
722    
723            * R/textdoccol.R: Removed bug that caused invalid text document
724            collections when handling many input files.
725    
726    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
727    
728            * R/textdoccol.R: Restructured and extended file import
729            mechanism.
730    
731            * inst/doc/clustering.Rnw: Adapted vignette for use with
732            ReutNews.rda
733    
734            * man/ReutNews.Rd: Documentation for ReutNews.rda
735    
736            * data/ReutNews.rda: A tiny Reuters21578 example data set.
737    
738    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
741            clustering facilities of this package.
742    
743    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
744    
745            * R/aobjects.R: Changed package document structure to avoid class
746            dependency problems.
747    
748    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
749    
750            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
751            data set.
752    
753            *  Finished documentation and reordered directory structure. Now "R
754            CMD check textmin" works without errors.
755    
756    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * src/: Various splits can now be easily created for the
759            Reuters21578 data set.
760    
761    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            *  Updated documentation
764    
765    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
766    
767            *  Wrote R documentation for some classes and methods.
768    
769    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
770    
771            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
772            files. See the questionnaire data/Umfrage.csv for such an example.
773            We are now able to import files in Reuters-21578 XML format.
774    
775            *  Changed class interfaces in various files. Weighting of the text
776            matrix is now possible.
777    
778    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
779    
780            * R/textdoccol.R: One can build term-document matrices if
781            necessary (with buildTDM(...)) and fill the field tdm from a text
782            document collection with it.
783    
784            * R/textmatrix.R: Wrote S4 class for term-document matrices.
785    
786    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
787    
788            * R/textdoccol.R: We now can read in a whole XML file with several
789            news items.
790    
791    2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            * R/textdoccol.R: Set up an S4 class for a collection of text
