SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC trunk/tm/ChangeLog revision 872, Tue Nov 25 16:36:08 2008 UTC
# Line 1  Line 1 
1    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
4            Rmpi installations more gracefully.
5    
6    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
7    
8            * R/source.R (Source): Add Length slot.
9    
10    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/AAA.R: Unify duplicated .onLoad function.
13    
14    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
15    
16            * DESCRIPTION (Suggests): Added Rmpi.
17    
18    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
19    
20            * R/source.R (getElem): Fix 'no visible binding' warning.
21    
22            * man/WeightFunction.Rd: Fix signature.
23    
24    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
25    
26            * R/weight.R: Introduce name abbreviations for weighting functions.
27    
28    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
29    
30            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
31    
32            * R/cluster.R: Provide convenience functions for using a MPI
33            cluster.
34    
35            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
36            available.
37    
38            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
39            available.
40    
41    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
42    
43            * R/textdoccol.R (lapply): Removed debug print out.
44    
45    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
46    
47            * R/reader.R (readRCV1): Improved meta data extraction from
48            Reuters Corpus Volume 1 documents.
49    
50    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
51    
52            * R/transform.R: Ensure that all mappings preserve multiline
53            structures.
54    
55    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
56    
57            * R/filter.R: Every filter has now an attribute indicating whether
58            it sould be applied to document level (doclevel).
59    
60            * R/textdoccol.R (tmFilter): Set searchFullText as new default
61            filter.
62    
63    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
64    
65            * R/transform.R (replacePatterns): Replaced removeWords by
66            replacePatterns. Suggested by Christian Buchta.
67    
68            * R/textdoccol.R (inspect): Improved formatting.
69    
70    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
71    
72            * inst/CITATION: Updated JSS article information.
73    
74            * R/textdoccol.R (setAs): Added coerce method from list to
75            corpus.
76    
77            * R/meta.R (meta): Improved meta data handling.
78    
79    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
80    
81            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
82            Christian Buchta.
83    
84            * inst/CITATION: Added template to include JSS article reference.
85    
86    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
87    
88            * R/textdoccol.R (tmMap): Introduced lazy mapping.
89    
90            * R/source.R: Added VectorSource.
91    
92    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
93    
94            * man/: Language codes should be in ISO 639-1 format.
95    
96            * R/textdoccol.R (asPlain): Preserve local meta data.
97    
98    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
99    
100            * R/textdoccol.R (writeCorpus): Function for writing a corpus
101            containing plain text documents to disk.
102    
103    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
104    
105            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
106            always set correctly.
107    
108            * R/textdoccol.R: Set load = TRUE as default for load on demand
109            since in most cases this is the wanted behaviour.
110    
111    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
112    
113            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
114    
115            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
116    
117    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
118    
119            * R/meta.R (meta): New function for consistent access to meta data
120            of document collections, repositories, and texts.
121    
122    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
123    
124            * R/: Better support for encodings.
125    
126    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
129            selection when no reader argument is given.
130    
131    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
132    
133            * R/source.R (CSVSource): Now uses read.csv instead of scan
134            internally.
135    
136    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
137    
138            * R/reader.R (getReaders): Returns available reader functions.
139    
140            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
141            as default.
142    
143    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
144    
145            * R/stopwords.R (stopwords): Shortened code, removed codetools
146            variable warnings.
147    
148            * man/: Documentation for showMeta, added an example for tmMap.
149    
150            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
151            some minor typos fixed.
152    
153    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/aobjects.R (showMeta): Added method for pretty printing a
156            text document's meta data.
157    
158    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
159    
160            * R/textdoccol.R (TextDocCol): Better handling of empty
161            arguments.
162    
163            * NAMESPACE: Exported readDOC.
164    
165            * man/completeStems.Rd: Added an example.
166    
167    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
168    
169            * R/stopwords.R (stopwords): Look up .dat files at every
170            call. Allows users to modify stopword .dat files interactively.
171    
172    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/termdocmatrix.R (termFreq): Correct processing of empty
175            documents.
176    
177    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
178    
179            * man/: Updated documentation.
180    
181    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/complete.R (completeStems): Completes (heuristically) word
184            stems.
185    
186            * R/termdocmatrix.R (TermDocMatrix2): New modular
187            constructor.
188    
189            * NAMESPACE: Exported termFreq.
190    
191    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
192    
193            * R/reader.R (readDOC): Added MS Word reader (using antiword).
194    
195    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/weight.R: Weighting functions for TermDocMatrix.
198    
199    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
202            functions for accessing dimension, column, and row names.
203    
204            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
205    
206    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
207    
208            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
209    
210    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
213    
214    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/reader.R (readPDF): Removed manual checks for pdftotext and
217            pdfinfo. The system call gives a warning anyway.
218    
219    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
220    
221            * R/textdoccol.R (asPlain): Conversion from
222            StructuredTextDocuments to PlainTextDocuments.
223    
224    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
227            for accessing term-document matrices.
228    
229            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
230            are installed.
231    
232    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
235            Christian Buchta.
236    
237    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
238    
239            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
240    
241    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
244    
245            * R/reader.R (readPDF): Added PDF reader.
246    
247    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
250    
251            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
252    
253            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
254    
255            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
256    
257    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/distmeasure.R (dissimilarity): Replaced dists call from
260            package cba by new dist call from package proxy.
261    
262    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
265    
266    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
267    
268            * R/termdocmatrix.R: require() uses the quietly option to suppress
269            loading messages.
270    
271    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/dictionary.R: Added dictionary support.
274    
275    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
278            documents. This simplifies some functions, e.g., asPlain.
279    
280    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * inst/doc/tm.Rnw: Fixed some typos in vignette.
283    
284    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/textdoccol.R (replaceWords): Added method to replace a set of
287            words by a single word. Useful for synonyms.
288    
289    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
292    
293    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
296            vectors. Thanks to Ariel Maguyon for his error report.
297            (removeSparseTerms): New function to remove columns from a
298            term-document matrix exceeding a sparse factor.
299    
300    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
303    
304    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * man/sFilter.Rd: Corrected documentation on statement format (use
307            '==' instead of '=').
308    
309    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * R/aobjects.R (StructuredTextDocument): Inherits from
312            TextDocument.
313    
314    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
317            on sparse matrices as proposed by Martin Maechler.
318    
319    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
320    
321            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
322            \pkg{filehash} version makes them deprecated.
323    
324    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
325    
326            * R/termdocmatrix.R (textvector): Stemming is now performed before
327            erasing stopwords.
328            (weightMatrix): Adapted to handle sparse matrices.
329            (TermDocMatrix): Sparse matrix is now efficiently built by
330            direct stepwise insertion of row values into it.
331    
332    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
335            due to ongoing problems. For our purposes the latter is as useful
336            as the replaced package.
337    
338    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
341    
342            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
343    
344    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
345    
346            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
347            languages with available stopwords.
348    
349    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
350    
351            * inst/doc/tm.Rnw: Minor corrections in the vignette.
352    
353    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
354    
355            * DESCRIPTION: Update to version 0.2, since a lot of new features
356            have been integrated.
357    
358            * inst/stopwords: Updated existing stopwords and added stopwords
359            for various other languages.
360    
361    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * man/: Updated documentation.
364    
365            * Work/testDb.R: Script to test database stuff.
366    
367            * R/: Fixed various database related bugs. Seems to be rather
368            useable now, i.e., consider as alpha status for now.
369    
370    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * R/: Fixed some bugs related to database support.
373    
374    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
375    
376            * man/: Added a lot of examples to the manuals.
377    
378    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
379    
380            * man/: Updated parts of the documentation.
381    
382            * R/textdoccol.R (asPlain): Added conversion from newsgroup
383            documents to plain text documents.
384    
385    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * R/textdoccol.R: Finished experimental database support. Not yet
388            intensively tested.
389    
390            * R/source.R: Now each source has a default reader.
391    
392            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
393            class anymore.
394    
395            * R/plaintextdoc.R: Custom show method for plain text documents.
396    
397            * R/aobjects.R: Added a class for structured text documents.
398    
399            * R/reader.R: Replaced remaining \code{parser} occurrences with
400            \code{reader}.
401    
402            * R/textdoccol.R (summary): Indent tags.
403    
404            * R/textdoccol.R (removePunctuation): Transform method to remove
405            punctuation marks.
406    
407    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
410            using prescindMeta().
411    
412    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * R/textdoccol.R: Improved database support.
415    
416    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
419    
420            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
421            language code.
422    
423            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
424            into parserControl argument.
425    
426            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
427    
428    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
429    
430            * Work/tmDataSetup.R: The datasets acq and crude can now be
431            created on the fly.
432    
433            * R/stopwords.R: Introduced a function returning the stopwords for
434            a given language (English, German and French at the moment)
435    
436            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
437            otherwise falls back to Snowball package.
438    
439    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
440    
441            * man/dissimilarity-methods.Rd: Make clear that any method offered
442            by "dists" from package "cba" can be used.
443    
444    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
445    
446            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
447            to Kurt's latex suggestion. Removed points and underscores in
448            variable names for consistent naming.
449    
450            * DESCRIPTION: Update to version 0.1-2.
451    
452            * man/TextRepository.Rd: Fixed bug in documentation.
453    
454    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
455    
456            * DESCRIPTION: Update to version 0.1-1.
457    
458    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
459    
460            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
461            wordStem.
462    
463    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
464    
465            * R/: Changes due to Kurt's review.
466    
467    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
468    
469            * R/: Implemented improvements based upon comments by David
470            Meyer.
471    
472    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
473    
474            * inst/doc/: Rewrote vignette.
475    
476            * man/: Improved documentation.
477    
478    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
479    
480            * man/: Updated documentation.
481    
482            * DESCRIPTION: Changed package name to "tm". Updated version to
483            0.1 for first CRAN release.
484    
485            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
486            list archive example.
487    
488            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
489            archive example.
490    
491            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
492            from (several mails per box) mbox format to (single mail per file)
493            eml format.
494    
495    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
496    
497            * data/crude.rda: Rebuilt.
498    
499            * data/acq.rda: Rebuilt.
500    
501            * R/reader.R: Factored out reader and parser methods from
502            textdoccol.R.
503    
504            * R/source.R: Factored out Source methods from aobjects.R and
505            textdoccol.R.
506            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
507            feeds.
508    
509            * R/textdoccol.R (DirSource): Added support for recursive
510            traversal of directories.
511    
512    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
513    
514            * R/textdoccol.R ([[): Loads the document corpus automatically
515            into memory upon access.
516            (tm_transform, tm_filter): Removed several checks whether the
517            document is already loaded ([[ ensures this now).
518            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
519            mailing list archive.
520    
521    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
522    
523            * R/aobjects.R (TextDocument): Is now a virtual class.
524            (Source): Is now a virtual class.
525    
526    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * R/textdoccol.R (c): Support for an arbitrary number of document
529            collections.
530    
531    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
532    
533            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
534            append_meta and remove_meta.
535    
536            * R/textdoccol.R: Removed modify_metadata method.
537    
538            * R/textrepo.R: Removed modify_metadata method.
539    
540            * R/textdoccol.R (remove_meta): Supports removal of document
541            collection metadata and document (= in data frame) metadata.
542    
543    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
546    
547            * data/crude.rda: Rebuilt.
548    
549            * data/acq.rda: Rebuilt.
550    
551            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
552    
553            * R/textdoccol.R ([): Bug fix for subsetting a document
554            collection's data frame.
555    
556    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
559            to s_filter.
560    
561            * R/textdoccol.R: Local text documents' metadata can now be copied
562            to a document collection's data frame with prescind_meta.
563    
564    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
565    
566            * R/: Text documents' slot metadata is now accessible in s_filter.
567    
568            * R/: Rewrote s_filter function (has still some restrictions).
569    
570    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * R/: Various fixes in handling metadata.
573    
574            * R/: Added update mechanism for text document collections.
575    
576    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
577    
578            * R/: Merging of document collections now creates a binary tree
579            for reconstructing merged document collections.
580    
581            * R/: Redesign of metadata for document collections.
582    
583    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
584    
585            * R/: Messages now use \code{ngettext}.
586    
587    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
588    
589            * R/: Added functions for modifying and removing metadata.
590    
591    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
592    
593            * man/: Updated some documentation.
594    
595            * R/: Corrected some connection issues.
596    
597            * inst/doc: Worked on the vignette.
598    
599    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
600    
601            * inst/: Added texts and started vignette.
602    
603            * R/: Final changes based upon David's comments.
604    
605    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            * NAMESPACE: Corrected exports (generic methods need exportMethods
608            directives!).
609    
610    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/: Modified the TextDocCol constructur and various parsers. It
613            is now modular and supports various file formats via plugins (see
614            the new "Source" class).
615    
616    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
617    
618            * man/: Revised documentation after previous code changes.
619    
620    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
621    
622            * R/: Remaining changes as discussed with David.
623    
624    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * R/: Some changes as suggested by David. The rest will follow
627            within the next days.
628    
629    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
630    
631            * man/: Finished documentation.
632    
633    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
634    
635            * man/: Wrote some documentation.
636    
637    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
638    
639            * R/: Further syntactic sugar in form of additional assignment and
640            accessor methods.
641    
642    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
643    
644            * R/: Syntactic sugar in form of "length", "show" and "summary"
645            operators.
646    
647    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
648    
649            * R/: Diverse updates. Mainly on default operators ("[" or "c")
650            and dissimilarities.
651    
652    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
653    
654            * R/: Added similarity functions.
655    
656            * data/: Added english stopwords.
657    
658    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
659    
660            * data/: Examples compiled for new features
661    
662            * R/: Changes due to new structure.
663    
664            * NAMESPACE: Corrected namespace to reflect new structure.
665    
666            * R/termdocmatrix.R: Adapted for new naming scheme.
667    
668    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * R/textdoccol.R: Adapted code for new class structure. Wrote
671            several transform and filter functions operating on text document
672            collections (alias text document databases).
673    
674            * R/aobjects.R: Adapted class structure with inheritance,
675            repositories and additional meta data. Loading files on demand is
676            now possible.
677    
678    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
679    
680            * R/: Some cosmetic cleanups.
681    
682            * inst/: Removed vignette on clustering. That and much more is now
683            described in the JSS paper on text mining. Based upon that
684            article an elaborated vignette will be incorporated in the future.
685    
686    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
687    
688            * R/: Updated generic S4 methods to comply with signature changes
689            in newer versions of R (> 2.3)
690    
691    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
692    
693            * ext/R/importRIS.R: Automatic RIS import is now possible.
694    
695    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
696    
697            * R/textdoccol.R: Added RIS HTML input format.
698    
699    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            * R/textdoccol.R: Removed bug that caused invalid text document
702            collections when handling many input files.
703    
704    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * R/textdoccol.R: Restructured and extended file import
707            mechanism.
708    
709            * inst/doc/clustering.Rnw: Adapted vignette for use with
710            ReutNews.rda
711    
712            * man/ReutNews.Rd: Documentation for ReutNews.rda
713    
714            * data/ReutNews.rda: A tiny Reuters21578 example data set.
715    
716    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
719            clustering facilities of this package.
720    
721    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
722    
723            * R/aobjects.R: Changed package document structure to avoid class
724            dependency problems.
725    
726  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
727    
728            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
729            data set.
730    
731          * Finished documentation and reordered directory structure. Now "R          * Finished documentation and reordered directory structure. Now "R
732          CMD check textmin" works without errors.          CMD check textmin" works without errors.
733    

Legend:
Removed from v.28  
changed lines
  Added in v.872

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge