[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC
pkg/ChangeLog revision 925, Fri Apr 3 17:39:44 2009 UTC
# Line 1  Line 1 
1    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/weight.R: Remove weightLogical since it does not return a
4            dgCMatrix.
5    
6            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
7            or TermDocumentMatrix instead.
8    
9    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
10    
11            * inst/doc/extensions.Rnw: Finished vignette.
12    
13    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
14    
15            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
16            DocumentTermMatrix representations.
17    
18    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
19    
20            * R/reader.R (readXML): New reader for arbitrary XML files.
21    
22    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
23    
24            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
25            (XMLSource): New XMLSource class for arbitrary XML files.
26            (Source): New slot Vectorized.
27    
28    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
29    
30            * R/reader.R (readCustom): Experimental reader which can be
31            customized via user-defined mappings.
32    
33            * R/reader.R: Always use UTC time zone.
34    
35            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
36    
37    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
38    
39            * R/reader.R (readDOC): Options can be passed over to antiword.
40    
41            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
42            pdftotext.
43    
44    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
45    
46            * R/source.R (DirSource): Add pattern and ignore.case arguments
47            which are internally passed over to list.files().
48    
49    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
50    
51            * inst/doc/tm.Rnw: Suppress pointless loading message.
52    
53    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
54    
55            * DESCRIPTION: Speed up package loading (via moving packages not
56            strictly necessary for normal operation to Suggests instead of
57            Depends).
58    
59    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
60    
61            * R/reader.R (readNewsgroup): The date format is now configurable.
62    
63    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
64    
65            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
66    
67    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
68    
69            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
70    
71    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
72    
73            * R/source.R (DataframeSource): New source class for data frames.
74    
75            * R/source.R: Fixed non-standard call evaluation.
76    
77    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
78    
79            * R/source.R (URISource): New source class for a single document.
80    
81    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
82    
83            * R/source.R: Refactoring.
84    
85    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
86    
87            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
88            Rmpi installations more gracefully.
89    
90    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/source.R (Source): Add Length slot.
93    
94    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
95    
96            * R/AAA.R: Unify duplicated .onLoad function.
97    
98    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
99    
100            * DESCRIPTION (Suggests): Added Rmpi.
101    
102    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
103    
104            * R/source.R (getElem): Fix 'no visible binding' warning.
105    
106            * man/WeightFunction.Rd: Fix signature.
107    
108    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
109    
110            * R/weight.R: Introduce name abbreviations for weighting functions.
111    
112    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
113    
114            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
115    
116            * R/cluster.R: Provide convenience functions for using an MPI
117            cluster.
118    
119            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
120            available.
121    
122            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
123            available.
124    
125    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
126    
127            * R/textdoccol.R (lapply): Removed debug print out.
128    
129    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
130    
131            * R/reader.R (readRCV1): Improved meta data extraction from
132            Reuters Corpus Volume 1 documents.
133    
134    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
135    
136            * R/transform.R: Ensure that all mappings preserve multiline
137            structures.
138    
139    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
140    
141            * R/filter.R: Every filter now has an attribute indicating whether
142            it should be applied at the document level (doclevel).
143    
144            * R/textdoccol.R (tmFilter): Set searchFullText as new default
145            filter.
146    
147    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
148    
149            * R/transform.R (replacePatterns): Replaced removeWords by
150            replacePatterns. Suggested by Christian Buchta.
151    
152            * R/textdoccol.R (inspect): Improved formatting.
153    
154    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
155    
156            * inst/CITATION: Updated JSS article information.
157    
158            * R/textdoccol.R (setAs): Added coerce method from list to
159            corpus.
160    
161            * R/meta.R (meta): Improved meta data handling.
162    
163    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
164    
165            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
166            Christian Buchta.
167    
168            * inst/CITATION: Added template to include JSS article reference.
169    
170    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/textdoccol.R (tmMap): Introduced lazy mapping.
173    
174            * R/source.R: Added VectorSource.
175    
176    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
177    
178            * man/: Language codes should be in ISO 639-1 format.
179    
180            * R/textdoccol.R (asPlain): Preserve local meta data.
181    
182    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
183    
184            * R/textdoccol.R (writeCorpus): Function for writing a corpus
185            containing plain text documents to disk.
186    
187    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
190            always set correctly.
191    
192            * R/textdoccol.R: Set load = TRUE as default for load on demand
193            since in most cases this is the wanted behaviour.
194    
195    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
198    
199            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
200    
201    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
202    
203            * R/meta.R (meta): New function for consistent access to meta data
204            of document collections, repositories, and texts.
205    
206    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
207    
208            * R/: Better support for encodings.
209    
210    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
213            selection when no reader argument is given.
214    
215    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
216    
217            * R/source.R (CSVSource): Now uses read.csv instead of scan
218            internally.
219    
220    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/reader.R (getReaders): Returns available reader functions.
223    
224            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
225            as default.
226    
227    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
228    
229            * R/stopwords.R (stopwords): Shortened code, removed codetools
230            variable warnings.
231    
232            * man/: Documentation for showMeta, added an example for tmMap.
233    
234            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
235            some minor typos fixed.
236    
237    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
238    
239            * R/aobjects.R (showMeta): Added method for pretty printing a
240            text document's meta data.
241    
242    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
243    
244            * R/textdoccol.R (TextDocCol): Better handling of empty
245            arguments.
246    
247            * NAMESPACE: Exported readDOC.
248    
249            * man/completeStems.Rd: Added an example.
250    
251    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * R/stopwords.R (stopwords): Look up .dat files at every
254            call. Allows users to modify stopword .dat files interactively.
255    
256    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * R/termdocmatrix.R (termFreq): Correct processing of empty
259            documents.
260    
261    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * man/: Updated documentation.
264    
265    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/complete.R (completeStems): Completes (heuristically) word
268            stems.
269    
270            * R/termdocmatrix.R (TermDocMatrix2): New modular
271            constructor.
272    
273            * NAMESPACE: Exported termFreq.
274    
275    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/reader.R (readDOC): Added MS Word reader (using antiword).
278    
279    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/weight.R: Weighting functions for TermDocMatrix.
282    
283    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
284    
285            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
286            functions for accessing dimension, column, and row names.
287    
288            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
289    
290    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
291    
292            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
293    
294    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
295    
296            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
297    
298    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * R/reader.R (readPDF): Removed manual checks for pdftotext and
301            pdfinfo. The system call gives a warning anyway.
302    
303    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
304    
305            * R/textdoccol.R (asPlain): Conversion from
306            StructuredTextDocuments to PlainTextDocuments.
307    
308    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
311            for accessing term-document matrices.
312    
313            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
314            are installed.
315    
316    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
317    
318            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
319            Christian Buchta.
320    
321    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
324    
325    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
326    
327            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
328    
329            * R/reader.R (readPDF): Added PDF reader.
330    
331    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
332    
333            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
334    
335            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
336    
337            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
338    
339            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
340    
341    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * R/distmeasure.R (dissimilarity): Replaced dists call from
344            package cba by new dist call from package proxy.
345    
346    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
347    
348            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
349    
350    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * R/termdocmatrix.R: require() uses the quietly option to suppress
353            loading messages.
354    
355    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * R/dictionary.R: Added dictionary support.
358    
359    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
360    
361            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
362            documents. This simplifies some functions, e.g., asPlain.
363    
364    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * inst/doc/tm.Rnw: Fixed some typos in vignette.
367    
368    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
369    
370            * R/textdoccol.R (replaceWords): Added method to replace a set of
371            words by a single word. Useful for synonyms.
372    
373    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
376    
377    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
378    
379            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
380            vectors. Thanks to Ariel Maguyon for his error report.
381            (removeSparseTerms): New function to remove columns from a
382            term-document matrix exceeding a sparse factor.
383    
384    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
385    
386            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
387    
388    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
389    
390            * man/sFilter.Rd: Corrected documentation on statement format (use
391            '==' instead of '=').
392    
393    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * R/aobjects.R (StructuredTextDocument): Inherits from
396            TextDocument.
397    
398    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
399    
400            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
401            on sparse matrices as proposed by Martin Maechler.
402    
403    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
404    
405            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
406            \pkg{filehash} version deprecates them.
407    
408    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
409    
410            * R/termdocmatrix.R (textvector): Stemming is now performed before
411            erasing stopwords.
412            (weightMatrix): Adapted to handle sparse matrices.
413            (TermDocMatrix): Sparse matrix is now efficiently built by
414            direct stepwise insertion of row values into it.
415    
416    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
419            due to ongoing problems. For our purposes the latter is as useful
420            as the replaced package.
421    
422    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
425    
426            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
427    
428    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
429    
430            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
431            languages with available stopwords.
432    
433    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
434    
435            * inst/doc/tm.Rnw: Minor corrections in the vignette.
436    
437    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * DESCRIPTION: Update to version 0.2, since a lot of new features
440            have been integrated.
441    
442            * inst/stopwords: Updated existing stopwords and added stopwords
443            for various other languages.
444    
445    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
446    
447            * man/: Updated documentation.
448    
449            * Work/testDb.R: Script to test database stuff.
450    
451            * R/: Fixed various database related bugs. Seems to be rather
452            useable now, i.e., consider as alpha status for now.
453    
454    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
455    
456            * R/: Fixed some bugs related to database support.
457    
458    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
459    
460            * man/: Added a lot of examples to the manuals.
461    
462    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
463    
464            * man/: Updated parts of the documentation.
465    
466            * R/textdoccol.R (asPlain): Added conversion from newsgroup
467            documents to plain text documents.
468    
469    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
470    
471            * R/textdoccol.R: Finished experimental database support. Not yet
472            intensively tested.
473    
474            * R/source.R: Now each source has a default reader.
475    
476            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
477            class anymore.
478    
479            * R/plaintextdoc.R: Custom show method for plain text documents.
480    
481            * R/aobjects.R: Added a class for structured text documents.
482    
483            * R/reader.R: Replaced remaining \code{parser} occurrences with
484            \code{reader}.
485    
486            * R/textdoccol.R (summary): Indent tags.
487    
488            * R/textdoccol.R (removePunctuation): Transform method to remove
489            punctuation marks.
490    
491    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
492    
493            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
494            using prescindMeta().
495    
496    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
497    
498            * R/textdoccol.R: Improved database support.
499    
500    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
501    
502            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
503    
504            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
505            language code.
506    
507            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
508            into parserControl argument.
509    
510            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
511    
512    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
513    
514            * Work/tmDataSetup.R: The datasets acq and crude can now be
515            created on the fly.
516    
517            * R/stopwords.R: Introduced a function returning the stopwords for
518            a given language (English, German and French at the moment).
519    
520            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
521            otherwise falls back to Snowball package.
522    
523    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
524    
525            * man/dissimilarity-methods.Rd: Make clear that any method offered
526            by "dists" from package "cba" can be used.
527    
528    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
529    
530            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
531            to Kurt's latex suggestion. Removed points and underscores in
532            variable names for consistent naming.
533    
534            * DESCRIPTION: Update to version 0.1-2.
535    
536            * man/TextRepository.Rd: Fixed bug in documentation.
537    
538    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
539    
540            * DESCRIPTION: Update to version 0.1-1.
541    
542    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
543    
544            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
545            wordStem.
546    
547    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * R/: Changes due to Kurt's review.
550    
551    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
552    
553            * R/: Implemented improvements based upon comments by David
554            Meyer.
555    
556    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * inst/doc/: Rewrote vignette.
559    
560            * man/: Improved documentation.
561    
562    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
563    
564            * man/: Updated documentation.
565    
566            * DESCRIPTION: Changed package name to "tm". Updated version to
567            0.1 for first CRAN release.
568    
569            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
570            list archive example.
571    
572            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
573            archive example.
574    
575            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
576            from (several mails per box) mbox format to (single mail per file)
577            eml format.
578    
579    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
580    
581            * data/crude.rda: Rebuilt.
582    
583            * data/acq.rda: Rebuilt.
584    
585            * R/reader.R: Factored out reader and parser methods from
586            textdoccol.R.
587    
588            * R/source.R: Factored out Source methods from aobjects.R and
589            textdoccol.R.
590            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
591            feeds.
592    
593            * R/textdoccol.R (DirSource): Added support for recursive
594            traversal of directories.
595    
596    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
597    
598            * R/textdoccol.R ([[): Loads the document corpus automatically
599            into memory upon access.
600            (tm_transform, tm_filter): Removed several checks whether the
601            document is already loaded ([[ ensures this now).
602            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
603            mailing list archive.
604    
605    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            * R/aobjects.R (TextDocument): Is now a virtual class.
608            (Source): Is now a virtual class.
609    
610    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/textdoccol.R (c): Support for an arbitrary number of document
613            collections.
614    
615    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
616    
617            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
618            append_meta and remove_meta.
619    
620            * R/textdoccol.R: Removed modify_metadata method.
621    
622            * R/textrepo.R: Removed modify_metadata method.
623    
624            * R/textdoccol.R (remove_meta): Supports removal of document
625            collection metadata and document (= in data frame) metadata.
626    
627    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
628    
629            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
630    
631            * data/crude.rda: Rebuilt.
632    
633            * data/acq.rda: Rebuilt.
634    
635            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
636    
637            * R/textdoccol.R ([): Bug fix for subsetting a document
638            collection's data frame.
639    
640    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
641    
642            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
643            to s_filter.
644    
645            * R/textdoccol.R: Local text documents' metadata can now be copied
646            to a document collection's data frame with prescind_meta.
647    
648    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
649    
650            * R/: Text documents' slot metadata is now accessible in s_filter.
651    
652            * R/: Rewrote s_filter function (still has some restrictions).
653    
654    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/: Various fixes in handling metadata.
657    
658            * R/: Added update mechanism for text document collections.
659    
660    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
661    
662            * R/: Merging of document collections now creates a binary tree
663            for reconstructing merged document collections.
664    
665            * R/: Redesign of metadata for document collections.
666    
667    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
668    
669            * R/: Messages now use \code{ngettext}.
670    
671    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
672    
673            * R/: Added functions for modifying and removing metadata.
674    
675    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
676    
677            * man/: Updated some documentation.
678    
679            * R/: Corrected some connection issues.
680    
681            * inst/doc: Worked on the vignette.
682    
683    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
684    
685            * inst/: Added texts and started vignette.
686    
687            * R/: Final changes based upon David's comments.
688    
689    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
690    
691            * NAMESPACE: Corrected exports (generic methods need exportMethods
692            directives!).
693    
694    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * R/: Modified the TextDocCol constructor and various parsers. It
697            is now modular and supports various file formats via plugins (see
698            the new "Source" class).
699    
700    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * man/: Revised documentation after previous code changes.
703    
704    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * R/: Remaining changes as discussed with David.
707    
708    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * R/: Some changes as suggested by David. The rest will follow
711            within the next few days.
712    
713    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * man/: Finished documentation.
716    
717    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
718    
719            * man/: Wrote some documentation.
720    
721    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
722    
723            * R/: Further syntactic sugar in form of additional assignment and
724            accessor methods.
725    
726    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
727    
728            * R/: Syntactic sugar in form of "length", "show" and "summary"
729            operators.
730    
731    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
732    
733            * R/: Diverse updates. Mainly on default operators ("[" or "c")
734            and dissimilarities.
735    
736    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
737    
738            * R/: Added similarity functions.
739    
740            * data/: Added English stopwords.
741    
742    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
743    
744            * data/: Examples compiled for new features
745    
746            * R/: Changes due to new structure.
747    
748            * NAMESPACE: Corrected namespace to reflect new structure.
749    
750            * R/termdocmatrix.R: Adapted for new naming scheme.
751    
752    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
753    
754            * R/textdoccol.R: Adapted code for new class structure. Wrote
755            several transform and filter functions operating on text document
756            collections (alias text document databases).
757    
758            * R/aobjects.R: Adapted class structure with inheritance,
759            repositories and additional meta data. Loading files on demand is
760            now possible.
761    
762    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/: Some cosmetic cleanups.
765    
766            * inst/: Removed vignette on clustering. That and much more is now
767            described in the JSS paper on text mining. Based upon that
768            article an elaborated vignette will be incorporated in the future.
769    
770    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
771    
772            * R/: Updated generic S4 methods to comply with signature changes
773            in newer versions of R (> 2.3)
774    
775    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
776    
777            * ext/R/importRIS.R: Automatic RIS import is now possible.
778    
779    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * R/textdoccol.R: Added RIS HTML input format.
782    
783    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
784    
785            * R/textdoccol.R: Removed bug that caused invalid text document
786            collections when handling many input files.
787    
788    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
789    
790            * R/textdoccol.R: Restructured and extended file import
791            mechanism.
792    
793            * inst/doc/clustering.Rnw: Adapted vignette for use with
794            ReutNews.rda
795    
796            * man/ReutNews.Rd: Documentation for ReutNews.rda
797    
798            * data/ReutNews.rda: A tiny Reuters21578 example data set.
799    
800    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
801    
802            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
803            clustering facilities of this package.
804    
805    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
806    
807            * R/aobjects.R: Changed package document structure to avoid class
808            dependency problems.
809    
810    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
811    
812            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
813            data set.
814    
815            * Finished documentation and reordered directory structure. Now "R
816            CMD check textmin" works without errors.
817    
