[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC
pkg/ChangeLog revision 924, Fri Apr 3 15:41:48 2009 UTC
1    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/weight.R: Remove weightLogical since it does not return a
4            dgCMatrix.
5    
6            * R/termdocmatrix.R: Further work on new TermDocumentMatrix.
7    
8    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
9    
10            * inst/doc/extensions.Rnw: Finished vignette.
11    
12    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
13    
14            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
15            DocumentTermMatrix representations.
16    
17    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
18    
19            * R/reader.R (readXML): New reader for arbitrary XML files.
20    
21    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
22    
23            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
24            (XMLSource): New XMLSource class for arbitrary XML files.
25            (Source): New slot Vectorized.
26    
27    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
28    
29            * R/reader.R (readCustom): Experimental reader which can be
30            customized via user-defined mappings.
31    
32            * R/reader.R: Always use UTC time zone.
33    
34            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
35    
36    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/reader.R (readDOC): Options can be passed over to antiword.
39    
40            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
41            pdftotext.
42    
43    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
44    
45            * R/source.R (DirSource): Add pattern and ignore.case arguments
46            which are internally passed over to list.files().
47    
48    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
49    
50            * inst/doc/tm.Rnw: Suppress pointless loading message.
51    
52    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
53    
54            * DESCRIPTION: Speed up package loading (via moving packages not
55            strictly necessary for normal operation to Suggests instead of
56            Depends).
57    
58    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
59    
60            * R/reader.R (readNewsgroup): The date format is now configurable.
61    
62    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
63    
64            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
65    
66    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
67    
68            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
69    
70    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
71    
72            * R/source.R (DataframeSource): New source class for data frames.
73    
74            * R/source.R: Fixed non-standard call evaluation.
75    
76    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
77    
78            * R/source.R (URISource): New source class for a single document.
79    
80    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
81    
82            * R/source.R: Refactoring.
83    
84    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
85    
86            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
87            Rmpi installations more gracefully.
88    
89    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
90    
91            * R/source.R (Source): Add Length slot.
92    
93    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
94    
95            * R/AAA.R: Unify duplicated .onLoad function.
96    
97    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
98    
99            * DESCRIPTION (Suggests): Added Rmpi.
100    
101    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
102    
103            * R/source.R (getElem): Fix 'no visible binding' warning.
104    
105            * man/WeightFunction.Rd: Fix signature.
106    
107    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
108    
109            * R/weight.R: Introduce name abbreviations for weighting functions.
110    
111    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
112    
113            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
114    
115            * R/cluster.R: Provide convenience functions for using an MPI
116            cluster.
117    
118            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
119            available.
120    
121            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
122            available.
123    
124    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
125    
126            * R/textdoccol.R (lapply): Removed debug print out.
127    
128    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
129    
130            * R/reader.R (readRCV1): Improved meta data extraction from
131            Reuters Corpus Volume 1 documents.
132    
133    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
134    
135            * R/transform.R: Ensure that all mappings preserve multiline
136            structures.
137    
138    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
139    
140            * R/filter.R: Every filter now has an attribute indicating whether
141            it should be applied at the document level (doclevel).
142    
143            * R/textdoccol.R (tmFilter): Set searchFullText as new default
144            filter.
145    
146    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
147    
148            * R/transform.R (replacePatterns): Replaced removeWords by
149            replacePatterns. Suggested by Christian Buchta.
150    
151            * R/textdoccol.R (inspect): Improved formatting.
152    
153    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * inst/CITATION: Updated JSS article information.
156    
157            * R/textdoccol.R (setAs): Added coerce method from list to
158            corpus.
159    
160            * R/meta.R (meta): Improved meta data handling.
161    
162    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
163    
164            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
165            Christian Buchta.
166    
167            * inst/CITATION: Added template to include JSS article reference.
168    
169    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
170    
171            * R/textdoccol.R (tmMap): Introduced lazy mapping.
172    
173            * R/source.R: Added VectorSource.
174    
175    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * man/: Language codes should be in ISO 639-1 format.
178    
179            * R/textdoccol.R (asPlain): Preserve local meta data.
180    
181    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/textdoccol.R (writeCorpus): Function for writing a corpus
184            containing plain text documents to disk.
185    
186    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
187    
188            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
189            always set correctly.
190    
191            * R/textdoccol.R: Set load = TRUE as default for load on demand
192            since in most cases this is the wanted behaviour.
193    
194    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
197    
198            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
199    
200    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
201    
202            * R/meta.R (meta): New function for consistent access to meta data
203            of document collections, repositories, and texts.
204    
205    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * R/: Better support for encodings.
208    
209    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
212            selection when no reader argument is given.
213    
214    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/source.R (CSVSource): Now uses read.csv instead of scan
217            internally.
218    
219    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
220    
221            * R/reader.R (getReaders): Returns available reader functions.
222    
223            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
224            as default.
225    
226    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
227    
228            * R/stopwords.R (stopwords): Shortened code, removed codetools
229            variable warnings.
230    
231            * man/: Documentation for showMeta, added an example for tmMap.
232    
233            * inst/doc/tm.Rnw: Updated vignette, comments on MS Word reader,
234            some minor typos fixed.
235    
236    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
237    
238            * R/aobjects.R (showMeta): Added method for pretty printing a
239            text document's meta data.
240    
241    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/textdoccol.R (TextDocCol): Better handling of empty
244            arguments.
245    
246            * NAMESPACE: Exported readDOC.
247    
248            * man/completeStems.Rd: Added an example.
249    
250    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
251    
252            * R/stopwords.R (stopwords): Look up .dat files at every
253            call. Allows users to modify stopword .dat files interactively.
254    
255    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
256    
257            * R/termdocmatrix.R (termFreq): Correct processing of empty
258            documents.
259    
260    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * man/: Updated documentation.
263    
264    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
265    
266            * R/complete.R (completeStems): Completes (heuristically) word
267            stems.
268    
269            * R/termdocmatrix.R (TermDocMatrix2): New modular
270            constructor.
271    
272            * NAMESPACE: Exported termFreq.
273    
274    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * R/reader.R (readDOC): Added MS Word reader (using antiword).
277    
278    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/weight.R: Weighting functions for TermDocMatrix.
281    
282    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
285            functions for accessing dimension, column, and row names.
286    
287            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
288    
289    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
292    
293    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
296    
297    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
298    
299            * R/reader.R (readPDF): Removed manual checks for pdftotext and
300            pdfinfo. The system call gives a warning anyway.
301    
302    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * R/textdoccol.R (asPlain): Conversion from
305            StructuredTextDocuments to PlainTextDocuments.
306    
307    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
308    
309            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
310            for accessing term-document matrices.
311    
312            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
313            are installed.
314    
315    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
316    
317            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
318            Christian Buchta.
319    
320    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
321    
322            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
323    
324    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
325    
326            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
327    
328            * R/reader.R (readPDF): Added PDF reader.
329    
330    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
333    
334            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
335    
336            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
337    
338            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
339    
340    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
341    
342            * R/distmeasure.R (dissimilarity): Replaced dists call from
343            package cba by new dist call from package proxy.
344    
345    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
348    
349    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
350    
351            * R/termdocmatrix.R: require() uses the quietly option to suppress
352            loading messages.
353    
354    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
355    
356            * R/dictionary.R: Added dictionary support.
357    
358    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
359    
360            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
361            documents. This simplifies some functions, e.g., asPlain.
362    
363    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
364    
365            * inst/doc/tm.Rnw: Fixed some typos in vignette.
366    
367    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
368    
369            * R/textdoccol.R (replaceWords): Added method to replace a set of
370            words by a single word. Useful for synonyms.
371    
372    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
373    
374            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
375    
376    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
379            vectors. Thanks to Ariel Maguyon for his error report.
380            (removeSparseTerms): New function to remove columns from a
381            term-document matrix exceeding a sparse factor.
382    
383    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
386    
387    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * man/sFilter.Rd: Corrected documentation on statement format (use
390            '==' instead of '=').
391    
392    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
393    
394            * R/aobjects.R (StructuredTextDocument): Inherits from
395            TextDocument.
396    
397    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
398    
399            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
400            on sparse matrices as proposed by Martin Maechler.
401    
402    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
403    
404            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
405            \pkg{filehash} version deprecates them.
406    
407    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/termdocmatrix.R (textvector): Stemming is now performed before
410            erasing stopwords.
411            (weightMatrix): Adapted to handle sparse matrices.
412            (TermDocMatrix): Sparse matrix is now efficiently built by
413            direct stepwise insertion of row values into it.
414    
415    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
416    
417            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
418            due to ongoing problems. For our purposes the latter is as useful
419            as the replaced package.
420    
421    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
422    
423            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
424    
425            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
426    
427    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
428    
429            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
430            languages with available stopwords.
431    
432    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
433    
434            * inst/doc/tm.Rnw: Minor corrections in the vignette.
435    
436    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
437    
438            * DESCRIPTION: Update to version 0.2, since a lot of new features
439            have been integrated.
440    
441            * inst/stopwords: Updated existing stopwords and added stopwords
442            for various other languages.
443    
444    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
445    
446            * man/: Updated documentation.
447    
448            * Work/testDb.R: Script to test database stuff.
449    
450            * R/: Fixed various database related bugs. Seems to be rather
451            useable now, i.e., consider as alpha status for now.
452    
453    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
454    
455            * R/: Fixed some bugs related to database support.
456    
457    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
458    
459            * man/: Added a lot of examples to the manuals.
460    
461    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
462    
463            * man/: Updated parts of the documentation.
464    
465            * R/textdoccol.R (asPlain): Added conversion from newsgroup
466            documents to plain text documents.
467    
468    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * R/textdoccol.R: Finished experimental database support. Not yet
471            intensively tested.
472    
473            * R/source.R: Now each source has a default reader.
474    
475            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
476            class anymore.
477    
478            * R/plaintextdoc.R: Custom show method for plain text documents.
479    
480            * R/aobjects.R: Added a class for structured text documents.
481    
482            * R/reader.R: Replaced remaining \code{parser} occurrences with
483            \code{reader}.
484    
485            * R/textdoccol.R (summary): Indent tags.
486    
487            * R/textdoccol.R (removePunctuation): Transform method to remove
488            punctuation marks.
489    
490    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
491    
492            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
493            using prescindMeta().
494    
495    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
496    
497            * R/textdoccol.R: Improved database support.
498    
499    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
500    
501            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
502    
503            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
504            language code.
505    
506            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
507            into parserControl argument.
508    
509            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
510    
511    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
512    
513            * Work/tmDataSetup.R: The datasets acq and crude can now be
514            created on the fly.
515    
516            * R/stopwords.R: Introduced a function returning the stopwords for
517            a given language (English, German and French at the moment)
518    
519            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
520            otherwise falls back to Snowball package.
521    
522    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
523    
524            * man/dissimilarity-methods.Rd: Make clear that any method offered
525            by "dists" from package "cba" can be used.
526    
527    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
528    
529            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
530            to Kurt's LaTeX suggestion. Removed points and underscores in
531            variable names for consistent naming.
532    
533            * DESCRIPTION: Update to version 0.1-2.
534    
535            * man/TextRepository.Rd: Fixed bug in documentation.
536    
537    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * DESCRIPTION: Update to version 0.1-1.
540    
541    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
542    
543            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
544            wordStem.
545    
546    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
547    
548            * R/: Changes due to Kurt's review.
549    
550    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
551    
552            * R/: Implemented improvements based upon comments by David
553            Meyer.
554    
555    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
556    
557            * inst/doc/: Rewrote vignette.
558    
559            * man/: Improved documentation.
560    
561    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
562    
563            * man/: Updated documentation.
564    
565            * DESCRIPTION: Changed package name to "tm". Updated version to
566            0.1 for first CRAN release.
567    
568            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
569            list archive example.
570    
571            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
572            archive example.
573    
574            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
575            from (several mails per box) mbox format to (single mail per file)
576            eml format.
577    
578    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * data/crude.rda: Rebuilt.
581    
582            * data/acq.rda: Rebuilt.
583    
584            * R/reader.R: Factored out reader and parser methods from
585            textdoccol.R.
586    
587            * R/source.R: Factored out Source methods from aobjects.R and
588            textdoccol.R.
589            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
590            feeds.
591    
592            * R/textdoccol.R (DirSource): Added support for recursive
593            traversal of directories.
594    
595    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
596    
597            * R/textdoccol.R ([[): Loads the document corpus automatically
598            into memory upon access.
599            (tm_transform, tm_filter): Removed several checks whether the
600            document is already loaded ([[ ensures this now).
601            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
602            mailing list archive.
603    
604    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
605    
606            * R/aobjects.R (TextDocument): Is now a virtual class.
607            (Source): Is now a virtual class.
608    
609    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
610    
611            * R/textdoccol.R (c): Support for an arbitrary number of document
612            collections.
613    
614    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
615    
616            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
617            append_meta and remove_meta.
618    
619            * R/textdoccol.R: Removed modify_metadata method.
620    
621            * R/textrepo.R: Removed modify_metadata method.
622    
623            * R/textdoccol.R (remove_meta): Supports removal of document
624            collection metadata and document (= in data frame) metadata.
625    
626    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
627    
628            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
629    
630            * data/crude.rda: Rebuilt.
631    
632            * data/acq.rda: Rebuilt.
633    
634            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
635    
636            * R/textdoccol.R ([): Bug fix for subsetting a document
637            collection's data frame.
638    
639    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
640    
641            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
642            to s_filter.
643    
644            * R/textdoccol.R: Local text documents' metadata can now be copied
645            to a document collection's data frame with prescind_meta.
646    
647    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
648    
649            * R/: Text documents' slot metadata is now accessible in s_filter.
650    
651            * R/: Rewrote s_filter function (has still some restrictions).
652    
653    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
654    
655            * R/: Various fixes in handling metadata.
656    
657            * R/: Added update mechanism for text document collections.
658    
659    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/: Merging of document collections now creates a binary tree
662            for reconstructing merged document collections.
663    
664            * R/: Redesign of metadata for document collections.
665    
666    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
667    
668            * R/: Messages now use \code{ngettext}.
669    
670    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
671    
672            * R/: Added functions for modifying and removing metadata.
673    
674    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
675    
676            * man/: Updated some documentation.
677    
678            * R/: Corrected some connection issues.
679    
680            * inst/doc: Worked on the vignette.
681    
682    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
683    
684            * inst/: Added texts and started vignette.
685    
686            * R/: Final changes based upon David's comments.
687    
688    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
689    
690            * NAMESPACE: Corrected exports (generic methods need exportMethods
691            directives!).
692    
693    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
694    
695            * R/: Modified the TextDocCol constructor and various parsers. It
696            is now modular and supports various file formats via plugins (see
697            the new "Source" class).
698    
699    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            * man/: Revised documentation after previous code changes.
702    
703    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
704    
705            * R/: Remaining changes as discussed with David.
706    
707    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
708    
709            * R/: Some changes as suggested by David. The rest will follow
710            within the next days.
711    
712    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
713    
714            * man/: Finished documentation.
715    
716    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            * man/: Wrote some documentation.
719    
720    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
721    
722            * R/: Further syntactic sugar in form of additional assignment and
723            accessor methods.
724    
725    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
726    
727            * R/: Syntactic sugar in form of "length", "show" and "summary"
728            operators.
729    
730    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
731    
732            * R/: Diverse updates. Mainly on default operators ("[" or "c")
733            and dissimilarities.
734    
735    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/: Added similarity functions.
738    
739            * data/: Added English stopwords.
740    
741    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
742    
743            * data/: Examples compiled for new features
744    
745            * R/: Changes due to new structure.
746    
747            * NAMESPACE: Corrected namespace to reflect new structure.
748    
749            * R/termdocmatrix.R: Adapted for new naming scheme.
750    
751    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * R/textdoccol.R: Adapted code for new class structure. Wrote
754            several transform and filter functions operating on text document
755            collections (alias text document databases).
756    
757            * R/aobjects.R: Adapted class structure with inheritance,
758            repositories and additional meta data. Loading files on demand is
759            now possible.
760    
761    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            * R/: Some cosmetic cleanups.
764    
765            * inst/: Removed vignette on clustering. That and much more is now
766            described in the JSS paper on text mining. Based upon that
767            article an elaborated vignette will be incorporated in the future.
768    
769    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
770    
771            * R/: Updated generic S4 methods to comply with signature changes
772            in newer versions of R (> 2.3)
773    
774    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
775    
776            * ext/R/importRIS.R: Automatic RIS import is now possible.
777    
778    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
779    
780            * R/textdoccol.R: Added RIS HTML input format.
781    
782    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
783    
784            * R/textdoccol.R: Removed bug that caused invalid text document
785            collections when handling many input files.
786    
787    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
788    
789            * R/textdoccol.R: Restructured and extended file import
790            mechanism.
791    
792            * inst/doc/clustering.Rnw: Adapted vignette for use with
793            ReutNews.rda
794    
795            * man/ReutNews.Rd: Documentation for ReutNews.rda
796    
797            * data/ReutNews.rda: A tiny Reuters21578 example data set.
798    
799    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
802            clustering facilities of this package.
803    
804    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            * R/aobjects.R: Changed package document structure to avoid class
807            dependency problems.
808    
809    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
810    
811            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
812            data set.
813    
814            *  Finished documentation and reordered directory structure. Now "R
815            CMD check textmin" works without errors.
816    
817    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
818    
819            * src/: Various splits can now be easily created for the
820            Reuters21578 data set.
821    
822    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
823    
824            *  Updated documentation
825    
826    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
827    
828            *  Wrote R documentation for some classes and methods.
829    
830    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
831    
832            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
833            files. See the questionnaire data/Umfrage.csv for such an example.
834            We are now able to import files in Reuters-21578 XML format.
835    
836            *  Changed class interfaces in various files. Weighting of the text
837            matrix is now possible.
838    
839    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
840    
841            * R/textdoccol.R: One can build term-document matrices if
842            necessary (with buildTDM(...)) and fill the field tdm from a text
843            document collection with it.
844    
845            * R/textmatrix.R: Wrote S4 class for term-document matrices.
846    
847    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
848    
849            * R/textdoccol.R: We now can read in a whole XML file with several
850            news items.
851    
852    2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
853    
854            * R/textdoccol.R: Set up an S4 class for a collection of text
