[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC
trunk/tm/ChangeLog revision 861, Thu Jul 24 09:55:09 2008 UTC
1    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
4    
5            * R/cluster.R: Provide convenience functions for using an MPI
6            cluster.
7    
8            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
9            available.
10    
11            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
12            available.
13    
14    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/textdoccol.R (lapply): Removed debug print out.
17    
18    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
19    
20            * R/reader.R (readRCV1): Improved meta data extraction from
21            Reuters Corpus Volume 1 documents.
22    
23    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
24    
25            * R/transform.R: Ensure that all mappings preserve multiline
26            structures.
27    
28    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
29    
30            * R/filter.R: Every filter now has an attribute indicating whether
31            it should be applied at document level (doclevel).
32    
33            * R/textdoccol.R (tmFilter): Set searchFullText as new default
34            filter.
35    
36    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
37    
38            * R/transform.R (replacePatterns): Replaced removeWords by
39            replacePatterns. Suggested by Christian Buchta.
40    
41            * R/textdoccol.R (inspect): Improved formatting.
42    
43    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
44    
45            * inst/CITATION: Updated JSS article information.
46    
47            * R/textdoccol.R (setAs): Added coerce method from list to
48            corpus.
49    
50            * R/meta.R (meta): Improved meta data handling.
51    
52    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
53    
54            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
55            Christian Buchta.
56    
57            * inst/CITATION: Added template to include JSS article reference.
58    
59    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
60    
61            * R/textdoccol.R (tmMap): Introduced lazy mapping.
62    
63            * R/source.R: Added VectorSource.
64    
65    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
66    
67            * man/: Language codes should be in ISO 639-1 format.
68    
69            * R/textdoccol.R (asPlain): Preserve local meta data.
70    
71    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
72    
73            * R/textdoccol.R (writeCorpus): Function for writing a corpus
74            containing plain text documents to disk.
75    
76    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
77    
78            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
79            always set correctly.
80    
81            * R/textdoccol.R: Set load = TRUE as default for load on demand
82            since in most cases this is the wanted behaviour.
83    
84    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
85    
86            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
87    
88            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
89    
90    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
91    
92            * R/meta.R (meta): New function for consistent access to meta data
93            of document collections, repositories, and texts.
94    
95    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
96    
97            * R/: Better support for encodings.
98    
99    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
100    
101            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
102            selection when no reader argument is given.
103    
104    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
105    
106            * R/source.R (CSVSource): Now uses read.csv instead of scan
107            internally.
108    
109    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
110    
111            * R/reader.R (getReaders): Returns available reader functions.
112    
113            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
114            as default.
115    
116    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
117    
118            * R/stopwords.R (stopwords): Shortened code, removed codetools
119            variable warnings.
120    
121            * man/: Documentation for showMeta, added an example for tmMap.
122    
123            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
124            some minor typos fixed.
125    
126    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * R/aobjects.R (showMeta): Added method for pretty printing a
129            text document's meta data.
130    
131    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
132    
133            * R/textdoccol.R (TextDocCol): Better handling of empty
134            arguments.
135    
136            * NAMESPACE: Exported readDOC.
137    
138            * man/completeStems.Rd: Added an example.
139    
140    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
141    
142            * R/stopwords.R (stopwords): Look up .dat files at every
143            call. Allows users to modify stopword .dat files interactively.
144    
145    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
146    
147            * R/termdocmatrix.R (termFreq): Correct processing of empty
148            documents.
149    
150    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
151    
152            * man/: Updated documentation.
153    
154    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
155    
156            * R/complete.R (completeStems): Completes (heuristically) word
157            stems.
158    
159            * R/termdocmatrix.R (TermDocMatrix2): New modular
160            constructor.
161    
162            * NAMESPACE: Exported termFreq.
163    
164    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * R/reader.R (readDOC): Added MS Word reader (using antiword).
167    
168    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * R/weight.R: Weighting functions for TermDocMatrix.
171    
172    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
175            functions for accessing dimension, column, and row names.
176    
177            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
178    
179    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
182    
183    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
186    
187    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/reader.R (readPDF): Removed manual checks for pdftotext and
190            pdfinfo. The system call gives a warning anyway.
191    
192    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
193    
194            * R/textdoccol.R (asPlain): Conversion from
195            StructuredTextDocuments to PlainTextDocuments.
196    
197    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
200            for accessing term-document matrices.
201    
202            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
203            are installed.
204    
205    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
208            Christian Buchta.
209    
210    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
213    
214    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
217    
218            * R/reader.R (readPDF): Added PDF reader.
219    
220    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
223    
224            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
225    
226            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
227    
228            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
229    
230    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
231    
232            * R/distmeasure.R (dissimilarity): Replaced dists call from
233            package cba by new dist call from package proxy.
234    
235    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
238    
239    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * R/termdocmatrix.R: require() uses the quietly option to suppress
242            loading messages.
243    
244    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/dictionary.R: Added dictionary support.
247    
248    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
251            documents. This simplifies some functions, e.g., asPlain.
252    
253    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * inst/doc/tm.Rnw: Fixed some typos in vignette.
256    
257    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/textdoccol.R (replaceWords): Added method to replace a set of
260            words by a single word. Useful for synonyms.
261    
262    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
265    
266    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
267    
268            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
269            vectors. Thanks to Ariel Maguyon for his error report.
270            (removeSparseTerms): New function to remove columns from a
271            term-document matrix exceeding a sparse factor.
272    
273    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
276    
277    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * man/sFilter.Rd: Corrected documentation on statement format (use
280            '==' instead of '=').
281    
282    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/aobjects.R (StructuredTextDocument): Inherits from
285            TextDocument.
286    
287    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
288    
289            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
290            on sparse matrices as proposed by Martin Maechler.
291    
292    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
293    
294            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
295            \pkg{filehash} version makes them deprecated.
296    
297    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
298    
299            * R/termdocmatrix.R (textvector): Stemming is now performed before
300            erasing stopwords.
301            (weightMatrix): Adapted to handle sparse matrices.
302            (TermDocMatrix): Sparse matrix is now efficiently built by
303            direct stepwise insertion of row values into it.
304    
305    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
306    
307            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
308            due to ongoing problems. For our purposes the latter is as useful
309            as the replaced package.
310    
311    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
314    
315            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
316    
317    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
320            languages with available stopwords.
321    
322    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * inst/doc/tm.Rnw: Minor corrections in the vignette.
325    
326    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * DESCRIPTION: Update to version 0.2, since a lot of new features
329            have been integrated.
330    
331            * inst/stopwords: Updated existing stopwords and added stopwords
332            for various other languages.
333    
334    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * man/: Updated documentation.
337    
338            * Work/testDb.R: Script to test database stuff.
339    
340            * R/: Fixed various database related bugs. Seems to be rather
341            useable now, i.e., consider as alpha status for now.
342    
343    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/: Fixed some bugs related to database support.
346    
347    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * man/: Added a lot of examples to the manuals.
350    
351    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353            * man/: Updated parts of the documentation.
354    
355            * R/textdoccol.R (asPlain): Added conversion from newsgroup
356            documents to plain text documents.
357    
358    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
359    
360            * R/textdoccol.R: Finished experimental database support. Not yet
361            intensively tested.
362    
363            * R/source.R: Now each source has a default reader.
364    
365            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
366            class anymore.
367    
368            * R/plaintextdoc.R: Custom show method for plain text documents.
369    
370            * R/aobjects.R: Added a class for structured text documents.
371    
372            * R/reader.R: Replaced remaining \code{parser} occurrences with
373            \code{reader}.
374    
375            * R/textdoccol.R (summary): Indent tags.
376    
377            * R/textdoccol.R (removePunctuation): Transform method to remove
378            punctuation marks.
379    
380    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
383            using prescindMeta().
384    
385    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * R/textdoccol.R: Improved database support.
388    
389    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
392    
393            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
394            language code.
395    
396            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
397            into parserControl argument.
398    
399            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
400    
401    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
402    
403            * Work/tmDataSetup.R: The datasets acq and crude can now be
404            created on the fly.
405    
406            * R/stopwords.R: Introduced a function returning the stopwords for
407            a given language (English, German and French at the moment)
408    
409            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
410            otherwise falls back to Snowball package.
411    
412    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * man/dissimilarity-methods.Rd: Make clear that any method offered
415            by "dists" from package "cba" can be used.
416    
417    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
420            to Kurt's LaTeX suggestion. Removed points and underscores in
421            variable names for consistent naming.
422    
423            * DESCRIPTION: Update to version 0.1-2.
424    
425            * man/TextRepository.Rd: Fixed bug in documentation.
426    
427    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
428    
429            * DESCRIPTION: Update to version 0.1-1.
430    
431    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
432    
433            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
434            wordStem.
435    
436    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
437    
438            * R/: Changes due to Kurt's review.
439    
440    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
441    
442            * R/: Implemented improvements based upon comments by David
443            Meyer.
444    
445    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
446    
447            * inst/doc/: Rewrote vignette.
448    
449            * man/: Improved documentation.
450    
451    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
452    
453            * man/: Updated documentation.
454    
455            * DESCRIPTION: Changed package name to "tm". Updated version to
456            0.1 for first CRAN release.
457    
458            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
459            list archive example.
460    
461            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
462            archive example.
463    
464            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
465            from (several mails per box) mbox format to (single mail per file)
466            eml format.
467    
468    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * data/crude.rda: Rebuilt.
471    
472            * data/acq.rda: Rebuilt.
473    
474            * R/reader.R: Factored out reader and parser methods from
475            textdoccol.R.
476    
477            * R/source.R: Factored out Source methods from aobjects.R and
478            textdoccol.R.
479            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
480            feeds.
481    
482            * R/textdoccol.R (DirSource): Added support for recursive
483            traversal of directories.
484    
485    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
486    
487            * R/textdoccol.R ([[): Loads the document corpus automatically
488            into memory upon access.
489            (tm_transform, tm_filter): Removed several checks whether the
490            document is already loaded ([[ ensures this now).
491            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
492            mailing list archive.
493    
494    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
495    
496            * R/aobjects.R (TextDocument): Is now a virtual class.
497            (Source): Is now a virtual class.
498    
499    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
500    
501            * R/textdoccol.R (c): Support for an arbitrary number of document
502            collections.
503    
504    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
505    
506            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
507            append_meta and remove_meta.
508    
509            * R/textdoccol.R: Removed modify_metadata method.
510    
511            * R/textrepo.R: Removed modify_metadata method.
512    
513            * R/textdoccol.R (remove_meta): Supports removal of document
514            collection metadata and document (= in data frame) metadata.
515    
516    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
517    
518            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
519    
520            * data/crude.rda: Rebuilt.
521    
522            * data/acq.rda: Rebuilt.
523    
524            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
525    
526            * R/textdoccol.R ([): Bug fix for subsetting a document
527            collection's data frame.
528    
529    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
530    
531            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
532            to s_filter.
533    
534            * R/textdoccol.R: Local text documents' metadata can now be copied
535            to a document collection's data frame with prescind_meta.
536    
537    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * R/: Text documents' slot metadata is now accessible in s_filter.
540    
541            * R/: Rewrote s_filter function (has still some restrictions).
542    
543    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/: Various fixes in handling metadata.
546    
547            * R/: Added update mechanism for text document collections.
548    
549    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
550    
551            * R/: Merging of document collections now creates a binary tree
552            for reconstructing merged document collections.
553    
554            * R/: Redesign of metadata for document collections.
555    
556    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * R/: Messages now use \code{ngettext}.
559    
560    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
561    
562            * R/: Added functions for modifying and removing metadata.
563    
564    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
565    
566            * man/: Updated some documentation.
567    
568            * R/: Corrected some connection issues.
569    
570            * inst/doc: Worked on the vignette.
571    
572    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
573    
574            * inst/: Added texts and started vignette.
575    
576            * R/: Final changes based upon David's comments.
577    
578    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * NAMESPACE: Corrected exports (generic methods need exportMethods
581            directives!).
582    
583    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
584    
585            * R/: Modified the TextDocCol constructor and various parsers. It
586            is now modular and supports various file formats via plugins (see
587            the new "Source" class).
588    
589    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
590    
591            * man/: Revised documentation after previous code changes.
592    
593    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
594    
595            * R/: Remaining changes as discussed with David.
596    
597    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
598    
599            * R/: Some changes as suggested by David. The rest will follow
600            within the next days.
601    
602    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
603    
604            * man/: Finished documentation.
605    
606    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
607    
608            * man/: Wrote some documentation.
609    
610    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/: Further syntactic sugar in form of additional assignment and
613            accessor methods.
614    
615    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
616    
617            * R/: Syntactic sugar in form of "length", "show" and "summary"
618            operators.
619    
620    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
621    
622            * R/: Diverse updates. Mainly on default operators ("[" or "c")
623            and dissimilarities.
624    
625    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
626    
627            * R/: Added similarity functions.
628    
629            * data/: Added English stopwords.
630    
631    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
632    
633            * data/: Examples compiled for new features
634    
635            * R/: Changes due to new structure.
636    
637            * NAMESPACE: Corrected namespace to reflect new structure.
638    
639            * R/termdocmatrix.R: Adapted for new naming scheme.
640    
641    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
642    
643            * R/textdoccol.R: Adapted code for new class structure. Wrote
644            several transform and filter functions operating on text document
645            collections (alias text document databases).
646    
647            * R/aobjects.R: Adapted class structure with inheritance,
648            repositories and additional meta data. Loading files on demand is
649            now possible.
650    
651    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
652    
653            * R/: Some cosmetic cleanups.
654    
655            * inst/: Removed vignette on clustering. That and much more is now
656            described in the JSS paper on text mining. Based upon that
657            article an elaborated vignette will be incorporated in the future.
658    
659    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/: Updated generic S4 methods to comply with signature changes
662            in newer versions of R (> 2.3)
663    
664    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
665    
666            * ext/R/importRIS.R: Automatic RIS import is now possible.
667    
668    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * R/textdoccol.R: Added RIS HTML input format.
671    
672    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
673    
674            * R/textdoccol.R: Removed bug that caused invalid text document
675            collections when handling many input files.
676    
677    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
678    
679            * R/textdoccol.R: Restructured and extended file import
680            mechanism.
681    
682            * inst/doc/clustering.Rnw: Adapted vignette for use with
683            ReutNews.rda
684    
685            * man/ReutNews.Rd: Documentation for ReutNews.rda
686    
687            * data/ReutNews.rda: A tiny Reuters21578 example data set.
688    
689    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
690    
691            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
692            clustering facilities of this package.
693    
694    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * R/aobjects.R: Changed package document structure to avoid class
697            dependency problems.
698    
699    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
702            data set.
703    
704            *  Finished documentation and reordered directory structure. Now "R
705            CMD check textmin" works without errors.
706    
707    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
708    
709            * src/: Various splits can now be easily created for the
710            Reuters21578 data set.
711    
712    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
713    
714            *  Updated documentation
715    
716    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            *  Wrote R documentation for some classes and methods.
719    
720    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
721    
722            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
723            files. See the questionnaire data/Umfrage.csv for such an example.
724            We are now able to import files in Reuters-21578 XML format.
725    
726            *  Changed class interfaces in various files. Weighting of the text
727            matrix is now possible.
728    
729    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
730    
731            * R/textdoccol.R: One can build term-document matrices if
732            necessary (with buildTDM(...)) and fill the field tdm from a text
733            document collection with it.
734    
735            * R/textmatrix.R: Wrote S4 class for term-document matrices.
736    
737    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
738    
739            * R/textdoccol.R: We now can read in a whole XML file with several
740            news items.
741    
742    2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
743    
744            * R/textdoccol.R: Set up an S4 class for a collection of text
