SCM

SCM Repository

[tm] Log of /trunk/tm/R/reader.R
[tm] / trunk / tm / R / reader.R  
ViewVC logotype

Log of /trunk/tm/R/reader.R

Parent Directory Parent Directory


Sticky Revision:
(Current path doesn't exist after revision 883)

Revision 789 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 16 11:26:19 2007 UTC (11 years, 10 months ago) by feinerer
File length: 11726 byte(s)
Diff to previous 777
Added MS Word reader (using antiword).

Revision 777 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 28 07:19:12 2007 UTC (11 years, 11 months ago) by feinerer
File length: 11124 byte(s)
Diff to previous 776
Function generators are now real S4 classes instead of S3 attributes.

Revision 776 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jul 29 15:27:41 2007 UTC (12 years ago) by feinerer
File length: 11313 byte(s)
Diff to previous 767
Removed manual pdftotext and pdfinfo checks (the system call gives a warning anyway).

Revision 767 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 14 16:50:44 2007 UTC (12 years, 1 month ago) by feinerer
File length: 11495 byte(s)
Diff to previous 766
Added simple HTML reader to produce StructuredTextDocuments.

Revision 766 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 14 08:46:23 2007 UTC (12 years, 1 month ago) by feinerer
File length: 9971 byte(s)
Diff to previous 757
Added PDF reader based on pdftotext and pdfinfo.

Revision 757 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 7 17:41:56 2007 UTC (12 years, 2 months ago) by feinerer
File length: 8151 byte(s)
Diff to previous 744
Added classes for Reuters21578 XML and RCV1 documents.

Revision 744 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 23 00:35:10 2007 UTC (12 years, 3 months ago) by feinerer
File length: 8147 byte(s)
Diff to previous 722
TermDocMatrix is now built by direct stepwise insertion, i.e., we save a lot of memory on construction.

Revision 722 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 15:53:58 2007 UTC (12 years, 4 months ago) by feinerer
File length: 8149 byte(s)
Diff to previous 717
Prettyprint summary, print method for plain text docs, removePunctuation.

Revision 717 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 16 11:13:04 2007 UTC (12 years, 5 months ago) by feinerer
File length: 8124 byte(s)
Diff to previous 698
Added Language slot to text documents. Refactored TextDocCol constructor.

Revision 698 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 6 17:05:44 2007 UTC (12 years, 7 months ago) by feinerer
File length: 7816 byte(s)
Diff to previous 697
Changes due to Kurt's review.

Revision 697 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 5 23:09:12 2007 UTC (12 years, 7 months ago) by feinerer
File length: 7835 byte(s)
Diff to previous 694
Fixed codetools warnings.

Revision 694 - (view) (download) (annotate) - [select for diffs]
Modified Sun Dec 31 14:47:46 2006 UTC (12 years, 7 months ago) by feinerer
File length: 7927 byte(s)
Diff to previous 693
Implemented improvements based upon comments by David.

Revision 693 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 22 13:21:30 2006 UTC (12 years, 7 months ago) by feinerer
File length: 8352 byte(s)
Diff to previous 690
Renamed textmin to tm directory since the package name changed.

Revision 690 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 16 17:22:56 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/textmin/R/reader.R
File length: 8352 byte(s)
Diff to previous 689
Renamed package to 'tm'. Updated documentation (man) for CRAN release.

Revision 689 - (view) (download) (annotate) - [select for diffs]
Added Fri Dec 8 14:21:46 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/textmin/R/reader.R
File length: 8368 byte(s)
Implemented changes as proposed at the Forschungsklausur on 01.12.2006.

This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a

Sort log by:

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge