Text::TermExtract

Author: Michael Schilli
License: Freeware
Text::TermExtract description
Text::TermExtract is a Perl module to extract terms from text.

SYNOPSIS

use Text::TermExtract;

# q{...} quotes the text, so the apostrophes inside it need no escaping.
my $text = q{Hey, hey, how's it going? Wanna go to Wendy's
tonight? Wendy's has great sandwiches.};

my $ext = Text::TermExtract->new();

# Print the three highest-ranked terms, one per line.
for my $word ( $ext->terms_extract( $text, { max => 3 }) ) {
    print "$word\n";
}

# "sandwiches"
# "tonight"
# "wendy"

Text::TermExtract takes a simple approach to extracting the most interesting terms from documents of arbitrary length.

There are more scientific methods of term extraction, like Yahoo's online term extraction API (which you can't run locally) and the Lingua::YaTeA module on CPAN (which is so poorly documented that I couldn't figure out how to use it).

So I wrote Text::TermExtract, which first tries to guess the language a text is written in, kicks out the language-specific stopwords, weights the rest with a hand-crafted formula and returns a list of (hopefully) interesting words.

This is a very crude approach to term extraction. If you have a better method and want to include it in Text::TermExtract, drop me an email; I'm interested.
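
The pipeline described above (drop stopwords, weight what remains, return the top terms) can be sketched in plain Perl without the module. This is only an illustration under stated assumptions: the stopword list is a tiny hypothetical one, the weight (frequency times word length) is invented for the example and is not the module's hand-crafted formula, and the language-guessing step is left out entirely. The name extract_terms is likewise hypothetical.

```perl
use strict;
use warnings;

# Hypothetical, deliberately tiny English stopword list (an assumption for
# this sketch; the module uses its own language-specific lists).
my %STOPWORD = map { $_ => 1 }
    qw(a an and are going has hey how is the wanna);

# Return up to $max terms from $text, highest score first.
sub extract_terms {
    my ($text, $max) = @_;

    # Lowercase the text and count every alphabetic run that is at least
    # three letters long and not a stopword.
    my %count;
    for my $word ( lc($text) =~ /[a-z]+/g ) {
        next if length($word) < 3 || $STOPWORD{$word};
        $count{$word}++;
    }

    # Illustrative weight: frequency times word length.
    # This is NOT the formula Text::TermExtract uses.
    my %score = map { $_ => $count{$_} * length($_) } keys %count;

    # Sort by score, breaking ties alphabetically for a stable result.
    my @ranked = sort { $score{$b} <=> $score{$a} || $a cmp $b } keys %score;

    # Clamp $max to the number of terms actually found.
    $max = @ranked if !defined($max) || $max > @ranked;
    return @ranked[ 0 .. $max - 1 ];
}

print "$_\n" for extract_terms(
    q{Hey, hey, how's it going? Wanna go to Wendy's tonight? Wendy's has great sandwiches.},
    3,
);
```

The length filter is what discards the stray "s" left over from "Wendy's" and "how's", along with short function words such as "to" and "it". A real stopword list would make that shortcut unnecessary.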
Similar software
ECMerge Pro (Linux): ECMerge compares and merges local/FTP/SCC text/images/folders, side-by-side or 3-way. It is designed for software engineers, web authors and other professionals who work with multiple revisions of text files or who need to keep multiple folder hierarchies in sync. Two text ...

ECMerge Pro (Solaris): ECMerge compares and merges files and folders, side-by-side or 3-way. It is designed for software engineers, web authors and other professionals who work with multiple revisions of text files or who need to keep multiple folder hierarchies in sync. Two ...

ECMerge Standard (Solaris): ECMerge compares and merges files and folders, side-by-side. It is designed for software engineers, web authors and other professionals who work with multiple revisions of text files or who need to keep multiple folder hierarchies in sync. Define favourite comparisons ...

Movie Player Pro ActiveX OCX SDK: For professional Windows developers who need to provide video/audio playback within their business applications. Overlay text and bitmaps on video at the same time. Multi-line scrolling text on video. Supports Mov, M4a, Mp4, 3gp, Divx, AVI, WMV, MPEG-1, RM (need RM ...

VideoCap Pro Video OCX ActiveX: Capture video from a capture card, TV tuner, DV cam or DVD player to AVI, WMV 9 or WMV 8 file format. Draw an overlay bitmap on live video or save to a video file. User-defined transparent color and alpha value. Draw overlay time stamp ...

HippoEDIT: HippoEDIT is a professional Windows text editor for programmers and advanced users that speeds up text typing and source code analysis with smart and sophisticated features that help you be more productive and creative. It is lightweight, fast and highly ...
Text::MacroScript: Text::MacroScript is a macro pre-processor with embedded Perl capability.

SYNOPSIS

use Text::MacroScript ;

# new() for macro processing

my $Macro = Text::MacroScript->new ;
while( <> ) {
    print $Macro->expand( $_ ) if $_ ;
}

# Canonical use (the filename improves error messages):
my $Macro = Text::MacroScript->new ;
while( <> ) {
print ...

Scintilla: Scintilla is a free source code editing component. Scintilla comes with complete source code and a license that permits use in any free project or commercial product.

As well as features found in standard text editing components, Scintilla includes features especially ...

MYTUI: MYTUI is a TUI widget library based on curses. It is written in C and provides many ready-to-use widgets for rapid application development of text user interfaces. It is mainly intended for developing UNIX-based applications. Basically a curses or ncurses ...

WebSweep for Linux: WebSweep is an HTML converter that takes any text file (or related) and transforms it into a web page ready for deployment.

WebSweep is an easy-to-use application that can also detect and transform all ...

Test::Verbose: Given a list of test scripts, source file names, directories and/or package names, attempts to find and execute the appropriate test scripts.

This (via the associated tv command) is useful when developing code or test scripts: just map "tv %" to ...

Stats
Downloads: 22
Version: 0.02
Size: 10 KB
Popularity: 1500/1272475