COUNT2HUGE(1)	      User Contributed Perl Documentation	 COUNT2HUGE(1)

NAME
       count2huge.pl - Convert the output of count.pl to match the output of
       huge-count.pl.

SYNOPSIS
       count2huge.pl takes the output of count.pl and sorts the bigrams in
       alphabetical order. The output bigrams are in the same order as the
       output of huge-count.pl.

DESCRIPTION
       count2huge.pl converts the output of count.pl into the output that
       huge-count.pl would produce for the same input text and options. This
       is needed because the vector relatedness measure of UMLS-Similarity
       requires that bigrams starting with the same term be grouped together.
       Sorting the bigrams can also reduce processing time. For details,
       please see vector-input.pl in UMLS-Similarity.
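
       The grouping described above can be illustrated with a short Python
       sketch. It is not the code used by count2huge.pl. It assumes that each
       bigram line has the form "token1<>token2<>counts" and that any line
       without "<>" (such as a leading total count) should stay at the top.
       The function name sort_bigram_lines is hypothetical.

```python
# Sketch only: assumes count.pl-style lines "token1<>token2<>counts",
# optionally preceded by lines without "<>" (e.g. a total bigram count).

def sort_bigram_lines(lines):
    """Return the lines with bigram entries sorted by (first, second) token."""
    header, entries = [], []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # ignore blank lines
        # Lines without the "<>" separator are kept, unsorted, at the top.
        (entries if "<>" in line else header).append(line)

    def bigram_key(line):
        # Sort on the two tokens only, not on the trailing counts.
        return tuple(line.split("<>")[:2])

    return header + sorted(entries, key=bigram_key)
```

       After sorting, all bigrams that share a first token sit on adjacent
       lines, which is the property the vector relatedness measure relies on.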

       See perldoc count2huge.pl

USAGE
       count2huge.pl [OPTION] SOURCE DESTINATIONDIR

INPUT
   Required Arguments:
       SOURCE

       Input to	count2huge.pl is a single flat bigrams list file generated by
       count.pl.

       DESTINATIONDIR

       count2huge.pl sorts the bigrams in alphabetical order. The result file,
       count2huge.output, is written to the destination directory. Its
       contents match the output of huge-count.pl run on the same text with
       the same options as count.pl.

       --split N

       huge-split will divide the bigram token list generated by count.pl
       into parts. Each part created with --split N will contain N lines. The
       value of N should be chosen so that huge-sort.pl can run efficiently
       on any part containing N lines from the file that contains all bigrams.

       We suggest setting N to the number of KB of memory you have. If the
       computer has 8 GB of RAM, which is about 8,000,000 KB, N should be set
       to 8000000.
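
       The following Python sketch illustrates the idea behind --split: cut
       the list into parts of N lines, sort each part on its own, then merge
       the sorted parts. It is an assumption-laden illustration, not the
       implementation of huge-split or huge-sort.pl. It keeps the parts in
       memory, whereas a real tool would write them to disk, and all function
       names are hypothetical.

```python
import heapq
from itertools import islice


def suggested_split(ram_gb):
    """N = memory in KB, using the decimal convention 1 GB = 1,000,000 KB."""
    return ram_gb * 1_000_000


def chunks(lines, n):
    """Yield successive lists of at most n lines."""
    it = iter(lines)
    while True:
        part = list(islice(it, n))
        if not part:
            return
        yield part


def sort_in_parts(lines, n):
    """Sort each part of n lines separately, then merge the sorted parts."""
    sorted_parts = [sorted(part) for part in chunks(lines, n)]
    # heapq.merge combines already-sorted inputs into one sorted stream.
    return list(heapq.merge(*sorted_parts))
```

       With these definitions, suggested_split(8) gives 8000000, the value
       suggested above for a machine with 8 GB of RAM.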

   Other Options:

       --help

       Displays	the help information.

       --version

       Displays	the version information.

AUTHOR
       Ying Liu, University of Minnesota, Twin Cities.	liux0395@umn.edu

COPYRIGHT
       Copyright (C) 2009-2011,	Ying Liu

       Ying Liu, University of Minnesota, Twin Cities.	liux0395@umn.edu

       Ted Pedersen, University	of Minnesota, Duluth.  tpederse@umn.edu

       This program is free software; you can redistribute it and/or modify it
       under the terms of the GNU General Public License as published by the
       Free Software Foundation; either	version	2 of the License, or (at your
       option) any later version.  This	program	is distributed in the hope
       that it will be useful, but WITHOUT ANY WARRANTY; without even the
       implied warranty	of MERCHANTABILITY or FITNESS FOR A PARTICULAR
       PURPOSE.	 See the GNU General Public License for	more details.

       You should have received	a copy of the GNU General Public License along
       with this program; if not, write	to the Free Software Foundation, Inc.,
       59 Temple Place - Suite 330, Boston, MA	02111-1307, USA.

perl v5.32.1			  2011-03-31			 COUNT2HUGE(1)
