Main Page   Class Hierarchy   Compound List   File List   Compound Members   File Members  

ripper.cc File Reference

#include <fstream>
#include <sstream>
#include <vector>
#include "handler-parser.h"
#include "repos-reader.h"
#include "parseelt.h"
#include "graphbuilder.h"
#include "linkgraph.h"
#include "talker.h"

Include dependency graph for ripper.cc:

Include dependency graph

Go to the source code of this file.

Compounds

class  Ripper
 Rips the repository and interfaces with main(). Calls GraphBuilder to build the web link graph. More...


Defines

#define RIPPER_NAMELEN   20
#define RIPPER_TMPDIRLEN   20

Functions

void error (string errmsg)
void usage ()
void OutOfMemory ()
 Prints statistics when we run out of memory. More...

int main (int argc, char **argv)
 Reads the document repository and produces a WebNode object for each read document. This is handled by GraphBuilder. After the WebNodeList (web link graph) is constructed, enters optionally into interactive mode, by communicating with an external program (jack) through pipes. More...


Variables

char * defaultrippername = ""
char * defaulttempdir = "/tmp/"
int num_docs_processed = 0
Ripperripper = NULL


Detailed Description

Definition in file ripper.cc.


Define Documentation

#define RIPPER_NAMELEN   20
 

Definition at line 73 of file ripper.cc.

Referenced by Ripper::ParseCmdLineArgs(), and Ripper::SetupHandlers().

#define RIPPER_TMPDIRLEN   20
 

Definition at line 74 of file ripper.cc.

Referenced by Ripper::ParseCmdLineArgs().


Function Documentation

void error string    errmsg [static]
 

Definition at line 43 of file ripper.cc.

Referenced by usage().

int main int    argc,
char **    argv
 

Reads the document repository and produces a WebNode object for each read document. This is handled by GraphBuilder. After the WebNodeList (web link graph) is constructed, enters optionally into interactive mode, by communicating with an external program (jack) through pipes.

Definition at line 394 of file ripper.cc.

References Ripper::flags_, Ripper::GetGraphBuilder(), Ripper::indexout, OutOfMemory(), Ripper::ParseCmdLineArgs(), Ripper::PrintStatistics(), WebLinkGraph::PrintWebGraph(), Ripper::PublishWebGraph(), Ripper::rep_files_, Ripper::rippername, Ripper::RipRepository(), Ripper::SetupHandlers(), Talker::Talk(), and Ripper::tempdir.

void OutOfMemory  
 

Prints statistics when we run out of memory.

Definition at line 379 of file ripper.cc.

References Ripper::PrintStatistics().

Referenced by main().

void usage   [static]
 

Definition at line 45 of file ripper.cc.

References error().

Referenced by Ripper::ParseCmdLineArgs(), and Ripper::SetupHandlers().


Variable Documentation

char* defaultrippername = "" [static]
 

Definition at line 69 of file ripper.cc.

Referenced by Ripper::Ripper().

char* defaulttempdir = "/tmp/" [static]
 

Definition at line 70 of file ripper.cc.

Referenced by Ripper::Ripper().

int num_docs_processed = 0
 

Definition at line 71 of file ripper.cc.

Referenced by Ripper::ParseCmdLineArgs(), Talker::PrintStatistics(), Ripper::PrintStatistics(), and Ripper::RipRepository().

Ripper* ripper = NULL
 

Definition at line 376 of file ripper.cc.


Generated on Wed May 29 11:37:21 2002 for MarkovPR by doxygen1.2.15