problem with mp3 parser

classic Classic list List threaded Threaded
10 messages Options
Reply | Threaded
Open this post in threaded view
|

problem with mp3 parser

alxsss
Hello,

I have build mp3 parser and put it in C:\nutch\plugins . However, nutch does not find mp3's. I checked C:\Tomcat\webapps\ROOT\WEB-INF\classes\plugins dir. There is no parser-mp3 folder.

Any idea how to fix this?

Thanks.
Alex.

________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://o.aolcdn.com/cdn.webmail.aol.com/mailtour/aol/en-us/text.htm?ncid=aimcmp00050000000001
Reply | Threaded
Open this post in threaded view
|

problem with mp3 parser

alxsss

 Hi All,

I have in nutch/conf/nutch-default.xml the following


 <property>
? <name>plugin.includes</name>
? <value>nutch-extensionpoints|protocol-http|urlfilter-regex|parse-(text|html|js|mp3)|index-(basic|more)|query-(basic|more|site|url)|summary-basic|scoring-opic</value>


 ...


However in

C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml


<property>
  <name>plugin.includes</name>
  <value>protocol-http|urlfilter-regex|parse-(text|html|js)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
 

as you see mp3 is missing. And mp3 plugin is also missing in tomcat's plugin dir.

Any ideas why this happened?

Thanks.
Alex.


-----Original Message-----
From: [hidden email]
To: [hidden email]
Sent: Fri, 7 Dec 2007 3:08 pm
Subject: problem with mp3 parser










Hello,

I have build mp3 parser and put it in C:\nutch\plugins . However, nutch does not
find mp3's. I checked C:\Tomcat\webapps\ROOT\WEB-INF\classes\plugins dir. There
is no parser-mp3 folder.

Any idea how to fix this?

Thanks.
Alex.

________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! -
http://o.aolcdn.com/cdn.webmail.aol.com/mailtour/aol/en-us/text.htm?ncid=aimcmp00050000000001



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com
Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

Hasan Diwan
Think you may need the jar file in plugin/mp3/lib?

On 12/11/07, [hidden email] <[hidden email]> wrote:

>
>  Hi All,
>
> I have in nutch/conf/nutch-default.xml the following
>
>
>  <property>
> ? <name>plugin.includes</name>
> ?
> <value>nutch-extensionpoints|protocol-http|urlfilter-regex|parse-(text|html|js|mp3)|index-(basic|more)|query-(basic|more|site|url)|summary-basic|scoring-opic</value>
>
>
>  ...
>
>
> However in
>
> C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml
>
>
> <property>
>   <name>plugin.includes</name>
>
> <value>protocol-http|urlfilter-regex|parse-(text|html|js)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
>
>
> as you see mp3 is missing. And mp3 plugin is also missing in tomcat's plugin
> dir.
>
> Any ideas why this happened?
>
> Thanks.
> Alex.
>
>
> -----Original Message-----
> From: [hidden email]
> To: [hidden email]
> Sent: Fri, 7 Dec 2007 3:08 pm
> Subject: problem with mp3 parser
>
>
>
>
>
>
>
>
>
>
> Hello,
>
> I have build mp3 parser and put it in C:\nutch\plugins . However, nutch does
> not
> find mp3's. I checked C:\Tomcat\webapps\ROOT\WEB-INF\classes\plugins dir.
> There
> is no parser-mp3 folder.
>
> Any idea how to fix this?
>
> Thanks.
> Alex.
>
> ________________________________________________________________________
> More new features than ever.  Check out the new AIM(R) Mail ! -
> http://o.aolcdn.com/cdn.webmail.aol.com/mailtour/aol/en-us/text.htm?ncid=aimcmp00050000000001
>
>
>
>
>
>
> ________________________________________________________________________
> More new features than ever.  Check out the new AIM(R) Mail ! -
> http://webmail.aim.com
>

--
Sent from Gmail for mobile | mobile.google.com

Cheers,
Hasan Diwan <[hidden email]>
Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

alxsss

 I have this file file:///C:/nutch/plugins/parse-mp3/jid3lib-0.5.4.jar




 


 

-----Original Message-----
From: Hasan Diwan <[hidden email]>
To: [hidden email]
Sent: Tue, 11 Dec 2007 6:45 pm
Subject: Re: problem with mp3 parser










Think you may need the jar file in plugin/mp3/lib?

On 12/11/07, [hidden email] <[hidden email]> wrote:

>
>  Hi All,
>
> I have in nutch/conf/nutch-default.xml the following
>
>
>  <property>
> ? <name>plugin.includes</name>
> ?
> <value>nutch-extensionpoints|protocol-http|urlfilter-regex|parse-(text|html|js|mp3)|index-(basic|more)|query-(basic|more|site|url)|summary-basic|scoring-opic</value>
>
>
>  ...
>
>
> However in
>
> C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml
>
>
> <property>
>   <name>plugin.includes</name>
>
> <value>protocol-http|urlfilter-regex|parse-(text|html|js)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
>
>
> as you see mp3 is missing. And mp3 plugin is also missing in tomcat's plugin
> dir.
>
> Any ideas why this happened?
>
> Thanks.
> Alex.
>
>
> -----Original Message-----
> From: [hidden email]
> To: [hidden email]
> Sent: Fri, 7 Dec 2007 3:08 pm
> Subject: problem with mp3 parser
>
>
>
>
>
>
>
>
>
>
> Hello,
>
> I have build mp3 parser and put it in C:\nutch\plugins . However, nutch does
> not
> find mp3's. I checked C:\Tomcat\webapps\ROOT\WEB-INF\classes\plugins dir.
> There
> is no parser-mp3 folder.
>
> Any idea how to fix this?
>
> Thanks.
> Alex.
>
> ________________________________________________________________________
> More new features than ever.  Check out the new AIM(R) Mail ! -
> http://o.aolcdn.com/cdn.webmail.aol.com/mailtour/aol/en-us/text.htm?ncid=aimcmp00050000000001
>
>
>
>
>
>
> ________________________________________________________________________
> More new features than ever.  Check out the new AIM(R) Mail ! -
> http://webmail.aim.com
>

--
Sent from Gmail for mobile | mobile.google.com

Cheers,
Hasan Diwan <[hidden email]>



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com
Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

Hasan Diwan
On 12/12/2007, [hidden email] <[hidden email]> wrote:
>  I have this file file:///C:/nutch/plugins/parse-mp3/jid3lib-0.5.4.jar

Try putting it in file:///C:/nutch/plugins/parse-mp3/lib/jid3lib-0.5.4.jar
--
Cheers,
Hasan Diwan <[hidden email]>
Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

alxsss

 


 It did not help. Also I checked the search.dir value does not change in C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml although I changed it in nutch/conf/nutch-deafult.xml. Should the size of nutch*.war file to change depending on how many sites are fetched. Also if I out all nutch command in a file and execute it, nutch gives errors like some directory is not found, although the dir is there.

Thanks for any ideas.
Alex.




 

-----Original Message-----
From: Hasan Diwan <[hidden email]>
To: [hidden email]
Sent: Wed, 12 Dec 2007 9:34 am
Subject: Re: problem with mp3 parser










On 12/12/2007, [hidden email] <[hidden email]> wrote:
>  I have this file file:///C:/nutch/plugins/parse-mp3/jid3lib-0.5.4.jar

Try putting it in file:///C:/nutch/plugins/parse-mp3/lib/jid3lib-0.5.4.jar
--
Cheers,
Hasan Diwan <[hidden email]>



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com
Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

Hasan Diwan
On 12/12/2007, [hidden email] <[hidden email]> wrote:
>  It did not help. Also I checked the search.dir value does not change in C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml although I changed it in nutch/conf/nutch-deafult.xml. Should the size of nutch*.war file to change depending on how many sites are fetched. Also if I out all nutch command in a file and execute it, nutch gives errors like some directory is not found, although the dir is there.

No, the data is stored outside the web archive.

Is your machine externally accessible? If so, please email me offlist
and I'd love to take a (brief) look and let you know if I see
anything.
--
Cheers,
Hasan Diwan <[hidden email]>
Reply | Threaded
Open this post in threaded view
|

RE: problem with mp3 parser

DANIEL CLARK-4
In reply to this post by alxsss
1) Add the following to conf/parse-plugins.xml

        <mimeType name="audio/mpeg">
                <plugin id="parse-mp3" />
        </mimeType>

2) Make sure the following is in conf/parse-plugins.xml.

                <alias name="parse-mp3"
                        extension-id="org.apache.nutch.parse.mp3.MP3Parser"
/>

3) plugins/parse-mp3/plugin.xml should contain...

<plugin
   id="parse-mp3"
   name="MP3 Parse Plug-in"
   version="1.0.0"
   provider-name="nutch.org">

   <runtime>
      <library name="parse-mp3.jar">
         <export name="*"/>
      </library>
      <library name="jid3lib-0.5.4.jar"/>
   </runtime>

   <requires>
      <import plugin="nutch-extensionpoints"/>
   </requires>

   <extension id="org.apache.nutch.parse.mp3"
              name="MP3Parse"
              point="org.apache.nutch.parse.Parser">

      <implementation id="org.apache.nutch.parse.mp3.MP3Parser"
                      class="org.apache.nutch.parse.mp3.MP3Parser">
        <parameter name="contentType" value="audio/mpeg"/>
        <parameter name="pathSuffix" value=""/>
      </implementation>

   </extension>

</plugin>

4) Make sure jid3lib-0.5.4.jar and parse-mp3.jar is in directory
plugins/parse-mp3.

-----Original Message-----
From: [hidden email] [mailto:[hidden email]]
Sent: Wednesday, December 12, 2007 3:25 PM
To: [hidden email]
Subject: Re: problem with mp3 parser


 


 It did not help. Also I checked the search.dir value does not change in
C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml although I changed
it in nutch/conf/nutch-deafult.xml. Should the size of nutch*.war file to
change depending on how many sites are fetched. Also if I out all nutch
command in a file and execute it, nutch gives errors like some directory is
not found, although the dir is there.

Thanks for any ideas.
Alex.




 

-----Original Message-----
From: Hasan Diwan <[hidden email]>
To: [hidden email]
Sent: Wed, 12 Dec 2007 9:34 am
Subject: Re: problem with mp3 parser










On 12/12/2007, [hidden email] <[hidden email]> wrote:
>  I have this file file:///C:/nutch/plugins/parse-mp3/jid3lib-0.5.4.jar

Try putting it in file:///C:/nutch/plugins/parse-mp3/lib/jid3lib-0.5.4.jar
--
Cheers,
Hasan Diwan <[hidden email]>



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! -
http://webmail.aim.com

Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

alxsss

 Thanks for your comment. I had all of these except I had



<runtime>
      <library name="parse-mp3.jar">
         <export name="*"/>
      </library>
      <library name="jid3lib-0.5.1.jar"/>
   </runtime>


 instead

jid3lib-0.5.4.jar  that I used. I corrected it, but still did not get mp3 plugin in tomcat/webapps.. dir.
Should I compile mp3 parser again?

Also, as i wrote before changes in nutch/conf/nutch-default.xml does not go to
C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml, although it must be.

Any ideas why this does not work.

Thanks in advance.

Alex.



 


 

-----Original Message-----
From: Daniel Clark <[hidden email]>
To: [hidden email]
Sent: Wed, 12 Dec 2007 1:12 pm
Subject: RE: problem with mp3 parser










1) Add the following to conf/parse-plugins.xml

    <mimeType name="audio/mpeg">
        <plugin id="parse-mp3" />
    </mimeType>

2) Make sure the following is in conf/parse-plugins.xml.

        <alias name="parse-mp3"
            extension-id="org.apache.nutch.parse.mp3.MP3Parser"
/>

3) plugins/parse-mp3/plugin.xml should contain...

<plugin
   id="parse-mp3"
   name="MP3 Parse Plug-in"
   version="1.0.0"
   provider-name="nutch.org">

   <runtime>
      <library name="parse-mp3.jar">
         <export name="*"/>
      </library>
      <library name="jid3lib-0.5.4.jar"/>
   </runtime>

   <requires>
      <import plugin="nutch-extensionpoints"/>
   </requires>

   <extension id="org.apache.nutch.parse.mp3"
              name="MP3Parse"
              point="org.apache.nutch.parse.Parser">

      <implementation id="org.apache.nutch.parse.mp3.MP3Parser"
                      class="org.apache.nutch.parse.mp3.MP3Parser">
        <parameter name="contentType" value="audio/mpeg"/>
        <parameter name="pathSuffix" value=""/>
      </implementation>

   </extension>

</plugin>

4) Make sure jid3lib-0.5.4.jar and parse-mp3.jar is in directory
plugins/parse-mp3.

-----Original Message-----
From: [hidden email] [mailto:[hidden email]]
Sent: Wednesday, December 12, 2007 3:25 PM
To: [hidden email]
Subject: Re: problem with mp3 parser


 


 It did not help. Also I checked the search.dir value does not change in
C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml although I changed
it in nutch/conf/nutch-deafult.xml. Should the size of nutch*.war file to
change depending on how many sites are fetched. Also if I out all nutch
command in a file and execute it, nutch gives errors like some directory is
not found, although the dir is there.

Thanks for any ideas.
Alex.




 

-----Original Message-----
From: Hasan Diwan <[hidden email]>
To: [hidden email]
Sent: Wed, 12 Dec 2007 9:34 am
Subject: Re: problem with mp3 parser










On 12/12/2007, [hidden email] <[hidden email]> wrote:
>  I have this file file:///C:/nutch/plugins/parse-mp3/jid3lib-0.5.4.jar

Try putting it in file:///C:/nutch/plugins/parse-mp3/lib/jid3lib-0.5.4.jar
--
Cheers,
Hasan Diwan <[hidden email]>



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! -
http://webmail.aim.com




 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com
Reply | Threaded
Open this post in threaded view
|

Re: problem with mp3 parser

alxsss
In reply to this post by Hasan Diwan

 Unfortunately, my computer is not available remotely. What does offlist mean?

thanks.
Alex.


 


 

-----Original Message-----
From: Hasan Diwan <[hidden email]>
To: [hidden email]
Sent: Wed, 12 Dec 2007 1:05 pm
Subject: Re: problem with mp3 parser










On 12/12/2007, [hidden email] <[hidden email]> wrote:
>  It did not help. Also I checked the search.dir value does not change in
C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml although I changed it
in nutch/conf/nutch-deafult.xml. Should the size of nutch*.war file to change
depending on how many sites are fetched. Also if I out all nutch command in a
file and execute it, nutch gives errors like some directory is not found,
although the dir is there.

No, the data is stored outside the web archive.

Is your machine externally accessible? If so, please email me offlist
and I'd love to take a (brief) look and let you know if I see
anything.
--
Cheers,
Hasan Diwan <[hidden email]>



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com