| Forum: TestForum |
|---|
| Topic: fudcode test |
|---|
| fudcode test |
Sat, 19 October 2002 09:31 |
|
Run index -H to recreate citations.
And this is
another
code
fragment
presented.
[Updated on: Fri, 01 November 2002 06:34]
-- kir
|
|
| | Forum: Announcements |
|---|
| Topic: ASPseek 1.2.9 released |
|---|
| ASPseek 1.2.9 released |
Wed, 03 July 2002 04:00 |
|
This is really the most-awaited ASPseek release. It has a number of changes; I hope the release notes below will guide you through them. Please note the upgrade information in the release notes.
I would like to thank Matt Sullivan for a number of valuable patches he sent that are included in this release. Thanks also go to Jeff Watts for implementing \N functionality in the Replace command of aspseek.conf, a feature that was requested by a number of users.
As release manager, I am taking the opportunity to dedicate this release to my dearest wife Elena, who is celebrating her birthday today. Happy Birthday to You, Lenka!
Links: changelog | download
Below are release notes for 1.2.9:
Quite a lot of changes. Several bugs were fixed, including two rare memory leaks in searchd and several core dumps, which leads to improved stability. This release should also compile cleanly on FreeBSD.
This release also contains several fixes from Matt Sullivan <matt@sullivan.gen.nz>. Below is the author's description of the patches:
- Fixed non thread safe use of scanner typeTable which caused corruption of the table in medium to high load situations (in particular this permanently broke use of "-" in queries (until next searchd restart) i.e. 'abc -xyz' would become 'abc AND xyz').
- Fixed a small bug in templates.cpp which caused newlines to be added before ending font tag during cached page highlighting (effect was that cached page would not appear exactly as original in some cases).
- Fixed rare segfault resulting from buffer overflow when creating query key for query cache (many stemmed words could overflow buffer).
- Improved tag parsing to handle omitted quotes; this fixes cases such as <A HREF="http://www.server.com" TARGET=_new">. A side effect is that more URLs are discovered. Previously the remainder of the document would be ignored (so its URLs were not added).
- Fixed problem where script name has no suffix (exacerbated by addition of host to script_name) also removes prepending of hostname to script_name (not removed in mod_aspseek although it should be optional here also).
- Initial URL insertion (via "Server" config parameters or use of '-i') and URL deletion ('-C'; was this by design?) does not use delmap.
- Fixes order of logging of "Adding URL" in single-threaded mode to be consistent with both realtime and threaded index modes, i.e. log before the call to HTTPGetUrlAndStore() rather than after (in the past I think this has been a source of some user confusion when messages such as "URL deleted" appear before rather than after "Adding URL").
- Adds support for HTTP method POST to s.cgi.
- Adds feature which allows non-incrementing of hops value when redirects encountered. Adds two config options: IncrementHopsOnRedirect and RedirectLoopLimit.
Great work, Matt!
Starting with this version, index implements a new strategy for indexing "dead" sites (sites that do not respond to requests). The number of threads processing such sites is now limited to a quarter of the total number of threads, as long as there are enough non-dead sites to process.
Also, a nice new feature was added. The "Replace" aspseek.conf directive now works like sed's "s" command, so it accepts \( and \) constructions in the search expression and \1 to \9 in the replacement. See the aspseek.conf(5) man page for more details. The code was contributed by Jeff Watts <jeff.watts@ni.com>.
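To give a feel for sed-style grouping and back-references, here is a minimal Python sketch. It does not use ASPseek's own parser: Python's `re` module stands in for it, and Python writes groups as `( )` where sed's "s" command (and, per the note above, Replace) uses `\( \)`. The host name and the pattern are made-up examples; the real directive syntax is documented in aspseek.conf(5).

```python
import re

# Hypothetical rewrite rule. In sed's "s" command it would look roughly like
#   s|http://\(.*\)\.example\.com/old/\(.*\)|http://\1.example.com/new/\2|
# Python's re module spells the groups ( ) instead of \( \).
pattern = re.compile(r"http://(.*)\.example\.com/old/(.*)")
replacement = r"http://\1.example.com/new/\2"

def rewrite(url: str) -> str:
    """Apply the substitution once; URLs that do not match are returned unchanged."""
    return pattern.sub(replacement, url, count=1)

print(rewrite("http://www.example.com/old/page.html"))
```

Here `\1` carries over the host prefix and `\2` the path captured by the second group.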
If you are upgrading from 1.2.8 or earlier versions, please note the following:
- If you had many sites and re-indexed them, it is advisable to run index -H to re-create the citation index files. Versions of ASPseek prior to this one had a bug that caused extra bytes to be written to these files while merging the direct citation index.
- If you have used the "Cache" feature, please rename the SQL table "cache" to "rescache". This is done with the "ALTER TABLE cache RENAME TO rescache" SQL statement.
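The rename can be tried out safely before touching a real database. The sketch below is only an illustration: it uses an in-memory SQLite database as a stand-in for the MySQL database ASPseek actually uses, and the column layout of the table is invented. Only the ALTER TABLE statement itself comes from the note above.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in for the real
# ASPseek database; the columns here are invented for the demo.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cache (id INTEGER PRIMARY KEY, payload TEXT)")
conn.execute("INSERT INTO cache (payload) VALUES ('example row')")

# The statement quoted in the upgrade note.
conn.execute("ALTER TABLE cache RENAME TO rescache")

# The old name is gone and the rows are reachable under the new name.
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
rows = conn.execute("SELECT payload FROM rescache").fetchall()
print(tables, rows)
```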
[Updated on: Thu, 31 October 2002 18:49]
|
|
| | Topic: CVS is moved to cvs.aspseek.org |
|---|
| CVS is moved to cvs.aspseek.org |
Sat, 13 July 2002 04:00 |
|
We have migrated the CVS repository that holds the aspseek source code from an internal ASPLinux server to a new location: cvs.aspseek.org. The main reason was to open write access to the most active outside developers (previously only read-only access was available, and the only way to contribute code was to send a patch).
Anonymous CVS access is available as usual; the instructions were updated to reflect the new location.
Also, ViewCVS was installed on the new CVS server; you can see it in action.
If you want to join ASPseek development, please read this introductory article.
[Updated on: Thu, 31 October 2002 19:05]
|
|
| | Topic: ASPseek 1.2.10 released |
|---|
| ASPseek 1.2.10 released |
Mon, 22 July 2002 04:00 |
|
This is mostly a bug-fix release. One nasty bug that was introduced in the previous release and caused index to dump core while merging reverse citations is fixed. Note that you need to run index -H to recreate the citation index if you have used 1.2.9; the alternative is to clear the whole database and re-index everything from scratch.
Several less important bugs were fixed as well. Configuration files were revised, and some ancient unused stuff was cleared out of them. Several minor documentation bugs were fixed. A few minor fixes went into init.d/aspseek, the postinstall script for MySQL, and the .spec file used to create the RPM package.
Only one feature was added: the DBLibDir directive for searchd and index, which adds a directory to the list of directories searched for the database backend library.
Links: changelog | download
[Updated on: Thu, 31 October 2002 18:52]
|
|
| | Topic: New web site is online |
|---|
| New web site is online |
Thu, 31 October 2002 19:47 |
|
After about a month of slow development, I have finally decided to show the new aspseek.org site to the public. The content is almost the same as on the old site (which is available as old.aspseek.org); only the appearance has changed. The site now feels cleaner and more elegant to me; your opinions are warmly welcome (Hint: use the "Reply" button below).
The new site uses CSS2 (including CSS positioning), so users of "old" browsers (like NS4) may experience some problems. Solution: upgrade your browser (I just do not want to use HTML tables for page layout anymore).
The most exciting thing about the new site is the new forum. To tell you the truth, I was extremely disappointed with the old "home-grown" forum used at www.aspseek.org before: it was ugly, cumbersome and hard to deal with, so I was unable to use it to help ASPseek users solve their problems. I hope the new forum will fix that. The new forum (which uses PHP and the FUDforum software) has a large set of features, including avatars, skins, polls, formatting and whatnot.
But the coolest thing is the mailing list integration, which works in both directions. In other words, new messages in the "ASPseek users" forum are mailed to the aseek-users@ mailing list, and vice versa: messages from the mailing list appear in the forum. That essentially means you can use your e-mail reader and the forum.aspseek.org site interchangeably. I'm amazed, to say the least.
Disclaimer: please consider the site beta; there may be some bugs. Please send bug reports about the site as replies to this message, and do not forget to mention your OS/browser and attach a screenshot. Yes, I'm interested in testimonials as well.
-- kir
|
|
| | Forum: Mailing list: aseek-devel |
|---|
| Topic: [aseek-devel] VERY slow searches when many urls are returned |
|---|
| [aseek-devel] VERY slow searches when many urls are returned |
Thu, 21 November 2002 23:30 |
pepelaz Messages: 9
Registered: November 2002
Location: Nashville, TN, USA |
Junior Member |
|
|
I have this huge problem with search speed.
Whenever a query gives many results, the search takes too long.
For example, search for 'work' gives like 30k of links and takes 18.5
seconds. And search for 'good work' gives about 12k docs and takes less
than 2 secs. Searches that give few 100's of links take about 0.1 of a
second!
What is the problem? All CPU cycles are consumed by searchd during the
search time, so it is not the database. I encounter exactly the same
problem with aspseek search.
Please, help.
Best
Max
|
|
| | Topic: [aseek-devel] autoconf/automake upgrade |
|---|
| [aseek-devel] autoconf/automake upgrade |
Thu, 19 December 2002 05:41 |
|
Matt, what do you think about changing all auto*-related stuff in ASPseek
branch v_1_3 to use newer automake/autoconf (say, automake-1.6 and
autoconf-2.53)? There are some nifty features in newer versions, as well
as some incompatibilities, so the change will not be backward-compatible.
Normal users (except those who grab the sources from CVS) will hardly
notice the upgrade, as these are developer-only tools. As for the CVS
users, I can set up nightly builds of CVS tarballs.
--
== kir_at_asplinux.ru == 7551596_at_ICQ == 6722750_at_sms.beemail.ru ==
Dream like you'll live forever...Love like you've never been hurt...
Work like you don't need the money...and Dance like nobody is watching!
-- Satchel Paige
|
|
| | Forum: Mailing list: aseek-users |
|---|
| Topic: [aseek-users] list has been created |
|---|
| | Topic: [aseek-users] Re: searchd core dumps |
|---|
| [aseek-users] Re: searchd core dumps |
Thu, 25 January 2001 12:39 |
|
Have you tried ASPSeek-1.0.1? What causes this coredump?
Achilleas Mantzios wrote:
>
> Here is the output of the debugger
>
> (gdb) where
> #0 0x401b07b2 in __libc_accept () from /lib/libc.so.6
> #1 0x40064833 in accept (fd=2, addr={__sockaddr__ = 0xbfffe5f0,
> __sockaddr_at__ = 0xbfffe5f0, __sockaddr_ax25__ = 0xbfffe5f0,
> __sockaddr_dl__ = 0xbfffe5f0, __sockaddr_eon__ = 0xbfffe5f0,
> __sockaddr_in__ = 0xbfffe5f0, __sockaddr_in6__ = 0xbfffe5f0,
> __sockaddr_inarp__ = 0xbfffe5f0, __sockaddr_ipx__ = 0xbfffe5f0,
> __sockaddr_iso__ = 0xbfffe5f0, __sockaddr_ns__ = 0xbfffe5f0,
> __sockaddr_un__ = 0xbfffe5f0, __sockaddr_x25__ = 0xbfffe5f0},
> addr_len=0xbfffe5ec) at wrapsyscall.c:189
> #2 0x8057083 in daemon () at daemon.cpp:770
> #3 0x805756b in startdaemon () at daemon.cpp:819
> #4 0x80594fd in main (argc=0, argv=0xbffffa64) at daemon.cpp:916
> #5 0x400edb65 in __libc_start_main (main=0x8058b00 <main>, argc=2,
> ubp_av=0xbffffa64, init=0x804a090 <_init>, fini=0x809440c <_fini>,
> rtld_fini=0x4000df24 <_dl_fini>, stack_end=0xbffffa5c)
> at ../sysdeps/generic/libc-start.c:111
> (gdb)
>
> Any clue??
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: var/aspseek/logs.txt |
|---|
| [aseek-users] Re: var/aspseek/logs.txt |
Thu, 25 January 2001 12:39 |
|
Achilleas Mantzios wrote:
>
> Is there a detailed description of the log file??
No. Basically, index prints progress info, telling how many URLs
it has processed in this turn and in how many seconds. This was
used for debugging and can't be relied upon. Maybe we will print
something more sensible there in the next releases.
>
> Is
>
> select count(distinct url_id) from urlword ;
>
> the most accurate way of knowing my total number of urls added?? (during the
> index phase)
First, you don't need "distinct", as url_id in the urlword table is unique.
Second, yes, this gets you the total number of URLs in the database, but URLs
with status==0 are not indexed yet, and there are many other values for
status, like 404 - Document not found. Anyway, you'll get the total number
of URLs, not just the URLs that were indexed during the last session. And you'd
better use index -S anyway to produce some sensible info. Also, there's the
var/aspseek/total file, in which you can find a number to put on the
search homepage (like on google.com or aspseek.com).
So far index does not print the number of URLs indexed in a session,
but you can print the variable curUrl, somewhere just before
exit.
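The points about "distinct" and status can be checked on a toy table. This is only a sketch under stated assumptions: an in-memory SQLite table stands in for the real MySQL urlword table, it has just the two columns named in this reply, the rows are invented, and treating status 200 as "fetched OK" is an assumption (the reply only names 0 and 404).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Simplified stand-in for the urlword table: only the two columns
# mentioned in the message, with url_id unique.
conn.execute("CREATE TABLE urlword (url_id INTEGER PRIMARY KEY, status INTEGER)")
conn.executemany(
    "INSERT INTO urlword (url_id, status) VALUES (?, ?)",
    [(1, 200), (2, 200), (3, 0), (4, 404), (5, 200)],  # invented sample rows
)

def scalar(sql: str) -> int:
    """Run a query that returns a single number."""
    return conn.execute(sql).fetchone()[0]

total = scalar("SELECT COUNT(*) FROM urlword")
total_distinct = scalar("SELECT COUNT(DISTINCT url_id) FROM urlword")
not_indexed_yet = scalar("SELECT COUNT(*) FROM urlword WHERE status = 0")
by_status = dict(conn.execute(
    "SELECT status, COUNT(*) FROM urlword GROUP BY status"))

print(total, total_distinct, not_indexed_yet, by_status)
```

Because url_id is unique, the plain count and the distinct count agree; grouping by status shows how much of the total is actually indexed.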
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: [aseek]: searchd problem (yet another searchd problem) |
|---|
| [aseek-users] Re: [aseek]: searchd problem (yet another searchd problem) |
Thu, 25 January 2001 12:43 |
|
Achilleas Mantzios wrote:
>
> I found out , that when someone telnets to searchd
> telnet localhost 12345
> then searchd simply crashes
Hmm..we'll look into this. For now, use 1.0.1 and AllowFrom commands
in searchd.conf to protect searchd from attacks.
Also, you can run searchd from init, so init will restart it in case
of crash.
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: [aseek]: General info , yep its me again :) |
|---|
| [aseek-users] Re: [aseek]: General info , yep its me again :) |
Fri, 26 January 2001 05:06 |
|
Achilleas Mantzios wrote:
>
> I know i've been bombing you with questions but
> can you please tell me
>
> a) how many urls aspseek crawls per day (lets say the average case for an
> average 200 kbytes/s connection with the web server, and an average document
> size of (400 words, or 4kbytes) in your own runs??? (VERY IMPORTANT)
> b) Can we run aspseek on multiple machines all feeding up one db??
> c) can you plz send me your aspseek.conf file (of a typical run). What i need
> to see is your initial seed of servers and how you use MaxDocSize, Follow,
> FollowOutSide
>
> Plz send this info if you can, or i will wait for the person who is currently
> on vacation.
As for b) - no, but we can ask s.cgi to connect to many search daemons, thus
making distributed system, though in a different way. So, you can have one
machine indexing these sites, other indexing those sites, and the third one
running s.cgi with two "DaemonAddress" variables in template, pointing to
two machines. s.cgi will do the work of merging and sorting the results.
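To picture the "merging and sorting" step, here is a minimal Python sketch. It is not s.cgi's actual code: it assumes, purely for illustration, that each daemon returns its hits already sorted best-first as (rank, url) pairs, and the ranks and URLs are made up.

```python
import heapq
from itertools import islice
from typing import Iterable, List, Tuple

Result = Tuple[float, str]  # (rank, url); both fields are illustrative

def merge_results(*per_daemon: Iterable[Result], limit: int = 10) -> List[Result]:
    """Merge result lists that are each sorted best-first into one ranked list."""
    merged = heapq.merge(*per_daemon, key=lambda r: r[0], reverse=True)
    return list(islice(merged, limit))

# Invented results from two hypothetical search daemons.
daemon_a = [(0.9, "http://a.example/1"), (0.4, "http://a.example/2")]
daemon_b = [(0.7, "http://b.example/1"), (0.1, "http://b.example/2")]

top = merge_results(daemon_a, daemon_b, limit=3)
print(top)
```

Since each input is already sorted, the merge never has to re-sort everything; it only compares the current best hit of each daemon.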
As for other questions, yes, only he can answer. Please be patient and
wait...
PS BTW you can subscribe to aseek-users mailing list. Just send mail
to majordomo@lists.asplinux.ru with the line
subscribe aseek-users
in body.
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] some times Search results not sensible |
|---|
| [aseek-users] some times Search results not sensible |
Fri, 26 January 2001 09:12 |
Achilleas Mantzios Messages: 21
Registered: October 2002
|
Junior Member |
|
|
In addition to my previous post,
it is easy to see the greek pages,
by choosing as charset
iso8859-7 (the official greek encoding, and not the win1234 or something like
that that msoft has put out).
===========================Previous Message==================
just try
http://194.219.147.3/cgi-bin/s.cgi?q=%EA%FC%ED%E9%F4%F3%E1
The problem is that the first hit is completely irrelevant,
although a word in it is very weakly related (acoustically only, .... i dont
think... it is a possible clue..).
Of course it is difficult to focus on the
problem because it's a Greek word. Has anybody faced the same problem with their
national language?
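For readers who want to see which word is being searched for, the query string from the URL quoted in this message can be decoded with the charset mentioned at the top of the post. This is a small sketch using Python's standard library, not anything ASPseek does internally.

```python
from urllib.parse import quote, unquote

encoded = "%EA%FC%ED%E9%F4%F3%E1"   # the q= value from the s.cgi URL in this post

# Decode the percent-escapes as ISO-8859-7 bytes rather than UTF-8.
word = unquote(encoded, encoding="iso-8859-7")

# Round trip: encoding the word the same way gives back the original escapes.
reencoded = quote(word, encoding="iso-8859-7")

print(word, len(word), reencoded == encoded)
```

Decoding the same bytes with the wrong charset would produce a different word, which is one way a search front end and its index can end up disagreeing about a query.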
Btw Kir, the group problem seems to persist.
It breaks results from the same site into different groups.
Of course I built my db across the 1.0.0 & 1.0.1 releases,
but neither the index executable nor the db schema seems to differ at all.
to see that just try
http://194.219.147.3/cgi-bin/s.cgi?q=woman
to see it in action.
--
Application Developer
Eurisko A.E.
9
106 71
: +301 3633362
: +301 3633074
e-mail: achilleas.mantzios@eurisko.gr
|
|
| | Topic: [aseek-users] [Fwd: Re: some times Search results not sensible] |
|---|
| [aseek-users] [Fwd: Re: some times Search results not sensible] |
Fri, 26 January 2001 10:00 |
|
-------- Original Message --------
From: Kir Kolyshkin <kir@asplinux.ru>
Subject: Re: some times Search results not sensible
To: Achilleas Mantzios <achilleas.mantzios@eurisko.gr>
Achilleas Mantzios wrote:
> Btw Kir the group problem seems to persist.
> A breaks results from the same site onto different groups.
> Of cource i built my db across the 1.0.0 & 1.0.1 releases,
> but neither the index executable or the db schema seem to differ at all.
>
> to see that just try
> http://194.219.147.3/cgi-bin/s.cgi?q=woman
> to see it in action.
Well, now you have two different sites in the results, so they are now grouped
by site. Just insert <INPUT TYPE=hidden NAME=gr VALUE=off> into the search
form and grouping will be turned off, like in
http://194.219.147.3/cgi-bin/s.cgi?q=woman&gr=off
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: New message on the WebBoard "mnoGoSearch web board": Differences beetwen |
|---|
| [aseek-users] Re: New message on the WebBoard "mnoGoSearch web board": Differences beetwen |
Mon, 29 January 2001 04:48 |
|
Catalin Braescu wrote:
> What DBMS (database) does ASPSeek use? What is the sugested hardware for a
> Linux box that have to handle about 400,000 documents? What size the index
> file will have? many thanks for any hints.
Currently ASPSeek can use only MySQL. We are in the process of developing support for
other SQL databases.
Well, 400,000 is not so big for ASPSeek, so an ordinary PII/PIII with 64-128 MB
of RAM will be enough, I believe. More RAM leads to faster indexing and searching.
And don't forget to tune your MySQL (described in the ASPSeek FAQ).
The size of the index is about 1/3 to 1/2 of the total size of the indexed pages. So it
depends on which pages you want to index.
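As a worked example of that rule of thumb, the sketch below estimates the index size for the 400,000 documents from the question. The average page size of 4 KB is an assumption made only for this calculation; it is not given in this reply, so substitute your own figure.

```python
from typing import Tuple

def index_size_range(total_page_bytes: int) -> Tuple[int, int]:
    """Return (low, high) index size estimates: 1/3 to 1/2 of the page total."""
    return total_page_bytes // 3, total_page_bytes // 2

documents = 400_000
avg_page_bytes = 4 * 1024          # assumed average page size, not from the reply
total = documents * avg_page_bytes
low, high = index_size_range(total)

print(f"pages: {total / 2**20:.0f} MiB, "
      f"index: {low / 2**20:.0f}-{high / 2**20:.0f} MiB")
```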
Please feel free to send your ASPSeek-related questions to aseek@sw.com.sg,
or subscribe to ASPSeek users mailing list (you can do so via a link in
ASPSeek homepage http://www.sw.com.sg/products/aspseek/).
Regards,
Kir.
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: [aseek]: Separate frontend and backend machines |
|---|
| [aseek-users] Re: [aseek]: Separate frontend and backend machines |
Tue, 30 January 2001 09:31 |
|
tld@id.dk wrote:
> What I cannot do (and really would like to) is to tell the s.cgi program to connect
> to the machine running the searchd. Where and how do I tell it that searchd does not
> run on the localhost?
>
> Hope you can help me on this.
Well, it's as easy as finding the DaemonAddress line at the top of the etc/s.htm file
and replacing 127.0.0.1 with your daemon's address. You should also allow access to
searchd from the frontend machine. To do that, change the AllowFrom 127.0.0.1 directive
in searchd.conf to the IP of your frontend machine, and restart searchd.
> BTW - FANTASTIC program.
Thanx!
Please report about your further progress with ASPSeek.
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: [aseek]: Separate frontend and backend machines |
|---|
| [aseek-users] Re: [aseek]: Separate frontend and backend machines |
Wed, 31 January 2001 11:15 |
|
According to RFC1945, all redirection URLs should be absolute.
Quote:
"10.11 Location
The Location response-header field defines the exact location of the
resource that was identified by the Request-URI. For 3xx responses,
the location must indicate the server's preferred URL for automatic
redirection to the resource. Only one absolute URL is allowed.
Location = "Location" ":" absoluteURI
An example is
Location: http://www.w3.org/hypertext/WWW/NewLocation.html"
PS Could you please subscribe to aseek-users@, so we will discuss all the
stuff there? Send "subscribe aseek-users" to majordomo@lists.asplinux.ru
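For anyone who wants to make such redirects absolute automatically, resolving a relative target against the URL of the page that issued it is a one-liner in most languages. The Python sketch below shows the idea; it is not what ASPseek does, the page URL is hypothetical, and the relative target is the one from the file quoted further down in this message.

```python
from urllib.parse import urljoin

def absolute_redirect(page_url: str, target: str) -> str:
    """Resolve a redirect target against the URL of the page that issued it."""
    return urljoin(page_url, target)

# Hypothetical page URL; the relative target comes from the quoted attachment.
page = "http://www.example.com/publications/index.html"
resolved = absolute_redirect(page, "/aspfiles/ny_pub_oversigt.asp")

# A target that is already absolute is returned unchanged.
unchanged = absolute_redirect(page, "http://www.w3.org/hypertext/WWW/NewLocation.html")

print(resolved)
print(unchanged)
```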
Thomas Lund wrote:
>
> Nope - didn't get the error when changing charset back to what I had (at
> some point). Problem is gone. Weird.
>
> Different question.
>
> If you go to the URL:
>
> http://intranet.enovasion.dk/cgi-bin/s.cgi?tmpl=/web/intranet.enovasion.dk/um.html
> I have put up a test for one of our customers. Try searching on the word
> "Danida". The 2 uppermost hits are documents that redirect the user over
> to some .asp sites with the real content. But the word danida is in the
> title.
>
> If you press the cached result link for one of them then the cached
> copies redirect the browser to a relative url that does not exists on
> the machine with the search frontend.
>
> Do you expand URL links in your cached copies, or do you just save the
> relative URL's? Maybe you missed the possibility to have redirects in
> the <head> section. The 2 redirects we have are build up like the file
> attached.
>
> It is possible for us to change all redirections to be absolute, but it
> would be nice to have this done automatically.
>
> /Thomas
>
> Kir Kolyshkin wrote:
>
> > About that stupid ' bug in searchd. Have you changed charsets or something
> > like that? If you done that, please send us files...maybe there's a bug there.
> >
> > BTW We recently added more charsets recently, if you want some, we'll
> > send it to you.
> >
> > -- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
> > There are two ways to write error-free programs; only the third one works.
> > (C) 1982, Alan J. Perlis
>
> ----------------------------------------------------------------------------------
> <html>
> <head>
> <title>Danida Publikationsliste</title>
> <!-- <meta http-equiv="REFRESH" content="0; url=/cgi-bin/dyn3nt/dyn3.exe?prog=show&pageid=127"> link til gammel dynamic visningsside -->
> <meta http-equiv="REFRESH" content="0; url=/aspfiles/ny_pub_oversigt.asp">
> </head>
> <body>
> </body>
> </html>
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
There are two ways to write error-free programs; only the third one works.
(C) 1982, Alan J. Perlis
|
|
| | Topic: [aseek-users] Re: [aseek]: Question About ASPSeek |
|---|
| [aseek-users] Re: [aseek]: Question About ASPSeek |
Thu, 01 February 2001 06:36 |
|
akmjsuber wrote:
>
> Can aspseek be used with Oracle instead MySql?
>
> Thanks
> Adam
Well, our 1.1.0 version will be able to use different SQL servers, not just
MySQL. Drivers for PostgreSQL and Oracle will be the first ones available.
If all goes smoothly, we expect that ASPSeek/Oracle will be available in a
month or two...
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
"Ok, the guy who made the netfilter Makefile was probably on some interesting
and probably illegal drugs when we wrote it." -- Linus Torvalds.
|
|
| | Topic: [aseek-users] Feature request |
|---|
| [aseek-users] Feature request |
Sat, 03 February 2001 12:55 |
Brett Pappas Messages: 8
Registered: October 2002
|
Junior Member |
|
|
How about an option to allow the server to index pages from the filesystem rather than through HTTP? For instance, there could be a config option that maps a URL to a directory, for example:
http://www.mydomain.com/ = /home/httpd/docs/index.html
This way the results can be stored under the proper URL even though all of the indexing took place without putting any strain on the web server...
It would still function the same way, by spidering the site instead of getting a listing of files (which might include files that the maintainer does not want to be seen). To build on this even further, it could be set up so that this works only for certain file types (like htm, html, txt...), and if it comes across a file type that isn't listed in that config option it falls back to retrieving it over HTTP instead. This would be useful in order to avoid reading a dynamic page such as shtml or php straight from disk, since it might include other files or database queries.
So the positive effect of this feature would be conserving CPU, since the web server does not get involved (and does not log unnecessary stats). It would also be flexible enough to still retrieve dynamic pages via the web as usual. This would also benefit my other goal of getting last-modified times from dynamic pages that otherwise don't include this in the HTTP header, since it can now just check the file system's last-modified time instead. Not to mention that this could be a good way of indexing pages or data that aren't served by a web server at all.
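To make the request concrete, here is a rough Python sketch of the lookup I have in mind. Nothing in it exists in ASPseek: the mapping table, the list of static suffixes, the function name and the "index.html" default are all hypothetical.

```python
from pathlib import PurePosixPath
from typing import Optional

# Hypothetical configuration: URL prefix -> document root, plus the
# file types that may be read straight from disk.
URL_ROOTS = {"http://www.mydomain.com/": "/home/httpd/docs/"}
STATIC_SUFFIXES = {".htm", ".html", ".txt"}

def local_path_for(url: str) -> Optional[str]:
    """Return a filesystem path for static pages, or None to fall back to HTTP."""
    for prefix, root in URL_ROOTS.items():
        if not url.startswith(prefix):
            continue
        rest = url[len(prefix):]
        if "?" in rest or "#" in rest:
            return None               # query strings mean dynamic content
        if rest == "" or rest.endswith("/"):
            rest += "index.html"      # directory URL -> its index page (assumed name)
        if PurePosixPath(rest).suffix.lower() not in STATIC_SUFFIXES:
            return None               # e.g. .php or .shtml: fetch over HTTP instead
        return root + rest
    return None                       # URL is outside every mapped prefix

print(local_path_for("http://www.mydomain.com/"))
print(local_path_for("http://www.mydomain.com/page.php"))
```

A real implementation would also have to reject paths containing ".." and take the last-modified time from the file, but the fallback rule is the main point.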
--Brett
|
|
| | Topic: [aseek-users] Failed to build ASPSeek 1.0.2 on Solaris |
|---|
| [aseek-users] Failed to build ASPSeek 1.0.2 on Solaris |
Wed, 07 February 2001 08:11 |
Ludger Merkens Messages: 7
Registered: October 2002
|
Junior Member |
|
|
Hello,
I today tried to build aspseek-1.0.2. on Solaris.
SunOS 5.6 Generic_105181-23 sun4u sparc
In daemon.cpp:
daemon.cpp: In function `void daemon()':
daemon.cpp:770: passing `socklen_t *' as argument 3 of `accept(int,
sockaddr *, int *)' changes signedness
gcc --version shows: 2.95.2
Compiling aspseek-1.0.1 on the same configuration gives
daemon.cpp: In function `void daemon()':
daemon.cpp:769: `socklen_t' undeclared (first use this function)
daemon.cpp:769: (Each undeclared identifier is reported only once
daemon.cpp:769: for each function it appears in.)
daemon.cpp:769: parse error before `='
daemon.cpp:770: `cadlen' undeclared (first use this function)
My next step was to try the cvs version, but I failed to login.
cvs [login aborted]: authorization failed: server www.asplinux.ru rejected
access
With user anoncvs and password anoncvs.
Greetings
Ludger Merkens
|
|
| | Topic: [aseek-users] Re: [aseek]: Failed to build ASPSeek 1.0.2 on Solaris |
|---|
| [aseek-users] Re: [aseek]: Failed to build ASPSeek 1.0.2 on Solaris |
Wed, 07 February 2001 09:36 |
|
Ludger Merkens wrote:
>
> On Wed, 7 Feb 2001, Kir Kolyshkin wrote:
>
> > Ludger Merkens wrote:
> > >
> > > Hello,
> > >
> > > I today tried to build aspseek-1.0.2. on Solaris.
> > > SunOS 5.6 Generic_105181-23 sun4u sparc
> > >
> > > In daemon.cpp:
> > > daemon.cpp: In function `void daemon()':
> > > daemon.cpp:770: passing `socklen_t *' as argument 3 of `accept(int,
> > > sockaddr *, int *)' changes signedness
> >
> > try to modify line 1280 in include/defines.h. Replace
>
> Whats that line number? In my file it is line 42 (no joke)
It was actually byte offset, sorry :)
> >
> > typedef unsigned int socklen_t;
> >
> > to
> >
> > typedef int socklen_t;
> >
> > Please report your success (or failure) with it to this list.
>
> Builds ok with this change. Further testings will follow.
OK, great, please keep us informed about how things are going.
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
"Maybe somebody should tell gcc maintainers about programmers
that know more than the compiler again." -- Linus Torvalds.
|
|
| | Topic: [aseek-users] Re: [aseek]: troubles compiling on Solaris... |
|---|
| [aseek-users] Re: [aseek]: troubles compiling on Solaris... |
Tue, 13 February 2001 04:46 |
|
On line 42 of include/defines.h, please replace
typedef unsigned int socklen_t;
with
typedef int socklen_t;
And don't forget to report your success/failure, so in case of
success I will fix it in the upcoming 1.0.3.
Thanx!
Benjamin Benson wrote:
>
> Kir,
>
> Using the newest release 1.0.2, I get a similar error:
>
> daemon.cpp: In function `void daemon()':
> daemon.cpp:770: passing `socklen_t *' as argument 3 of `accept(int,
> sockaddr *, int *)' changes signedness
> *** Error code 1
> make: Fatal error: Command failed for target `daemon.o'
> Current working directory /opt/src/aspseek-1.0.2/src
> *** Error code 1
>
> At 04:35 PM 2/5/01 +0300, you wrote:
> >I have fixed this in 1.0.2 sources (available from CVS, we are planning to
> >make a release tomorrow).
> >
> >Well, I doubt this will work on Solaris, even after compiling, but if you can
> >help with that, please do.
> >
> >Benjamin Benson wrote:
> > >
> > > I've had success with the first 10-15 minutes of compiling ASPSeek, but now
> > > it is abruptly breaking on deamon.cpp:
> > > I'm compiling on Sun/Solaris 2.6.
> > >
> > > [109 tcsh] root@bensun /opt/src/aspseek-1.0.1 -> make
> > > Making all in src
> > > c++ -DHAVE_CONFIG_H -I. -I. -I../include -I/usr/local/include
> > > -I/opt/mysql/include -g -O2 -D_REENTRANT -D_POSIX_PTHREAD_SEMANTICS
> > > -DUSE_SOLARIS -c daemon.cpp
> > > daemon.cpp: In function `void daemon()':
> > > daemon.cpp:769: `socklen_t' undeclared (first use this function)
> > > daemon.cpp:769: (Each undeclared identifier is reported only once
> > > daemon.cpp:769: for each function it appears in.)
> > > daemon.cpp:769: parse error before `='
> > > daemon.cpp:770: `cadlen' undeclared (first use this function)
> > > *** Error code 1
> > > make: Fatal error: Command failed for target `daemon.o'
> > > Current working directory /opt/src/aspseek-1.0.1/src
> > > *** Error code 1
> > > make: Fatal error: Command failed for target `all-recursive'
> > >
> > > : Benjamin Graham Benson
> > > : http://www.spiderline.com/ben/
> > >
> >
> >-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
> >"Maybe somebody should tell gcc maintainers about programmers
> > that know more than the compiler again." -- Linus Torvalds.
>
> : Benjamin Graham Benson
> : http://www.spiderline.com/ben/
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
"Maybe somebody should tell gcc maintainers about programmers
that know more than the compiler again." -- Linus Torvalds.
|
|
| | Topic: [aseek-users] Re: [aseek]: Cannot compile ASPSeek (resend; fixed) |
|---|
| [aseek-users] Re: [aseek]: Cannot compile ASPSeek (resend; fixed) |
Wed, 14 February 2001 13:55 |
|
"Christopher W. Curtis" wrote:
>
> Kir Kolyshkin wrote:
> >
> > Well, to compile CVS version, you need some developer's tools,
> > namely automake and autoconf. And, after cvs checkout, the
>
> > first thing you run should be ./cvsprep. It creates everything
> > needed, no need to hack something.
>
> Aha! I missed that part of INSTALL.
This is not in INSTALL. INSTALL is written with ordinary users in mind;
they use the distro tarball, where all the files are in place. But if you
take the sources from CVS, everything that is autogenerated is kept out of CVS,
so you have to regenerate it, and thus need to run ./cvsprep.
> I figured out how to create the
> MySQL tables (eventually) by myself though, I did have to remove the
> 'unique' keyword from 'index'. I guess my server is too old to
> understand that construct (and indexes are always unique anyway).
Well, if you have an old MySQL, please consider upgrading; we do not
guarantee that ASPSeek will work with old MySQL versions. Also,
you'll close some security holes, which is a Good Thing (TM).
-- |< [] [] |_ kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
"Maybe somebody should tell gcc maintainers about programmers
that know more than the compiler again." -- Linus Torvalds.
|
|
| | Topic: [aseek-users] Want to contribute |
|---|
| [aseek-users] Want to contribute |
Thu, 15 February 2001 12:05 |
S Mulyo Messages: 1
Registered: October 2002
|
Junior Member |
|
|
Dear Alexander F. Avdonkin, Kir Kolyshkin, Igor Sukhih,
I want to contribute to this great open source search engine. I think it's a great idea. I think it has similarities with Google, though probably the big differences are the database engine behind it and Google's clustering. I have been admiring Google and studying its research paper.
Unfortunately, I am just a Perl programmer and a beginner in C/C++. I have been experimenting with writing a crawler in Perl, as I know Google wrote its crawler in Python.
Maybe I can contribute by trying out Aspseek's crawler and IR. I think its IR needs to be improved a little bit: if you query 'www.aspseek.com.sg' on both www.aspseek.com.sg and Google, you will see what I mean.
I have also looked at and printed the source code, and it gives me a spinning headache. Maybe (it's just an idea), if we had something like a flow chart or a high-level description of Aspseek's algorithm, it would help potential contributors (programmers) contribute to the program's core more quickly. I think if the programming community receives this positively, AspSeek may be able to become the "super search" search engine, just like Linux (well, just a wish).
Anyway, thank you all for such generous contribution to the world of Open Source.
Sincerely,
Mulyo
|
|
| | Topic: [aseek-users] Re: UdmSearch: Output result? |
|---|
| [aseek-users] Re: UdmSearch: Output result? |
Thu, 15 February 2001 12:36 |
|
fabrice valere wrote:
>
> > Hello !
>
> I remember that you are an aspseek specialist!
> Could you help me?
> I installed it and it seems to be working (sometimes) =>
> search.larevet.com
> When I search for one word => OK, there is no problem,
> but when I try with two words such as
> martin john
> aspseek answers "Error in query:" and the daemon shuts down.
>
> Do you know this problem?
Yes, and I hope it's fixed in the 1.0.3 release, which we plan
to release early next week.
If you don't want to wait, the fix is simple: open
search.cpp in your favourite text editor and replace all
occurrences of "pos[s]" with "pos[s - 1]". There are four.
Then recompile searchd and reinstall it.
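For anyone who would rather script the replacement than do it by hand, here is a minimal Python sketch. The helper names (patch_text, patch_file) and the sample line are made up for illustration; only the file name search.cpp and the two strings come from the message above. It keeps a .orig backup and reports how many occurrences were changed, so you can check that the count is four.

```python
from pathlib import Path

OLD = "pos[s]"
NEW = "pos[s - 1]"


def patch_text(text: str) -> tuple[str, int]:
    """Return the patched text and the number of replacements made."""
    count = text.count(OLD)
    return text.replace(OLD, NEW), count


def patch_file(path: str) -> int:
    """Patch a file in place, keeping a .orig backup; return the count."""
    src = Path(path)
    # latin-1 maps every byte to one character, so the file round-trips unchanged.
    original = src.read_text(encoding="latin-1")
    patched, count = patch_text(original)
    if count:
        Path(path + ".orig").write_text(original, encoding="latin-1")
        src.write_text(patched, encoding="latin-1")
    return count


if __name__ == "__main__":
    # Hypothetical sample line, not taken from the real search.cpp.
    sample = "a = pos[s]; b = pos[s] + 1;"
    print(patch_text(sample))
```

To apply it for real, call patch_file("search.cpp") from the source directory and confirm the returned count before recompiling searchd.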
Alternatively, you can get the latest sources from anonymous
cvs. See http://www.sw.com.sg/products/aspseek/cvs/ for details.
After checking out, please run ./cvsprep, then do as usual.
Hope it will help, anyway, please report your success/failure!
Regards,
Kir.
PS You can subscribe to ASPSeek users mailing list, just
send mail to majordomo@lists.asplinux.ru, and put the line
subscribe aseek-users
in the body of the message.
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Join CCAUWM - Citizens' Campaign for Abolition of the Use
of the Word Microsoft (or of Microsoft Word - you choose)
|
|
| | Topic: [aseek-users] Excerpts |
|---|
| [aseek-users] Excerpts |
Thu, 01 March 2001 06:03 |
Valdas Andrulis Messages: 2
Registered: October 2002
|
Junior Member |
|
|
Hello,
I think that before doing anything with excerpts, especially displaying them
in results, all HTML tags should be removed; otherwise the result may be
a badly formed HTML document because of incorrect, unfinished tags.
Valdas
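The suggestion above can be sketched in a few lines of Python. This is only an illustration of the idea, not how ASPSeek builds excerpts: the function name strip_tags is hypothetical, and the approach (collect text with the standard html.parser module, then escape whatever is left) is an assumption about one reasonable way to do it.

```python
import html
from html.parser import HTMLParser


class _TextOnly(HTMLParser):
    """Collects only the text content and ignores all tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(fragment: str) -> str:
    """Return the fragment as plain text that is safe to embed in a results page."""
    parser = _TextOnly()
    parser.feed(fragment)
    parser.close()
    text = " ".join("".join(parser.parts).split())  # collapse whitespace
    # Escape what remains so a truncated tag can never reach the output as markup.
    return html.escape(text)


if __name__ == "__main__":
    print(strip_tags("<b>foo</b> <i>bar</i>"))
```

The final escaping step is what guarantees a well-formed result page even when the excerpt was cut in the middle of a tag.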
|
|
| | Topic: [aseek-users] upgrade |
|---|
| [aseek-users] upgrade |
Fri, 02 March 2001 11:02 |
fabrice valere Messages: 16
Registered: October 2002
|
Junior Member |
|
|
I am trying to upgrade aspseek,
but every time it tells me:
Can't open template file '/home/geneasearch/www/aspseek/etc//s.htm'!
Why is there a double slash in the path?
What is going on?
Please help me.
|
|
| | Topic: [aseek-users] Re: upgrade |
|---|
| [aseek-users] Re: upgrade |
Fri, 02 March 2001 11:20 |
|
fabrice valere wrote:
>
> I am trying to upgrade aspseek,
> but every time it tells me:
> Can't open template file '/home/geneasearch/www/aspseek/etc//s.htm'!
> Why is there a double slash in the path?
> What is going on?
> Please help me.
A double slash can't be the problem. Check the file permissions.
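The double slash is harmless because POSIX path resolution treats a run of slashes in the middle of a path like a single one. A small Python sketch of the check, assuming a Unix-like system; the function name diagnose is made up, and the path is the one from the error message:

```python
import os
import posixpath


def diagnose(path: str) -> str:
    """Say whether a template file is missing, unreadable, or fine."""
    normalized = posixpath.normpath(path)  # collapses the repeated slash
    if not os.path.exists(normalized):
        return f"{normalized}: does not exist"
    if not os.access(normalized, os.R_OK):
        return f"{normalized}: exists but is not readable by this user"
    return f"{normalized}: readable"


if __name__ == "__main__":
    print(diagnose("/home/geneasearch/www/aspseek/etc//s.htm"))
```

Run it as the same user the search CGI runs as, otherwise the permission check tells you nothing useful.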
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Join CCAUWM - Citizens' Campaign for Abolition of the Use
of the Word Microsoft (or of Microsoft Word - you choose)
|
|
| | Topic: [aseek-users] 1.0.3 problems |
|---|
| [aseek-users] 1.0.3 problems |
Mon, 05 March 2001 12:16 |
Daniell Freed Messages: 10
Registered: October 2002
|
Junior Member |
|
|
I just downloaded and compiled 1.0.3 and installed it to replace my
1.0.2 (after backing up 1.0.2 first). When I search now I get very
weird results. I get results, but the titles are always "No Title" and
the links are all broken or lead to more lists of "No Title" results.
Did the s.htm format change? I ask because I have customized my results
template quite a bit. I glanced through s.htm-dist and didn't notice
any differences, but it was only a quick glance.
Anyone have any thoughts?
--
Daniell Freed
Computer Services
Dewitt, Ross, & Stevens S.C.
He who fights with monsters might take care
lest he thereby become a monster.
And if you gaze for long into an abyss,
the abyss gazes also into you.
Beyond Good and Evil
Friedrich Wilhelm Nietzsche
|
|
| | Topic: [aseek-users] patch to try |
|---|
| [aseek-users] patch to try |
Tue, 06 March 2001 18:07 |
|
This patch should fix the problem of indexing part of long
tags such as
<A HREF=.... onmouseover="document.write('<BR>foo');">
Previously, index treated the closing bracket after BR as the
closing character of <A, so foo was indexed. The patch should
fix this problem.
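To illustrate what the bug is about, here is a minimal Python sketch of finding the end of a tag while skipping any '>' that sits inside a quoted attribute value. It is not the code from the patch (the patch is C++ in charsets.cpp and is not shown here); the function name find_tag_end is hypothetical, and the sketch assumes attribute values are properly quoted.

```python
def find_tag_end(html: str, start: int) -> int:
    """Return the index of the '>' that closes the tag opening at html[start], or -1."""
    quote = None  # the quote character we are currently inside, if any
    for i in range(start + 1, len(html)):
        ch = html[i]
        if quote:
            if ch == quote:
                quote = None  # leaving the quoted attribute value
        elif ch in "\"'":
            quote = ch  # entering a quoted attribute value
        elif ch == ">":
            return i  # first '>' outside quotes ends the tag
    return -1


if __name__ == "__main__":
    tag = "<A HREF=\"x\" onmouseover=\"document.write('<BR>foo');\">link text"
    end = find_tag_end(tag, 0)
    print(tag[end + 1:])
```

The extra per-character state is the kind of work that could slow indexing a little, which fits the request for speed measurements.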
Can somebody test it and report success/failure? It may
decrease indexing speed a little, so if you test the speed
difference I would very much appreciate it.
--kir.
-------- Original Message --------
From: Kir Kolyshkin <kir@asplinux.ru>
Subject: Re: http://intranet.enovasion.dk/cgi-bin/s.cgi
To: clausanne@email.dk
Kir Kolyshkin wrote:
> As for index bug (showing onmouseout) it is caused by symbols
> <> inside <AREA... tag.
Please test the attached patch - it should fix your "onmouseout"
problem. To apply, copy the file to your aspseek directory with
cpp files (aspseek-1.0.3/src/), then do
patch -p0 < charsets.diff
make
make install
Try to clear and reindex everything, it should work. Please report
success or failure. This will be included in 1.0.4 if it works.
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Answers: $1, short: $5, correct: $25, dumb questions are still free.
|
|
| | Topic: SV: [aseek-users] patch to try |
|---|
| SV: [aseek-users] patch to try |
Wed, 07 March 2001 02:46 |
Claus Jul Larsen Messages: 63
Registered: October 2002
|
Member |
|
|
Hi Kir!
It's really great. I haven't tried it yet because my server administrator is not
here now.
By the way, I mentioned the problem with HTML entities. It's really
important that this bug is fixed, because Danish HTML pages contain a lot of
HTML entities, e.g. &aring; for å, &aelig; for æ, &oslash; for ø.
Also, the HTML entities must not be altered by the indexing process.
Yes, I will add the "Powered by Aspseek" GIF. The page you've seen is a
test page.
Thank you
Claus Jul Larsen
|
|
| | Topic: SV: SV: SV: [aseek-users] Comparing ASPSeek to mnoGoSearch |
|---|
| SV: SV: SV: [aseek-users] Comparing ASPSeek to mnoGoSearch |
Wed, 07 March 2001 09:36 |
Claus Jul Larsen Messages: 63
Registered: October 2002
|
Member |
|
|
Try this URL:
http://search.enovasion.dk/cgi-bin/s.cgi?tmpl=%2Fweb%2Fsearch.enovasion.dk%2Fum1.html&gr=off&ul=http%3A%2F%2Fwww.um.dk%2F&q=politik
Result no 9+13+47+50
No 9 - ing;
No 13 - ?146; - must be ’
No 50 - fuml;r - problem with HTML-entities for
I use the ISO-8859-1 charset.
It's not only the Danish characters; there is trouble with German too. See the
mail from Thomas Walter.
Claus
-----Original message-----
From: Kir Kolyshkin [mailto:kir@asplinux.ru]
Sent: 7 March 2001 15:06
To: aseek-users@lists.asplinux.ru
Subject: Re: SV: SV: [aseek-users] Comparing ASPSeek to mnoGoSearch
Please send a couple of URLs that contain these codes, and tell us
which charset you use for them.
Claus Jul Larsen wrote:
>
> Hi Kir!
>
> > It sounds very good! Can you tell me the release date for version 1.1?
> > I have a lot of trouble with Danish characters, so Unicode could help
> > with this problem :-)
>
> >What are your problems? Please describe...
>
> My problem is the HTML-entities:
>
> E.g. for the Danish characters I use the codes:
>
> &oslash; for ø
> &aring; for å
>
> A search result results in
>
> ...Udenrigsministeriet DANIDAS HUMANITRE POLITIK DANIDAS HUMANITlig;RE
> POLITIK
> ^^^^^
>
> Must be
> Æ
>
> Danida sash;gte i 1994 at leve op til behovet for nytnkning for bing;de
> ^^^^^^
> Must be
> å
>
> I hope you understand the problem. I can see that aspseek wants to change the
> HTML entities to normal characters, right?
>
> Claus
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Answers: $1, short: $5, correct: $25, dumb questions are still free.
|
|
| | Topic: Re: SV: SV: [aseek-users] Comparing ASPSeek to mnoGoSearch |
|---|
| Re: SV: SV: [aseek-users] Comparing ASPSeek to mnoGoSearch |
Sun, 11 March 2001 15:20 |
|
Aha, it seems that your server doesn't provide the correct charset to
ASPSeek's index, so you need to upgrade your Apache (the latest 1.3.x
puts charset=iso-8859-1 by default, as it is required by HTTP 1.1).
Then your problem should be fixed after a complete reindexing...
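To check what charset a server actually announces, you can look at the Content-Type header it returns. Below is a minimal Python sketch that extracts the charset from a header value and falls back to iso-8859-1 when none is given. The function name and the fallback choice are assumptions for illustration; this is not how index itself parses headers.

```python
from email.message import Message


def charset_from_content_type(header_value: str, default: str = "iso-8859-1") -> str:
    """Return the charset named in a Content-Type header, or the default."""
    msg = Message()
    msg["Content-Type"] = header_value
    # get_content_charset() returns the lowercased charset, or None if absent.
    return msg.get_content_charset() or default


if __name__ == "__main__":
    print(charset_from_content_type("text/html; charset=windows-1250"))
    print(charset_from_content_type("text/html"))
```

If the second case is what your server sends, the indexer has to guess, which is where wrong characters in the results can come from.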
Thomas 'Balu' Walter wrote:
>
> +-Kir Kolyshkin-(kir@asplinux.ru)-[07.03.01 15:48]:
> > Thomas 'Balu' Walter wrote:
> > >
> > > +-Claus Jul Larsen-(cjlarsen@enovasion.dk)-[07.03.01 14:57]:
> > > [and others: HTML-Entities]
> > >
> > > Searching at http://www.jugendpolitik.net -> "Suchen" (german! search
> > > for "jugend") gives results were are written capital () - or
> > > htmlish: not ü but Ü
> >
> > Do you mean that all characters marked with blue underline in the
> > attached picture should be lowercase, not uppercase?
>
> Exactly.
>
> Balu
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Answers: $1, short: $5, correct: $25, dumb questions are still free.
|
|
| | Topic: SV: SV: [aseek-users] patch to try |
|---|
| SV: SV: [aseek-users] patch to try |
Mon, 12 March 2001 09:23 |
Claus Jul Larsen Messages: 63
Registered: October 2002
|
Member |
|
|
It doesn't help: $ cvs checkout -r v_1_1 aspseek ... still no doc/unicode.txt
:-(
BTW: Does aspseek support frames?
Claus
-----Original message-----
From: Alexander F. Avdonkin [mailto:al@sw.com.sg]
Sent: 12 March 2001 15:01
To: aseek-users@lists.asplinux.ru
Subject: Re: SV: [aseek-users] patch to try
We added it several hours ago.
Try checking out again and look for it in the "doc" directory.
----- Original Message -----
From: "Claus Jul Larsen" <cjlarsen@enovasion.dk>
To: <aseek-users@lists.asplinux.ru>
Sent: Monday, March 12, 2001 9:52 PM
Subject: SV: SV: [aseek-users] patch to try
> When I check out from CVS with: cvs -z9 co -r v_1_1 aspseek, unicode.txt
> doesn't come?
>
> -----Original message-----
> From: Alexander F. Avdonkin [mailto:al@sw.com.sg]
> Sent: 12 March 2001 14:23
> To: aseek-users@lists.asplinux.ru
> Subject: Re: SV: [aseek-users] patch to try
>
>
> It seems that you have a problem with HTML entities like &oslash;.
> They will be handled correctly in version 1.1 if configured
> with --enable-unicode.
> We've just put docs about Unicode support into the file doc/unicode.txt in CVS.
>
> Alexander.
>
> ----- Original Message -----
> From: "Claus Jul Larsen" <cjlarsen@enovasion.dk>
> To: <aseek-users@lists.asplinux.ru>
> Sent: Monday, March 12, 2001 9:17 PM
> Subject: SV: SV: [aseek-users] patch to try
>
>
> > I've downloaded the unreleased aspseek 1.1 from CVS, but nothing helps. I
> > can see the patch in charsets.cpp, but it does not help with my problem .. :-(
> >
> > The trouble is in the HTML-code:
> >
> > <map name="navigation">
> > <area href="/index.html" coords="1,18,33,41" shape="rect"
> > onmouseover="dcs('Tilbage til forsiden'); return true;"
> > onmouseout="nd(); return true;">
> > <area href="/dk/soeg/index.html" coords="41,18,67,42" shape="rect"
> > onmouseover="dcs('Søge i Experimentariums databaser'); return true;"
> > onmouseout="nd(); return true;">
> > <area href="/dk/udstillinger/index.html" coords="71,18,138,42"
> > shape="rect">
> > <area href="/dk/praktiske_oplysninger/index.html" coords="144,18,267,42"
> > shape="rect" onmouseover="dcs('Åbningstider<BR>Priser <Br>Hvor finder
> > jeg Experimentarium?'); return true;" onmouseout="nd(); return true;">
> > ...
> > </map>
> >
> > The trouble is the <BR> inside the area tag ...
> >
> >
> >
> > Claus
> >
> > -----Original message-----
> > From: Kir Kolyshkin [mailto:kir@asplinux.ru]
> > Sent: 10 March 2001 12:49
> > To: aseek-users@lists.asplinux.ru
> > Subject: Re: SV: [aseek-users] patch to try
> >
> >
> > Hmm...maybe it depends on some other patch which is not in 1.0.3
> > Anyway, please grab the latest 1.0 sources from CVS and test it.
> > Instructions are at http://www.aspseek.org/cvs.html
> >
> >
> > Claus Jul Larsen wrote:
> > >
> > > Hi Kir
> > >
> > > We tried to install the patch but it doesn't work:
> > >
> > > [root@wartburg src]# patch -p0 < charsets.diff
> > > patching file `charsets.cpp'
> > > Hunk #1 FAILED at 641.
> > > 1 out of 1 hunk FAILED -- saving rejects to charsets.cpp.rej
> > >
> > > Why?
> > >
> > > Claus
> > >
> > > -----Original message-----
> > > From: Kir Kolyshkin [mailto:kir@asplinux.ru]
> > > Sent: 7 March 2001 00:11
> > > To: aseek-users@lists.asplinux.ru
> > > Subject: [aseek-users] patch to try
> > >
> > > This patch should fix the problem of indexing part of long
> > > tags such as
> > > <A HREF=.... onmouseover="document.write('<BR>foo');">
> > >
> > > Earlier index considered closing bracket after BR to be
> > > closing char for <A, so foo was indexed. Patch should
> > > fix this problem.
> > >
> > > Can somebody test it and report success/failure? It may
> > > decrease indexing speed a little, so if you test the speed
> > > difference I would very much appreciate it.
> > >
> > > --kir.
> > >
> > > -------- Original Message --------
> > > From: Kir Kolyshkin <kir@asplinux.ru>
> > > Subject: Re: http://intranet.enovasion.dk/cgi-bin/s.cgi
> > > To: clausanne@email.dk
> > >
> > > Kir Kolyshkin wrote:
> > > > As for index bug (showing onmouseout) it is caused by symbols
> > > > <> inside <AREA... tag.
> > >
> > > Please test the attached patch - it should fix your "onmouseout"
> > > problem. To apply, copy the file to your aspseek directory with
> > > cpp files (aspseek-1.0.3/src/), then do
> > >
> > > patch -p0 < charsets.diff
> > > make
> > > make install
> > >
> > > Try to clear and reindex everything, it should work. Please report
> > > success or failure. This will be included in 1.0.4 if it works.
> > >
> > > -- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
> > > Answers: $1, short: $5, correct: $25, dumb questions are still free.
> >
> > -- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
> > Answers: $1, short: $5, correct: $25, dumb questions are still free.
|
|
| | Topic: [aseek-users] SV: [aseek-users] aspseek 1.0.4 is available |
|---|
| [aseek-users] SV: [aseek-users] aspseek 1.0.4 is available |
Tue, 13 March 2001 08:25 |
Claus Jul Larsen Messages: 63
Registered: October 2002
|
Member |
|
|
Hi
Great with 1.0.4 - the problem with the '>' is OK now!
There is another problem, which is very strange.
Here is a bit of the search result output by aspseek:
Bø;rnenes vandpyt. Der findes desuden et 850 m2 særudstillingsområde.
Endvidere forefindes caféeacute;, butik, møde...
...Tuborg Havn, 5 km nord for København og besøges årligt af ca 360.000
gæster. 147;BørnePavillonen” - et småbøash;rnsområde Experimentariums nye
børneområde henvender sig til de 3-6 åg;rige piger... ...
The real content of the HTML page at
http://www.experimentarium.dk/dk/pressecenter/fact-sheet/index.html is:
Børnenes vandpyt. Der findes desuden et 850 m2
særudstillingsområde. Endvidere forefindes café, butik,
møde
It's the HTML-entities which are converted wrongly. Bø;rnenes vandpyt
(aspseek) -> Børnenes vandpyt (not aspseek)
I've observed that the HTML entities are converted correctly in some places but
not in others.
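For comparison, this is what a correct conversion of such entities looks like, as a minimal Python sketch using the standard html module. The entity names in the sample strings (&oslash;, &eacute;) are an assumption about what the original page contains, since the archive shows them already rendered; the function name decode_entities is made up.

```python
import html


def decode_entities(text: str) -> str:
    """Replace HTML character references with the characters they name."""
    return html.unescape(text)


if __name__ == "__main__":
    print(decode_entities("B&oslash;rnenes vandpyt"))
    print(decode_entities("caf&eacute;, butik, m&oslash;de"))
```

Output like "Bø;rnenes" would suggest that only part of an entity is consumed during conversion, leaving the trailing ';' or letters behind.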
?????
Claus
-----Original message-----
From: Kir Kolyshkin [mailto:kir@asplinux.ru]
Sent: 13 March 2001 13:43
To: aseek-users@lists.asplinux.ru; aseek@sw.com.sg
Subject: [aseek-users] aspseek 1.0.4 is available
Hi,
We have just released ASPSeek v.1.0.4. Changes are:
Fixed bug with robots.txt
Fixed processing links with apostrophes
Small fixes to INSTALL, searchd.conf, s.htm, Makefiles
Searchd binds to loopback if access is allowed only from localhost
Fixed HTTP basic authorization support
Added s.cgi ability to accept several ul= arguments
Fixed processing tags with symbol '>' inside
Fixed potential buffer overflows in s.cgi
Fixed escaping some SQL queries
Fixed ranks reloading
Fixed rare searchd coredump when sorting results by date
s.cgi now prints charset in Content-Type header (not in META)
In case of SQL query error bad query is printed
This version fixes some security holes in s.cgi, so it would
be a very wise decision to upgrade immediately.
Enjoy,
Kir.
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Answers: $1, short: $5, correct: $25, dumb questions are still free.
|
|
| | Topic: [aseek-users] codepages |
|---|
| [aseek-users] codepages |
Tue, 13 March 2001 11:20 |
Jan Karabina Messages: 15
Registered: October 2002
|
Junior Member |
|
|
Hi
I need to add two additional charsets to aspseek,
but I don't understand the format of the charset files.
The manual says: characters are in alphabetical order, capital letters first,
but which characters - national or standard?
How does aseek recode character sets?
I am sending (as an attachment) a description of the Czech search engine sherlock
(www.sherlock.cz; it's down now :-( ).
You can download the full source code (under GPL) and documentation from
http://artax.karlin.mff.cuni.cz/~karaj0bm/
sherlock-1.0 is the original version
sherlock-1.0.1 is a version in which I removed some bugs and added some docs
The original author has not maintained sherlock for a year... :(
sherlock uses a great codepage guessing algorithm.
The aspseek authors could take inspiration from it... :-)
PS: sorry for my CzEnglish :)
--
Jan Karabina mailto:karabina@penguin.cz (private: karabij@feld.cvut.cz)
|
|
| | Topic: [aseek-users] small bug |
|---|
| [aseek-users] small bug |
Tue, 13 March 2001 11:21 |
Jan Karabina Messages: 15
Registered: October 2002
|
Junior Member |
|
|
Hi all
when you click on Cached document, aspseek doesn't recode the character set of the
document (the document is displayed in LocalCharset, as it is stored in the
database).
For example: I have LocalCharset=iso88592
and the document is in the windows1250 charset.
When I click on the cached document I get:
<base href=...>
<meta httpequiv=... charset=iso88592> at the second line of document,
then
<p>this is aspseek cache of document...blablabla"
<html> here starts cached document
<head>
<meta httpequiv=... charset=windows-1250>
</head>
<body>
<p>some text of document BUT in charset iso88592, in LocalCharset, like
is stored in database...
</body>
</html>
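The mismatch described above can be reproduced in a few lines of Python. This is only a sketch of the effect, assuming the cached bytes are stored in iso-8859-2 while the embedded meta tag still says windows-1250; the function name recode and the sample character are chosen for illustration.

```python
def recode(data: bytes, source: str, target: str) -> bytes:
    """Convert bytes from one single-byte charset to another."""
    return data.decode(source).encode(target)


if __name__ == "__main__":
    original = "š".encode("cp1250")                   # page as served (windows-1250)
    stored = recode(original, "cp1250", "iso8859_2")  # what ends up in the database

    # A browser that trusts the stale windows-1250 declaration decodes wrongly:
    mislabeled = stored.decode("cp1250", errors="replace")
    # Decoding with the charset the bytes are really in gives the right text:
    correct = stored.decode("iso8859_2")
    print(repr(mislabeled), repr(correct))
```

So the cached page needs one of two things: recode the stored text back to the charset named in its own meta tag, or make the served charset declaration match LocalCharset.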
--
Jan Karabina mailto:karabina@penguin.cz
|
|
| | Topic: SV: [aseek-users] Problem with the search |
|---|
| SV: [aseek-users] Problem with the search |
Wed, 14 March 2001 03:48 |
Claus Jul Larsen Messages: 63
Registered: October 2002
|
Member |
|
|
The URL is:
http://search.enovasion.dk/cgi-bin/shell.pl?front=1&tmpl=um.html
I've written a shell.pl wrapper because otherwise the absolute path is shown in the URL.
That is a security flaw.
Claus
-----Original message-----
From: Alexander F. Avdonkin [mailto:al@sw.com.sg]
Sent: 14 March 2001 09:35
To: aseek-users@lists.asplinux.ru
Subject: Re: [aseek-users] Problem with the search
Could you provide a URL where I can try to reproduce this?
Alexander.
----- Original Message -----
From: "Claus Jul Larsen" <cjlarsen@enovasion.dk>
To: <aseek-users@lists.asplinux.ru>
Sent: Wednesday, March 14, 2001 4:21 PM
Subject: [aseek-users] Problem with the search
> Hey
>
> I've observed a new problem. When I search the search base, which contains the
> exact word 'Lykketoft', with the string 'lykketoft', no result is output.
> When I search with the wildcard 'lykketoft*', the search word is found.
> Why?
>
>
> Best regards,
>
> Claus Jul Larsen
> System Developer
>
> _______________________________________________________________
>
> e|novasion
> Store Kongensgade 23A
> 1264 København K
>
> Text telephone: 77 31 20 10 (first call 70 11 44 11 and ask
> the operator for 77 31 20 10)
> Fax: 77 31 19 50
> E-mail: cjlarsen@enovasion.dk
> Web: www.enovasion.dk
> _______________________________________________________________
|
|
| | Topic: SV: [aseek-users] Problem with the search |
|---|
| SV: [aseek-users] Problem with the search |
Wed, 14 March 2001 09:15 |
Claus Jul Larsen Messages: 63
Registered: October 2002
|
Member |
|
|
Hi Alexander
The problem is fixed in 1.0.4. :-)
Claus
|
|
| | Topic: [aseek-users] aspseek-1.1.0 devel released |
|---|
| [aseek-users] aspseek-1.1.0 devel released |
Thu, 15 March 2001 10:15 |
|
We would like to announce that the first version of ASPSeek from the development
branch was released today. Please note that this version is not stable,
so we don't recommend it for use on production servers.
Changes since 1.0 include:
+ UNICODE support (now you can deal with many charsets at once; it should
also fix problems with some characters (like umlauts) in 1.0). Please note
that two-byte characters are still unsupported, and the database format for the
UNICODE version is incompatible with 1.0. Read doc/unicode.txt for details.
+ New pluggable database driver architecture. That basically means that ASPSeek
can be ported relatively easily to other RDBMSes, and that the database driver
is now dynamically loaded at startup. We also added a driver for Oracle 8i.
+ External converters support. If you wanted to index PDF files but didn't know
how, then this is for you. See doc/converters.txt for details.
Otherwise this version is like 1.0.4; that is, all changes from 1.0 were ported
here. We have also fixed some bugs found in 1.0.4.
For now, the two versions will co-exist for as long as 1.1 is "unstable", so
there will be more 1.0 releases with bugfixes etc. We will drop the 1.0 branch
after releasing 1.2.0 stable (don't ask me about the date!)
Download and enjoy, and don't forget to report your success/problems with 1.1.
-- kir@asplinux.ru http://kir.sever.net ICQ 7551596 --
Answers: $1, short: $5, correct: $25, dumb questions are still free.
|
|
|