Image 01
profile-image

maps4711

Matthias Grimrath
Various KDE 1.-4. Improvements
rewritten webarchiver plugin

Various KDE 1.-4. Improvements 17 comments

Score 50.0%
Feb 25 2008
It is already saved although a bit hidden. Open the archived page in Konqueror and press Ctrl-U (or select View->Show Source from the Menu). The original URL is saved inside a HTML comment at the top.

About Meta-information: Sounds interesting, but I have not yet looked at how it is handled by KDE. - Nov 22 2007
You are talking about other the way round: A certain file type (for example a Word Document)
has one or more file extensions (*.doc; *.DOC)

The problem here is that two different file types (KDE web archive, Java archives) share
the same extension (*.war).

But your post made me look at it again. In the mime type property dialog it is possible to
add another application to a filetype and give it higher priority over the default one.
In this case it means adding the Java application server to the list of applications that
handles files of type web archive.

A drawback is that KDE will always default-open .war files with the Java server regardless
of whether the file is a web or Java archive. - Mar 22 2007
If .war gets renamed then I guess all users with their existing .war file
collection would start revolting.

I don't know if there is way to tell KDE that the same file extension refers
to two different file types. - Jun 22 2006
* I rarely use firefox ;-)
* good idea, but no time, sorry. - Jun 01 2006
.war is plain .tar.gz format in disguise, you can extract it with e.g.

$ mkdir webpage && cd webpage
$ tar -xzf ../webpage-archive.war
$ <open index.html with your browser> - Jun 01 2006
There are two problems:

1) Images or other things loaded by Javascript, Java or plugins can change
unpredictably everytime the webpage is viewed. For example, an embedded java
script may load a different image on the first of each month. Therefore, the
webarchiver is not able to know beforehand what may lurk inside a java
script block.

2) The design of the new (and AFAIK old) webarchiver is to be able to make
a snapshot of the current webpage only (as far as it is possible). What are
you looking for is a tool that also downloads hyperlinked pages and images.

It is possible but a time-consuming task to add that to the webarchiver and,
frankly, I don't want to add such bloat, because there are already tools
out there that do this job. I suggest you use 'wget' or a similiar program
that can download webpages recursively. - May 31 2006
I have been using this patches privately for about a year because I really wanted them.
So chances are good I will support them in future KDE versions :-) - Dec 16 2005
I did so a few times, but were constantly hitting a KDE feature freeze.
Someone pointed me to kde-apps.org to get it out so people can test it.
In the time between I was too busy / lazy doing other things (working
for example :-)

But anyway, this is good idea.

http://bugs.kde.org/show_bug.cgi?id=98695
http://bugs.kde.org/show_bug.cgi?id=118475 - Dec 16 2005
improved offline web browsing

Various KDE 1.-4. Improvements 10 comments

Score 50.0%
Feb 25 2008
Sorry for the late reply, I haven't kept an eye on kde-apps.org.

Someone already pointed out that bug to me. I have just uploaded a
(hopefully) corrected version. - May 08 2007
Partially confirmed. Clicking on a link did indeed send the referrer
when it should not.

Pictures and other embedded objects were correctly filtered out, at least here.

> but at the first glance, metaData("referrer") works.

Just for clarification: The SendReferrerAcrossDomains option does not
turn off sending of referrer-URLs completely! Use SendReferrer for
that. It sends the referrer URL as long as the request goes to the domain
of the top-level webpage.

For example, a webpage on 'www.whatwhereever.com/webpage.html'
that has 'img src=www.whatwhereever.com/images/space.gif' requests 'space.gif'
together with a referrer URL. Another 'img src=www.adannoyer.com/banner.gif'
does not get a referrer URL.

The same procedure applies when clicking on links. Clicking on links
in google page results should not send a referrer URL.

If you observe different behaviour report a bug. - Jun 29 2006
didn't find that update button first. *ouch* I hope no-one got confused. - May 30 2006
Thanks. I made the changes about a year ago but haven't come around to publish them. - Dec 16 2005
I did so a few times, but were constantly hitting a KDE feature freeze.
Someone pointed me to kde-apps.org to get it out so people can test it.
In the time between I was too busy / lazy doing other things (working
for example :-)

But anyway, this is good idea.

http://bugs.kde.org/show_bug.cgi?id=98695
http://bugs.kde.org/show_bug.cgi?id=118475 - Dec 16 2005