htmldata 1.1.1

This is a bug-fix release of htmldata. It fixes an error when parsing whitespace inside tags.

Description of the htmldata module, from its PyPI site:

  http://cheeseshop.python.org/pypi/htmldata/

"Extract and modify HTML/CSS URLs, translate HTML documents <-> list data structures.

The htmldata module allows one to translate HTML documents back and forth to list data structures.
This allows for programmatic reading and writing of HTML documents, with much flexibility.

Functions are also available for extracting and/or modifying all URLs present in the HTML or
stylesheets of a document.

I have found this library useful for writing robots, for "wrapping" all of the URLs on websites
inside my own proxy CGI script, for filtering HTML, and for doing flexible wget-like mirroring.

It keeps things as simple as possible, so it should be easy to learn.

Supports XHTML, too."
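
As a rough illustration of the URL extraction described above, here is a minimal sketch that uses only Python's standard-library html.parser. It is not htmldata's API: the names URL_ATTRS, LinkCollector and extract_urls are hypothetical, and the set of attributes treated as URL-bearing is an assumption chosen for the example. URLs inside stylesheets are not handled.

```python
from html.parser import HTMLParser

# Assumption for this sketch: attributes that commonly carry URLs.
# This set is illustrative, not exhaustive.
URL_ATTRS = {"href", "src", "action"}


class LinkCollector(HTMLParser):
    """Hypothetical helper that collects URL-bearing attribute values."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        for name, value in attrs:
            if name in URL_ATTRS and value:
                self.urls.append(value)


def extract_urls(doc):
    """Return the URLs found in the tags of an HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(doc)
    parser.close()
    return parser.urls
```

Calling extract_urls on a document string returns a plain list, which can then be filtered or rewritten before the URLs are put back into the page.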

Connelly Barnes
http://www.connellybarnes.com/


Connelly
10/26/2007 9:14:48 PM
comp.lang.python.announce
