feeds2disk

feeds2disk [options] ARG

feeds2disk parses an online source of articles, like an RSS or ATOM feed and fetches the article contents organized in a nice hierarchy.

ARG can be one of:

file name - feeds2disk will try to load a recipe from the file

builtin recipe title - feeds2disk will load the builtin recipe and use it to fetch the feed. For e.g. Newsweek or “The BBC” or “The New York Times”

recipe as a string - feeds2disk will load the recipe directly from the string arg.

Available builtin recipes are: ‘Ars Technica’, ‘Associated Press’, ‘The Atlantic’, u’B92’, u’The BBC’, “Barron’s”, ‘Business Week’, ‘CNN’, ‘Christian Science Monitor’, u’Daily Telegraph’, ‘Die Zeit Nachrichten’, u’Discover Magazine’, u’EL PAIS’, ‘ESPN’, ‘The Economist’, ‘FAZ NET’, ‘Globe and Mail’, ‘Google Reader’, u’The Guardian’, u’The International Herald Tribune’, u’The Irish Times’, ‘Jerusalem Post’, ‘Jutarnji’, u’La Nacion’, u’The Los Angeles Times’, u’The Moscow Times’, ‘NASA’, u’New Scientist’, u’New York Review of Books’, ‘The New York Times’, ‘The New Yorker’, ‘Newsweek’, ‘Outlook India’, u’Politika Online’, ‘Portfolio’, ‘Reuters’, u’Scientific American’, ‘Spiegel Online’, ‘Sydney Morning Herald’, ‘USA Today’, ‘United Press International’, ‘The Wall Street Journal’, ‘Washington Post’, ‘Wired.com’

Whenever you pass arguments to feeds2disk that have spaces in them, enclose the arguments in quotation marks.

[options]

--version

show program’s version number and exit

--help, -h

show this help message and exit

--feeds

Specify a list of feeds to download. For example: “[‘http://feeds.newsweek.com/newsweek/TopNews‘, ‘http://feeds.newsweek.com/headlines/politics‘]” If you specify this option, any argument to %prog is ignored and a default recipe is used to download the feeds.

--verbose

Be more verbose while processing.

--title

The title for this recipe. Used as the title for any ebooks created from the downloaded feeds.

--username

Username for sites that require a login to access content.

--password

Password for sites that require a login to access content.

--lrf

Optimize fetching for subsequent conversion to LRF.

--recursions

Number of levels of links to follow on webpages that are linked to from feeds. Defaul 0

--output-dir

The directory in which to store the downloaded feeds. Defaults to the current directory.

--no-progress-bar

Dont show the progress bar

--debug

Very verbose output, useful for debugging.

--test

Useful for recipe development. Forces max_articles_per_feed to 2 and downloads at most 2 feeds.

WEB2DISK OPTIONS

Options to control web2disk (used to fetch websites linked from feeds)

--timeout, -t

Timeout in seconds to wait for a response from the server. Default: 10.0 s

--delay

Minimum interval in seconds between consecutive fetches. Default is 0 s

--encoding

The character encoding for the websites you are trying to download. The default is to try and guess the encoding.

--match-regexp

Only links that match this regular expression will be followed. This option can be specified multiple times, in which case as long as a link matches any one regexp, it will be followed. By default all links are followed.

--filter-regexp

Any link that matches this regular expression will be ignored. This option can be specified multiple times, in which case as long as any regexp matches a link, it will be ignored.By default, no links are ignored. If both –filter-regexp and –match-regexp are specified, then –filter-regexp is applied first.

--dont-download-stylesheets

Do not download CSS stylesheets.

Table Of Contents

Previous topic

fb22lrf

Next topic

feeds2epub

Quick search