Shell script to get PIDs from schedules

Sharon Kimble boudiccas at skimble.plus.com
Sun Nov 2 10:47:35 PST 2014


Charles Johnson <cehjohnson at gmail.com> writes:

> On 02/11/14 08:52, Chris Allison wrote:
>> Peter,
>>
>> some good ideas there, but there is no need to scrape the web pages
>> when all the schedule info you could possibly need is available in
>> xml, json and yaml files at urls of this form:
>>
>> www.bbc.co.uk/radio4/programmes/schedules/fm/this_week.json
>> www.bbc.co.uk/radio4extra/programmes/schedules/2014/11/1.json
>> www.bbc.co.uk/bbcfour/programmes/schedules/last_week.json
>>
>> etc.
> Thanks for that Chris. Have been excited enough by that first link into
> experimenting with the json parsing utility called 'jq'.
>
> A pipeline like the following will produce all the titles, pids and
> synopses:
>
> wget -O - \
>   http://www.bbc.co.uk/radio4/programmes/schedules/fm/this_week.json |
>   jq '.[] | .[] | .[] | .[] | .programme as $P |
>     $P.display_titles.title,$P.short_synopsis,$P.pid'
>
> So, just a 6-line tail with
>
> wget -q -O - \
>   http://www.bbc.co.uk/radio4/programmes/schedules/fm/this_week.json |
>   jq '.[] | .[] | .[] | .[] | .programme as $P |
>     $P.display_titles.title,$P.short_synopsis,$P.pid' | tail -n 6
>
> will get you the following:
>
> ============
> "The Film Programme"
> "Director Mike Leigh discusses art and movie-making in his latest film
> Mr Turner."
> "b04mgxtq"
> "Something Understood"
> "Mark Tully debates the cultural benefits of classical music with
> composer James MacMillan."
> "b04n2fmh"
> ============
>
Thanks for this Charles. With your last command

--8<---------------cut here---------------start------------->8---
wget -q -O - http://www.bbc.co.uk/radio4/programmes/schedules/fm/this_week.json | jq '.[] | .[] | .[] | .[] | .programme as $P | $P.display_titles.title,$P.short_synopsis,$P.pid' | tail -n 6

--8<---------------cut here---------------end--------------->8---

It is failing for me, saying:

╭────
│parse error: Invalid numeric literal at line 1, column 10
╰────

Presumably it's referring to "-O", but what should that be, please, to
get it working properly?
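
As a rough way of narrowing this down, and assuming the message comes
from jq rather than from wget (jq reports this kind of parse error when
its input is not JSON), a sketch like the following would check whether
the downloaded data looks like JSON at all before it is piped on.
`looks_like_json` and `schedule.json` are made-up names for
illustration, not part of either tool.

```shell
#!/bin/sh
# Sketch only: a crude test of whether a saved response looks like JSON.
# looks_like_json is a hypothetical helper, not part of wget or jq.
looks_like_json() {
    # Take the first non-whitespace character from the start of file $1.
    first=$(head -c 200 "$1" | tr -d ' \t\r\n' | cut -c1)
    case "$first" in
        '{'|'[') return 0 ;;   # JSON object or array
        *)       return 1 ;;   # e.g. an HTML page starting with '<'
    esac
}

# Possible usage (needs network access and jq, so left commented out):
#   wget -q -O schedule.json \
#     http://www.bbc.co.uk/radio4/programmes/schedules/fm/this_week.json
#   if looks_like_json schedule.json; then
#       jq '.[] | .[] | .[] | .[] | .programme as $P |
#         $P.display_titles.title,$P.short_synopsis,$P.pid' schedule.json |
#         tail -n 6
#   else
#       head -c 200 schedule.json   # show what was actually received
#   fi
```

If the check fails, the first few lines of the saved file should show
what the server actually sent.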

Thanks
Sharon.
-- 
A taste of linux = http://www.sharons.org.uk
my git repo = https://bitbucket.org/boudiccas/dots
TGmeds = http://www.tgmeds.org.uk
Debian testing, fluxbox 1.3.5, emacs 24.4.1.0