BBC Collections
Ralph Corderoy
ralph at inputplus.co.uk
Mon Oct 9 11:37:06 PDT 2017
Hi Vangelis,
> just examining page source of
> http://www.bbc.co.uk/iplayer/group/p056n6px
> I'm seeing href="*" URIs with "#group=p056n6px"
> appended to them... That's the ones containing the PIDs.
Here in Unix-land, renowned for its text processing...
$ g=p056n6px
$ curl -sS 'http://www.bbc.co.uk/iplayer/group/'$g |
> grep -o 'http://www\.bbc\.co\.uk/[^ ]*#group='$g
http://www.bbc.co.uk/iplayer/episode/p055t73r/the-colony#group=p056n6px
http://www.bbc.co.uk/iplayer/episode/p055vzj1/tuesday-documentary-the-block#group=p056n6px
http://www.bbc.co.uk/iplayer/episode/p055sys5/man-alive-gale-is-dead#group=p056n6px
http://www.bbc.co.uk/iplayer/episode/p053r2q1/waiting-for-work#group=p056n6px
http://www.bbc.co.uk/iplayer/episode/p00gxvjj/borrowed-pasture#group=p056n6px
http://www.bbc.co.uk/iplayer/episode/b0074tkn/40-minutes-heart-of-the-angel#group=p056n6px
$
--
Cheers, Ralph.
https://plus.google.com/+RalphCorderoy
More information about the get_iplayer
mailing list