James Gallagher Just some random stuff from me

Meaning from a set of meeting minutes

Born out of trying to get a meaningful view of Galway City Council meeting minutes (see Council Meetings), a pet project of mine is to parse those minutes. The idea is to extract contextual information such as the councillors and officials present, the agenda items, and the chronology and outcome of motions. I haven’t fully considered the end presentation; since I’ve been inspired by KildareStreet.com, I may look at the open source software used there. Of course, the document is written for people to read, which makes automation a challenge. I’ve opted to use Perl, and so far I split the document (after converting it to plain text) on a list of pre-defined section headings, i.e. agenda items:

Consideration of Minutes
Reports of Committee Meetings
Consideration of Reports of Officials
Consideration of Reports of Mayor
Business Prescribed by Statute
Notice of Motions
Conferences
Questions
Correspondence
Any Other Business

Anything prior to the first occurrence of one of these section headings is treated as preamble, i.e. a clump of text that I’m not sure what to do with yet. So far the preamble seems to be ad hoc and may contain information about things like presentations to the councillors.
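
To make the splitting step concrete, here is a minimal sketch of the idea. It is in Python rather than the Perl I'm actually using, and it assumes each heading sits on a line of its own and that case doesn't matter. The names split_sections and HEADING_RE are made up for illustration.

```python
import re

# Section headings taken from the list above.
HEADINGS = [
    "Consideration of Minutes",
    "Reports of Committee Meetings",
    "Consideration of Reports of Officials",
    "Consideration of Reports of Mayor",
    "Business Prescribed by Statute",
    "Notice of Motions",
    "Conferences",
    "Questions",
    "Correspondence",
    "Any Other Business",
]

# Assumption: a heading occupies a whole line, ignoring case and
# leading/trailing spaces or tabs.
HEADING_RE = re.compile(
    r"^[ \t]*(" + "|".join(re.escape(h) for h in HEADINGS) + r")[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def split_sections(text):
    """Return (heading, body) pairs in document order.

    Text before the first recognised heading is returned under the
    label "Preamble" (only if it is non-empty).
    """
    matches = list(HEADING_RE.finditer(text))
    sections = []

    first_start = matches[0].start() if matches else len(text)
    preamble = text[:first_start].strip()
    if preamble:
        sections.append(("Preamble", preamble))

    for i, match in enumerate(matches):
        # A section body runs until the next heading or the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections.append((match.group(1), text[match.end():end].strip()))

    return sections
```

A heading that appears twice simply yields two sections; whether those should be merged is an open question.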

I’m looking to split out proposals by the delimiters ‘proposed’ and ‘seconded’ (i.e. the sentences with these words form the beginning and the end). That’s the next task for the time being. Trying to relate these to a motion reference will then be a fun task. Here’s an example:

Cllr. T. Costello proposed that:
“A Special Meeting be held in two weeks time i.e. 26th January to discuss Lead Contamination in the Public Watersupply.”
This was seconded by Cllr. Brolcháin N.Ó.
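
As a rough sketch of how that split might work (Python here, not my Perl, and not tested against real minutes): treat a line ending in 'proposed that' as the start, a later line containing 'seconded by' as the end, and whatever sits between them as the motion text. The names PROPOSAL_RE and extract_proposals are made up, and the pattern assumes the layout of the example above.

```python
import re

# The example from the minutes, used as sample input.
SAMPLE = (
    "Cllr. T. Costello proposed that:\n"
    "“A Special Meeting be held in two weeks time i.e. 26th January "
    "to discuss Lead Contamination in the Public Watersupply.”\n"
    "This was seconded by Cllr. Brolcháin N.Ó.\n"
)

# Assumption: the proposer and 'proposed that' share one line, the motion
# text follows on one or more lines, and the seconder is named on a single
# later line containing 'seconded by'.
PROPOSAL_RE = re.compile(
    r"^(?P<proposer>[^\n]+?)[ \t]+proposed that:?[ \t]*\n"
    r"(?P<text>.*?)\n"
    r"[^\n]*?\bseconded by[ \t]+(?P<seconder>[^\n]+)",
    re.DOTALL | re.MULTILINE,
)


def extract_proposals(text):
    """Return a list of dicts with proposer, seconder and motion text."""
    proposals = []
    for match in PROPOSAL_RE.finditer(text):
        proposals.append({
            "proposer": match.group("proposer").strip(),
            "seconder": match.group("seconder").strip(),
            # Drop surrounding whitespace and straight or curly double quotes.
            "text": match.group("text").strip().strip("“”\""),
        })
    return proposals
```

Calling extract_proposals(SAMPLE) should give one entry with Cllr. T. Costello as proposer. A proposal that is never seconded would need separate handling, since this pattern keeps scanning until it finds the next 'seconded by'.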

While I’m not proficient in Perl, it has certainly made things a lot easier. Easy string handling and accessible regular expressions are key here, I think.

Since I’m not too bright, I’d really appreciate any thoughts people have on this. It may be a fool’s errand, but I see no point in keeping it to myself and giving up easily.

  • Well if you manage to do it, and I imagine you will, fair play. It would be a great resource!
  • Councils (that I'm aware of) don't have transcripts, there's only this.

    That said, from your description above it looks like a line containing 'proposed that', followed by N lines, followed by a line containing 'was seconded by', would be regex-able. And a bit of name parsing might allow a person to generate per-councillor RSS feeds of activity.

    You're not a long way from this sort of thing at that point.

    (I've not really addressed the contents of the rest of the minutes, but you get the thought process.)

    Perhaps join the KildareStreet-new-registered-organisation list and see if we can get some other eyes on it? That goes for all the other commenters too :)
  • Thanks for that, John. So far I've just written the basic section parsing and the name parsing. From testing, I seem to be able to pick up all the names that get mentioned in the minutes. Getting the motions isn't too bad, but it looks like two things will be quite messy: relating a proposed motion back to its motion identifier, and then picking up the subsequent voting. I'll probably just go with some kind of 'near the motion text' logic to do that. Thanks also for the tip on the Google Group ... oh, and the inspiration!
  • So would this work on minutes only or would you be able to do something with a transcript?
  • I'd hope to be able to make it more generic once I get some more experience with techniques for this. The reason for the minutes choice is that I want the information and the improved context so it's a good motivator.