Category Archives: Research

Come and do a funded PhD with me

Fancy doing a PhD with me at Cardiff Metropolitan University? I have a fully-funded studentship (for UK/EU students) starting in January, in collaboration with HP in Bristol:

The Department of Computing & Information Systems, Cardiff Metropolitan University, is pleased to offer a fully funded PhD Studentship in Provably Optimal Code Generation.

This research project (Scaling Superoptimisation for Enterprise Applications) is part of an on-going strategic collaboration between Cardiff Metropolitan University and Hewlett-Packard in Bristol; HP is a leading technology company that operates in more than 170 countries around the world, providing infrastructure and business offerings that span from handheld devices to some of the world’s most powerful supercomputers.

Applicants must have an excellent first degree in Computer Science, Computer Engineering, Mathematics or a related discipline, with interests/experience at the hardware/software interface and/or in mathematical foundations.

This three-year PhD will commence in January 2015. The PhD bursary consists of the standard tuition fee for a Home/EU student (to be £3,760 in 2014/15) and a stipend linked to the minimum amount set annually by Research Councils UK (currently £13,590 p.a.).

Project Context:

Our world is increasingly dependent on the effectiveness and performance of software. Tools and methodologies for creating useful software artefacts have been around for many years, but the scalability of these systems for solving challenging real-world problems is, in many important cases, poor. While there are numerous socio-technical issues associated with developing large software systems, there is a significant opportunity to address the optimisation of software in a strategic, adaptable and platform-independent way.

Superoptimisation is an approach to optimising code by aiming for optimality from the outset, rather than as the aggregation of heuristics that are neither intended nor guaranteed to give provable optimality. Building on previous work by Crick et al., this research project will further develop the theoretical foundations of superoptimisation, as well as developing a scalable toolchain for superoptimising enterprise-level software applications.
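To make the idea concrete, here is a minimal sketch of the brute-force flavour of superoptimisation, assuming a made-up four-instruction accumulator machine (the instruction names, `run` and `superoptimise` are all hypothetical and are not taken from the project or from the earlier work it builds on). Candidate programs are enumerated in order of increasing length, so the first one that matches the target behaviour is the shortest over this instruction set.

```python
from itertools import product

# Hypothetical toy instruction set acting on a single accumulator register.
INSTRUCTIONS = {
    "inc": lambda x: x + 1,
    "dec": lambda x: x - 1,
    "dbl": lambda x: x * 2,
    "neg": lambda x: -x,
}


def run(program, x):
    """Execute a sequence of instruction names on the input value x."""
    for name in program:
        x = INSTRUCTIONS[name](x)
    return x


def superoptimise(target, test_inputs, max_length=4):
    """Return the shortest program that agrees with target on test_inputs.

    Programs are tried in order of increasing length, so the first match
    is minimal in length for this instruction set. Returns None if no
    program of at most max_length instructions matches.
    """
    for length in range(max_length + 1):
        for program in product(INSTRUCTIONS, repeat=length):
            if all(run(program, x) == target(x) for x in test_inputs):
                return list(program)
    return None


# Search for the shortest program computing 2x + 2.
shortest = superoptimise(lambda x: 2 * x + 2, test_inputs=range(-8, 9))
print(shortest)  # ['inc', 'dbl']
```

Agreement on a finite set of test inputs is only evidence, not proof, of equivalence. A superoptimiser aiming for provable optimality would need to verify each candidate formally, for example with a satisfiability-based check, and the exhaustive search shown here grows exponentially with program length, which is exactly the scalability problem the project addresses.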

 
For informal enquiries, please send me an email: tcrick@cardiffmet.ac.uk (but please apply via FindAPhD or here).

Deadline for applications: Friday 31 October.


Paper submitted to Recomputability 2014: “Share and Enjoy”: Publishing Useful and Usable Scientific Models

Last month, Ben Hall, Samin Ishtiaq and Kenji Takeda (all Microsoft Research) and I submitted a paper to Recomputability 2014, to be held in conjunction with the 7th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2014) in London in December. This workshop is an interdisciplinary forum for academic and industrial researchers, practitioners and developers to discuss challenges, ideas, policy and practical experience in reproducibility, recomputation, reusability and reliability across utility and cloud computing. It aims to provide an opportunity to share and showcase best practice, as well as offering a platform to further develop policy, initiatives and practical techniques for researchers in this domain.

In our paper, we discuss a number of issues in this space, proposing a new open platform for the sharing and reuse of scientific models and benchmarks. You can download our arXiv pre-print; the abstract is as follows:

The reproduction and replication of reported scientific results is a hot topic within the academic community. The retraction of numerous studies from a wide range of disciplines, from climate science to bioscience, has drawn the focus of many commentators, but there exists a wider socio-cultural problem that pervades the scientific community. Sharing data and models often requires extra effort, and this is currently seen as a significant overhead that may not be worth the time investment.

Automated systems, which allow easy reproduction of results, offer the potential to incentivise a culture change and drive the adoption of new techniques to improve the efficiency of scientific exploration. In this paper, we discuss the value of improved access and sharing of the two key types of results arising from work done in the computational sciences: models and algorithms. We propose the development of an integrated cloud-based system underpinning computational science, linking together software and data repositories, toolchains, workflows and outputs, providing a seamless automated infrastructure for the verification and validation of scientific models and in particular, performance benchmarks.

 
(see GitHub repo)


Come and do a (fully-funded) PhD with me

Fancy doing a PhD with me at Cardiff Metropolitan University? I have a fully-funded studentship (for UK/EU students) starting in September, in collaboration with HP in Bristol:

Scaling Superoptimisation for Enterprise Applications

Our world is increasingly dependent on the effectiveness and performance of software. Tools and methodologies for creating useful software artefacts have been around for many years, but the scalability of these systems for solving challenging real-world problems is, in many important cases, poor. While there are numerous socio-technical issues associated with developing large software systems, there is a significant opportunity to address the optimisation of software in a strategic, adaptable and platform-independent way.

Superoptimisation is an approach to optimising code by aiming for optimality from the outset, rather than as the aggregation of heuristics that are neither intended nor guaranteed to give provable optimality. Building on previous work by Crick et al., this research project will further develop the theoretical foundations of superoptimisation, as well as developing a scalable toolchain for superoptimising enterprise-level industrial software applications. This research project is a collaboration between Cardiff Metropolitan University and Hewlett-Packard (HP) in Bristol; HP is a leading technology company that operates in more than 170 countries around the world, providing infrastructure and business offerings that span from handheld devices to some of the world’s most powerful supercomputers.

Applicants must have an excellent first degree in Computer Science, Computer Engineering, Electronics or a related discipline, with interests/experience in compilers, optimisation, logic programming, satisfiability modulo theories and mathematical foundations.

 
For informal enquiries, send me an email: tcrick@cardiffmet.ac.uk (but please apply via FindAPhD or here).

Deadline for applications: Friday 22 August.


Paper submitted to WSSSPE2: “Can I Implement Your Algorithm?”: A Model for Reproducible Research Software

Yesterday, Ben Hall and Samin Ishtiaq (both Microsoft Research Cambridge) and I submitted a paper to WSSSPE2, the 2nd Workshop on Sustainable Software for Science: Practice and Experiences, to be held in conjunction with SC14 in New Orleans in November. As per the aims of the workshop: progress in scientific research is dependent on the quality and accessibility of software at all levels, and it is critical to address challenges related to the development, deployment and maintenance of reusable software, as well as education around software practices.

As discussed in our paper, we feel this multitude of research software engineering problems is manifest not just in computer science, but also across the computational science and engineering domains (particularly with regard to benchmarking and availability of code). We highlight a number of recommendations to address these issues, as well as proposing a new open platform for scientific software development. You can download our arXiv pre-print; the abstract is as follows:

The reproduction and replication of novel scientific results has become a major issue for a number of disciplines. In computer science and related disciplines such as systems biology, the issues closely revolve around the ability to implement novel algorithms and approaches. Taking an approach from the literature and applying it in a new codebase frequently requires local knowledge missing from the published manuscripts and project websites. Alongside this issue, benchmarking, and the development of fair, and widely available benchmark sets present another barrier. In this paper, we outline several suggestions to address these issues, driven by specific examples from a range of scientific domains. Finally, based on these suggestions, we propose a new open platform for scientific software development which effectively isolates specific dependencies from the individual researcher and their workstation and allows faster, more powerful sharing of the results of scientific software engineering.

 
(see GitHub repo)


Paper in ACM TOCE: “Restart: The Resurgence of Computer Science in UK Schools”

Further to the previous CAS papers, Neil Brown (University of Kent), Sue Sentance (formerly Anglia Ruskin University, now CAS), Simon Humphreys (CAS/BCS) and I have had a paper accepted into ACM Transactions on Computing Education: Restart: The Resurgence of Computer Science in UK Schools, part of a Special Issue on Computing Education in (K-12) Schools.

The paper will soon be available to download for free via the ACM Author-ize service (or you can download our pre-print); the abstract is as follows:

Computer science in UK schools is undergoing a remarkable transformation. While the changes are not consistent across each of the four devolved nations of the UK (England, Scotland, Wales and Northern Ireland), there are developments in each that are moving the subject to become mandatory for all pupils from age 5 onwards. In this article, we detail how computer science declined in the UK, and the developments that led to its revitalisation: a mixture of industry and interest group lobbying, with a particular focus on the value of the subject to all school pupils, not just those who would study it at degree level. This rapid growth in the subject is not without issues, however: there remain significant forthcoming challenges with its delivery, especially surrounding the issue of training sufficient numbers of teachers. We describe a national network of teaching excellence which is being set up to combat this problem, and look at the other challenges that lie ahead.

 
(see Publications)


Paper at HCII 2014: “Changing Faces: Identifying Complex Behavioural Profiles”

In June, my colleague Giles Oatley presented a joint paper entitled: Changing Faces: Identifying Complex Behavioural Profiles at HCII 2014, the 16th International Conference on Human-Computer Interaction in Crete.

If you do not have institutional access to SpringerLink, especially the Lecture Notes in Computer Science series, you can download our pre-print. The abstract is as follows:

There has been significant interest in the identification and profiling of insider threats, attracting high-profile policy focus and strategic research funding from governments and funding bodies. Recent examples attracting worldwide attention include the cases of Chelsea Manning, Edward Snowden and the US authorities. The challenge with profiling an individual across a range of activities is that their data footprint will legitimately vary significantly based on time and/or location. The insider threat problem is thus a specific instance of the more general problem of profiling complex behaviours. In this paper, we discuss our preliminary research models relating to profiling complex behaviours and present a set of experiments related to changing roles as viewed through large-scale social network datasets, such as Twitter. We employ psycholinguistic metrics in this work, considering changing roles from the standpoint of a trait-based personality theory. We also present further representations, including an alternative psychological theory (not trait-based), and established techniques for crime modelling, spatio-temporal and graph/network, to investigate within a wider reasoning framework.

 
(see Publications)


Call for Papers: Recomputability 2014

I am co-chairing Recomputability 2014, the first workshop to focus explicitly on recomputability and reproducibility in the context of utility and cloud computing. It is open to all members of the cloud, big data, grid, cluster computing and open science communities. Recomputability 2014 is an affiliated workshop of the 7th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2014), to be held in London in December 2014.

Recomputability 2014 will provide an interdisciplinary forum for academic and industrial researchers, practitioners and developers to discuss challenges, ideas, policy and practical experience in reproducibility, recomputation, reusability and reliability across utility and cloud computing. It will provide an opportunity to share and showcase best practice, as well as to provide a platform to further develop policy, initiatives and practical techniques for researchers in this domain. Participation by early career researchers is strongly encouraged.

Proposed topics of interest include (but are not limited to):

  • infrastructure, tools and environments for recomputability and reproducibility in the cloud;
  • recomputability for virtual machines;
  • virtual machines as self-contained research objects or demonstrators;
  • describing and cataloging cloud setups;
  • the role of community/open access experimental frameworks and repositories for virtual machines and data, their operation and sustainability;
  • validation and verification of experimental results by the community;
  • sharing and publication issues;
  • recommending policy changes for recomputability and reproducibility;
  • improving education and training: best practice, novel uses, case studies;
  • encouraging industry’s role in recomputability and reproducibility.

Please see the full call for papers; the deadline for submissions (online via EasyChair) has been extended from 10 August 2014 to 17 August 2014.


The personal cost of applying for research grants

For many academics, this article is a no-brainer. Research grant proposals take huge amounts of time to put together, with low success rates (e.g. EPSRC). It’s a huge cost:

The pressure to win high-status funding means that researchers go to extraordinary lengths to prepare their proposals, often sacrificing family time and personal relationships. During our research into the stressful process of applying for research grants, one researcher, typical of many, said, “My family hates my profession. Not just my partner and children, but my parents and siblings. The insecurity despite the crushing hours is a soul-destroying combination that is not sustainable.”

 


Critical questions for computer science education research

Over the past two years, we have seen wholesale reform of computing (and more specifically, computer science) education in the UK. In England from September 2014, a new national curriculum subject Computing, with a challenging and aspirational programme of study (“A high-quality computing education equips pupils to use computational thinking and creativity to understand and change the world.”) will replace ICT; in Scotland, we see Computing Science forming part of their Curriculum for Excellence; in Wales, September’s review of the ICT curriculum is shaping the ongoing Curriculum for Wales review; along with burgeoning activity in Northern Ireland.

While there is a large corpus of computing education research, along with national and international policy reports, such as the ACM/CSTA’s Running on Empty (2010), the Royal Society’s Shut down or restart? report (2012) and ACM Europe’s informatics education report (2013), there still remain a number of critical questions in computer science education. The recent announcement of the UK Forum for Computing Education provides an opportunity to support this important research agenda. Further to a group discussion led by members of the CSTA at a recent ACM Education Council meeting, the following list of questions covers a breadth of issues and reflects the deep need for further research-grounded solutions to the issues we face.

  • What are the indicators of incoming student success in introductory level computer science in colleges and universities?
  • Does computer science learning in schools contribute to success/improvement in other disciplines, especially mathematics and science?
  • What is the link between age/educational development and the potential to learn and master computer science concepts?
  • Are there issues of ergonomics in the introduction of computing devices with young children?
  • Is there a link between previous mathematics learning and success in computer science at school level?
  • What are the major factors that lead to students making early choices not to pursue computer science?
  • What is the role of informal education programs in scaffolding learning in computer science, especially in communities where access to computer science learning in school is limited?
  • What are the potential benefits and drawbacks of MOOCs in school student learning?
  • What are the potential benefits and drawbacks of MOOCs for the professional development of computer science teachers?
  • What models of professional development are most effective for improving teacher mastery of computer science concepts and pedagogy?
  • What are the impacts of current efforts to market computer science to students?
  • To what extent do poverty and lack of home access to computer science tools impact computer science performance and/or interest in school?
  • Do one-to-one devices per child programs have any impact on computer science interest or performance?
  • What are the major factors in computer science teacher retention?
  • What is required to increase the availability of teacher preparation programs for computer science teachers?
  • What is the impact of transitioning the content of teacher preparation courses in “educational technology/AV” to a focus on computational thinking across STEM?
  • What is the ideal balance between content knowledge learning and pedagogical learning in computer science teacher preparation and alternative certifications?
  • Do hybrid programs (educators and volunteer partnerships) improve student access to rigorous computer science courses and increase the pool of well-prepared computer science teachers?

Which of these do you think is most important? And what is missing? (The questions are listed in no particular order and have been labelled alphabetically for easy referencing in the comments.)


Paper at AI-2013: “‘The First Day of Summer’: Parsing Temporal Expressions with Distributed Semantics”

In December, my PhD student Benjamin Blamey presented a joint paper entitled: ‘The First Day of Summer’: Parsing Temporal Expressions with Distributed Semantics at AI-2013, the 33rd SGAI International Conference on Artificial Intelligence in Cambridge.

If you do not have institutional access to SpringerLink, especially the Research and Development in Intelligent Systems series, you can download our pre-print. The abstract is as follows:

Detecting and understanding temporal expressions are key tasks in natural language processing (NLP), and are important for event detection and information retrieval. In the existing approaches, temporal semantics are typically represented as discrete ranges or specific dates, and the task is restricted to text that conforms to this representation. We propose an alternate paradigm: that of distributed temporal semantics, where a probability density function models relative probabilities of the various interpretations. We extend SUTime, a state-of-the-art NLP system, to incorporate our approach, and build definitions of new and existing temporal expressions. A worked example is used to demonstrate our approach: the estimation of the creation time of photos in online social networks (OSNs), with a brief discussion of how the proposed paradigm relates to the point- and interval-based systems of time. An interactive demonstration, along with source code and datasets, is available online.
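As a rough illustration of what a distributed representation might look like (this is not the paper's implementation and does not use SUTime), the sketch below models the expression "the first day of summer" as a mixture of two Gaussians over the day of the year. The two candidate dates, their weights and their spreads are assumptions invented for the example, and a Northern Hemisphere, non-leap year is assumed.

```python
import math


def gaussian(day, mean, std):
    """Value of a normal density with the given mean and spread at `day`."""
    z = (day - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2 * math.pi))


# Hypothetical interpretations of "the first day of summer":
# (day of year, weight, spread in days). All three numbers are assumptions.
INTERPRETATIONS = [
    (152, 0.4, 3.0),  # 1 June, start of meteorological summer
    (172, 0.6, 1.0),  # 21 June, around the summer solstice
]


def density(day):
    """Unnormalised mixture density over the day of the year."""
    return sum(w * gaussian(day, m, s) for m, w, s in INTERPRETATIONS)


# Normalise over a 365-day year so the values form a probability distribution.
days = range(1, 366)
total = sum(density(d) for d in days)
probabilities = {d: density(d) / total for d in days}

most_likely = max(probabilities, key=probabilities.get)
print(most_likely)  # 172
```

Unlike a single date or a fixed interval, every day of the year receives some probability, so competing readings of the same expression can coexist and be weighed against other evidence.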

 
(see Publications)


Grant applications, early 20th century style


Facsimile of a research proposal submitted by Otto Warburg to the Notgemeinschaft der Deutschen Wissenschaft (Emergency Association of German Science), c.1921.

The application, which consisted of a single sentence, “I require 10,000 marks”, was funded in full.

(read the full Nature Reviews Cancer article)


2014 Software Sustainability Institute Fellowship


I’m delighted to have been named today as one of the sixteen Software Sustainability Institute Fellows for 2014.

The Software Sustainability Institute (SSI) is an EPSRC-funded project based at the universities of Edinburgh, Manchester, Oxford and Southampton, and draws on a team of experts with a breadth of experience in software development, project and programme management, research facilitation, publicity and community engagement. It’s a national facility for cultivating world-class research through software, whose goal is to make it easier to rely on software as a foundation of research; see their manifesto. The SSI works with researchers, developers, funders and infrastructure providers to identify the key issues and best practice surrounding scientific software.

During my fellowship, I’m particularly keen to work closely with Software Carpentry and Mozilla Science Lab to highlight the importance of software skills across the STEM disciplines. I’m also interested in a broader open science/open computation agenda; see the Recomputation Manifesto and the recently established recomputation.org project.

More to follow in 2014!


Colloquial definitions of Big, Open and Personal Data

Here’s a useful (draft) set of colloquial definitions for Big, Open and Personal Data on GitHub from the Open Data Institute.

Why is this a worthwhile exercise? Well, Open Data gets conflated with Personal Data, everyone talks about Big Data (yet no-one is exactly sure what it is, but many have tried to define it)…and we all should be concerned about Personal Data.


1. Big Data is (i) data that you cannot handle with conventional tools or (ii) a term used as a vague metaphor for solving problems with data.

2. Open Data is data that anyone can use; without legal, technical or financial barriers.

3. Personal Data is data derived from people, where you can distinguish a person from other people in the group.

(also, can Big Open Personal (BOP) Data exist?)


A set of top Computer Science Education blogs

Further to my most-read blog post (from May 2012: A set of top Computer Science blogs, 80,000 hits and counting), here’s a follow-up: blogs on computer science education.

As before, instead of a list, it more closely resembles a set: the order is irrelevant and there are no duplicate elements; membership of this set of blogs satisfies all of the following conditions:

  1. they focus on computer science education (research, policy and practice);
  2. they are of consistently high quality;
  3. I regularly read them.

  • Computing Education Blog by Mark Guzdial (@guzdial)

    Mark is a professor in the School of Interactive Computing at Georgia Institute of Technology and a researcher in computing education. His blog is about how people come to understand computing, and how to facilitate that understanding, cross-cutting research, policy, practice and wider societal issues. And while it is US-focused (as you would expect), it is an excellent venue for the discussion of key topics in computer science education.

  • Teach Computing by Alan O’Donohoe (@teknoteacher)

    Alan is a busy chap: as well as being principal teacher of Computing at Our Lady’s High School in Preston, he’s the founder of both Hack To The Future and Raspberry Jam, the global community of events for everyone to discover the wonders of the Raspberry Pi. His blog tracks his five-year computing journey: from improving classroom practice (listen to his Teach Computing podcasts), contributing back to the community as a CAS Master Teacher, to shaping the development of a new curriculum subject in England.

  • Miss Philbin’s Teaching and Learning Journal by Carrie Anne Philbin (@MissPhilbin)

    Carrie Anne is an award-winning secondary teacher at Robert Clack School in Essex and a passionate advocate for women in technology. She is the creator of Geek Gurl Diaries, a YouTube web series for teenagers who want to be makers and creators of technology (which recently won a Talk Talk Digital Hero Award) and vice-chair of the CAS initiative #include to address diversity issues in computing. Her blog also covers the gamut of classroom practice, the transition from ICT to computing, supporting the wider community, to shaping policy in England.

  • Academic Computing by Neil Brown (@twistedsq)

    Neil is a research associate in the Programming Languages and Systems Group at the University of Kent, working on the BlueJ and Greenfoot projects. He writes thought-provoking pieces on topics spanning computing (and more broadly, STEM) education, programming and socio-technical issues. He also has a second blog on learning and applying mathematics through computing: The Sinepost.

  • An Open Mind by Miles Berry (@mberry)

    Miles is a principal lecturer and the subject leader for Computing Education at the University of Roehampton. He sits on the boards of both CAS and Naace, with wide experience of curriculum development in the UK. His blog, a personal perspective on education, technology and culture, covers a range of interesting pieces on computer science and programming pedagogy, CPD and agile practice.

  • Computer Science Teacher by Alfred Thompson (@alfredtwo)

    Alfred is a high school computer science teacher in New Hampshire, having previously been the K-12 Computer Science Academic Relations Manager for Microsoft and a software developer for 18 years. He currently sits on the board of the Computer Science Teachers Association. His blog covers a wide range of topics, including computer science and programming pedagogy, curriculum development and US education policy.

  • Knowing and Doing: reflections of an academic and computer scientist by Eugene Wallingford (@wallingf)

    Eugene is an associate professor and head of the Department of Computer Science at the University of Northern Iowa. He has been blogging since 2004 on topics across computing, software development, higher education, learning and teaching, as well as managing and leading.

  • Raspberry Pi Blog by the Raspberry Pi Foundation (@Raspberry_Pi)

    These guys need no introduction, especially after the two millionth Raspberry Pi was sold in October! With the huge success and penetration of the Raspberry Pi over the past two years, the platform now exists for the Foundation to fulfil its wider educational objectives. A diverse blog, ranging from technical posts, peripherals and resources, to superb examples of innovative uses of the Raspberry Pi.

  • CSTA Blog by the Computer Science Teachers Association (@csteachersa)

    The Computer Science Teachers Association is a membership organisation (free to join), supported by the ACM, that promotes and supports the teaching of computer science and other computing disciplines in the US, providing opportunities for K–12 teachers and students to better understand the computing disciplines and to more successfully prepare themselves to teach and learn. Its blog covers a wide range of topics across computer science education, programming, curriculum design and education policy.

  • CAS Online by Computing At School (@CompAtSch)

    Computing At School is a membership organisation (also free to join), supported by the BCS, that promotes and supports the teaching of computer science in UK schools. Formed in 2008, it now has over 7000 members from across schools, colleges, universities, industry and government and is the subject association for computer science. Along with numerous high-quality articles in the quarterly CAS newsletter, Switched On, CAS Online provides the UK computer science education community with a wide range of forums, events, policy discussions, consultations and a veritable wealth of resources to support learning and teaching.

This set is most definitely incomplete — please post your computer science education blog recommendations in the comments below. You can also read some of my posts on computer science education.


Ten Simple Rules for Reproducible Computational Research

In a paper published last week in PLoS Computational Biology, Sandve, Nekrutenko, Taylor and Hovig highlight the issue of replication across the computational sciences. The dependence on software libraries, APIs and toolchains, coupled with massive amounts of data, interdisciplinary approaches and the increasing complexity of the questions being asked, is complicating replication efforts.

To address this, they present ten simple rules for reproducibility of computational research:
 

Rule 1: For Every Result, Keep Track of How It Was Produced

Rule 2: Avoid Manual Data Manipulation Steps

Rule 3: Archive the Exact Versions of All External Programs Used

Rule 4: Version Control All Custom Scripts

Rule 5: Record All Intermediate Results, When Possible in Standardized Formats

Rule 6: For Analyses That Include Randomness, Note Underlying Random Seeds

Rule 7: Always Store Raw Data behind Plots

Rule 8: Generate Hierarchical Analysis Output, Allowing Layers of Increasing Detail to Be Inspected

Rule 9: Connect Textual Statements to Underlying Results

Rule 10: Provide Public Access to Scripts, Runs, and Results
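As a small illustration of Rules 1 and 6, here is a minimal sketch of an analysis that stores its random seed and some basic provenance alongside the result. The analysis itself (a Monte Carlo estimate of pi) and the names `run_analysis` and `run_with_provenance` are hypothetical and chosen only for the example; they do not come from the paper.

```python
import json
import platform
import random
import sys


def run_analysis(seed):
    """Hypothetical analysis: a seeded Monte Carlo estimate of pi."""
    rng = random.Random(seed)  # private generator, so the seed fully fixes the run
    n = 10_000
    inside = 0
    for _ in range(n):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            inside += 1
    return 4 * inside / n


def run_with_provenance(seed):
    """Return the result together with what is needed to reproduce it."""
    return {
        "result": run_analysis(seed),
        "seed": seed,                                  # Rule 6: note the seed
        "python_version": platform.python_version(),   # Rule 1: how it was produced
        "command": sys.argv,
    }


record = run_with_provenance(seed=2013)
print(json.dumps(record, indent=2))
```

Re-running with the same seed and the same Python version should give the same estimate, and the JSON record can be archived next to any plot or table derived from it.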


The rationale underpinning these rules clearly resonates with the work of the Software Sustainability Institute: better science through superior software. Based at the universities of Edinburgh, Manchester, Oxford and Southampton, it is a national facility for cultivating world-class research through software (for example, Software Carpentry). An article that caught my eye in July was the Recomputation Manifesto: computational experiments should be recomputable for all time. In light of the wider open data and open science agenda, should we also be thinking about open software and open computation?


“We Are Sorry to Inform You…”

How much damage could be caused by a peer reviewer having a bad day? You only have to have a look through the list of Turing Award winners (or some of the top cited papers in computer science) to see that, given the current standards for reviewing, many of those papers would never have been published. As highlighted in this CACM article, they would have come up against journal reviewers who would have rejected such works, considering them too speculative or theoretical. More specifically for UK academics, how many of them would be REF returnable?

Even if you have had a paper brutally rejected, enjoy these fictitious reviews (also here) of seminal papers in computer science, including work from Turing, Hoare, Dijkstra and Shannon. For example:

On Computable Numbers, with an Application to the Entscheidungsproblem

This is a bizarre paper… If the article is accepted, Turing should remember that the language of this journal is English and change the title accordingly.


CAS paper at SIGCSE’13: “Bringing Computer Science Back Into Schools: Lessons From The UK”

Further to the previous CAS papers, Neil Brown (University of Kent) presented a paper entitled: Bringing Computer Science Back Into Schools: Lessons From The UK at SIGCSE’13, the 44th ACM Technical Symposium on Computer Science Education, in Denver in March.

The paper is available to download for free via the ACM Author-ize service below; you can also listen to Neil’s voice-over of the presentation slides. The abstract is as follows:


Computer science in UK schools is a subject in decline: the ratio of Computing to Maths A-Level students (i.e. ages 16–18) has fallen from 1:2 in 2003 to 1:20 in 2011 and in 2012. In 2011 and again in 2012, the ratio for female students was 1:100, with less than 300 female students taking Computing A-Level in the whole of the UK each year. Similar problems have been observed in the USA and other countries, despite the increased need for computer science skills caused by IT growth in industry and society. In the UK, the Computing At School (CAS) group was formed to try to improve the state of computer science in schools. Using a combination of grassroots teacher activities and policy lobbying at a national level, CAS has been able to rapidly gain traction in the fight for computer science in schools. We examine the reasons for this success, the challenges and dangers that lie ahead, and suggest how the experience of CAS in the UK can benefit other similar organisations, such as the CSTA in the USA.

 

ACM DL Author-ize service

Neil C. C. Brown, Michael Kölling, Tom Crick, Simon Peyton Jones, Simon Humphreys, Sue Sentance
SIGCSE ’13 Proceeding of the 44th ACM Technical Symposium on Computer Science Education, 2013


(see Publications)


Paper at AI-2012: “R U :-) or :-( ? Character- vs. Word-Gram Feature Selection for Sentiment Classification of OSN Corpora”

In December, my PhD student Benjamin Blamey presented a joint paper entitled: R U :-) or :-( ? Character- vs. Word-Gram Feature Selection for Sentiment Classification of OSN Corpora at AI-2012, the 32nd SGAI International Conference on Artificial Intelligence in Cambridge (for which he also won the best poster prize).

If you do not have institutional access to SpringerLink, especially the Research and Development in Intelligent Systems series, you can download our pre-print. The abstract is as follows:


Binary sentiment classification, or sentiment analysis, is the task of computing the sentiment of a document, i.e. whether it contains broadly positive or negative opinions. The topic is well-studied, and the intuitive approach of using words as classification features is the basis of most techniques documented in the literature. The alternative character n-gram language model has been applied successfully to a range of NLP tasks, but its effectiveness at sentiment classification seems to be under-investigated, and results are mixed. We present an investigation of the application of the character n-gram model to text classification of corpora from online social networks, the first such documented study, where text is known to be rich in so-called unnatural language, also introducing a novel corpus of Facebook photo comments. Despite hoping that the flexibility of the character n-gram approach would be well-suited to unnatural language phenomenon, we find little improvement over the baseline algorithms employing the word n-gram language model.

(see Publications)
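
For anyone unfamiliar with the two feature types compared in the abstract, here is a minimal sketch of word n-gram versus character n-gram extraction. It is not the code used in the paper: the function names `word_ngrams` and `char_ngrams`, the lowercasing, the whitespace tokenisation and the choice of n are all my own assumptions for illustration.

```python
from collections import Counter


def word_ngrams(text, n):
    """Return the word n-grams of text (hypothetical helper, whitespace tokenisation)."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def char_ngrams(text, n):
    """Return the character n-grams of text (hypothetical helper, spaces included)."""
    s = text.lower()
    return [s[i:i + n] for i in range(len(s) - n + 1)]


comment = "R U :-) or :-( ?"

# Bag-of-features counts that a classifier could consume.
word_features = Counter(word_ngrams(comment, 1))
char_features = Counter(char_ngrams(comment, 3))

print(word_features)
print(char_features)
```

The character model sees emoticons, misspellings and abbreviations as overlapping fragments rather than as whole tokens, which is the flexibility the abstract refers to.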


Paper at WiPSCE’12: “Grand Challenges for the UK: Upskilling Teachers to Teach Computer Science Within the Secondary Curriculum”

Further to the CAS paper presented at Koli Calling 2011 in Finland in November 2011, Sue Sentance (Anglia Ruskin University) presented a paper entitled: Grand Challenges for the UK: Upskilling Teachers to Teach Computer Science Within the Secondary Curriculum at WiPSCE’12, the 7th International Workshop in Primary and Secondary Computing Education, in Hamburg in November.

The paper is available to download for free via the ACM Author-ize service below; the abstract is as follows:


Recent changes in UK education policy with respect to ICT and Computer Science (CS) have meant that more teachers need the skills and knowledge to teach CS in schools. This paper reports on work in progress in the UK researching models of continuing professional development (CPD) for such teachers. We work with many teachers who either do not have an appropriate academic background to teach Computer Science, or who do and have not utilised it in the classroom due to the curriculum in place for the last fifteen years. In this paper we outline how educational policy changes are affecting teachers in the area of ICT and Computer Science; we describe a range of models of CPD and discuss the role that local and national initiatives can play in developing a hybrid model of transformational CPD, briefly reporting on our initial findings to date.

ACM DL Author-ize service

Sue Sentance, Mark Dorling, Adam McNicol, Tom Crick
WiPSCE ’12 Proceedings of the 7th Workshop in Primary and Secondary Computing Education, 2012


(see Publications)
