XRDS

Crossroads: The ACM Magazine for Students

Association for Computing Machinery

Magazine: Fall 2012 | Volume 19, No. 1

COLUMN: Letter from the editors

Announcing the XRDS blog

By Peter Kinnaird, Inbal Talgam-Cohen

Tags: Document types

DEPARTMENT: Inbox

Inbox

By XRDS Staff

Tags: Document types, General and reference

Big data

ACM Author-Izer

Six tips for students interested in big data analytics

Creativity and computing: An ACM student chapter initiative

Blogs

SECTION: Features

Sketching and streaming algorithms for processing massive data

The rate at which electronic information is generated in the world is exploding. In this article we explore techniques known as sketching and streaming for processing massive data both quickly and memory-efficiently.

By Jelani Nelson

Tags: Information retrieval, Information systems applications

Big privacy

Approaches from computer science and statistical science for assessing and protecting privacy in large, public data sets.

By Ashwin Machanavajjhala, Jerome P. Reiter

Tags: Collaborative and social computing systems and tools, Computational complexity and cryptography, Cryptography, Data encryption, Database administration, Information systems applications, World Wide Web

Taming big probability distributions

New algorithms for estimating parameters of distributions over big domains need significantly fewer samples.

By Ronitt Rubinfeld

Tags: Information retrieval, Probability and statistics

Designing good MapReduce algorithms

An introduction to designing algorithms for the MapReduce framework for parallel processing of big data.

By Jeffrey D. Ullman

Tags: Concurrent programming languages, Database query languages (principles), Natural language processing, Query languages

Big data and internships at Cloudera

Students working in the big data space get uniquely valuable experiences and perspectives by taking industrial internships, which can help further their research agendas.

By Yanpei Chen, Andrew Ferguson, Brian Martin, Andrew Wang, Patrick Wendell

Tags: Computing occupations, Database query languages (principles), Distributed programming languages, Query languages

An interview with Surajit Chaudhuri

Surajit Chaudhuri, Distinguished Scientist and head of the Extreme Computing Group (XCG) at Microsoft Research, Redmond, provides valuable insights for revisiting data analytics in the context of big data.

By Aditya Parameswaran

Tags: Computing occupations, Data management systems, Database management system engines

Peregrine: Low-latency queries on Hive warehouse data

Big data platforms: What's next?

Three computer scientists from UC Irvine address the question "What's next for big data?" by summarizing the current state of the big data platform space and then describing ASTERIX, their next-generation big data management system.

By Vinayak R. Borkar, Michael J. Carey, Chen Li

Tags: Database management system engines, Information retrieval

Interactive analysis of big data

New user interfaces can transform how we work with big data, and raise exciting research problems that span human-computer interaction, machine learning, and distributed systems.

By Jeffrey Heer, Sean Kandel

Tags: Image manipulation, Information retrieval, Information systems applications, Rendering

Propagation and immunization in large networks

Parallel machine learning on big data

On algorithms for parallel machine learning, and why they need to be more efficient.

By John Langford

Tags: Concurrent programming languages, Database management system engines, Parallel algorithms

Big data in computational biology

An invitation to the digital science of life.

By Cliburn Chan

Tags: Information retrieval, Information systems applications

Profile: Jeff Dean, Big data at Google

The Centre for Advanced Spatial Analysis at University College London, London, UK

Automated DNA sequencers

Finding yourself using geolocation and the Google Maps API

SECTION: Online features

FREE CONTENT FEATURE

The secret behind the Luhn-ie

A look at the Luhn algorithm and how it is used in the 21st century for error detection.

By Broderick Causley

FREE CONTENT FEATURE

Mastering real-time big data with stream processing chains

To reconcile application logic concerns with event handling performance, we introduce the spChains processing framework.

By Dario Bonino, Luigi De Russis
