Difference between revisions of "Data abstraction"

From CS 61A Wiki
Jump to: navigation, search
[checked revision][checked revision]
(use definition from http://www.eecs.berkeley.edu/~bh/ssch17/part5.html)
m (Data abstraction violation: reword)
Line 116: Line 116:
 
</syntaxhighlight> We assume that the internal representation of a point is a tuple. Instead, we should have used the selectors <code>x_coord</code> and <code>y_coord</code>.
 
</syntaxhighlight> We assume that the internal representation of a point is a tuple. Instead, we should have used the selectors <code>x_coord</code> and <code>y_coord</code>.
  
Data abstraction violations destroy flexibility because if you want to change the representation of the ADT, you must change all the functions that use the ADT. If you maintain data abstraction, you need only change the constructors and selectors.
+
Data abstraction violations limit flexibility because if you want to change the representation of the ADT, you must change all the functions that use the ADT. If you maintain data abstraction, you need only change the constructors and selectors.
  
 
== Sources ==
 
== Sources ==
 
* http://inst.eecs.berkeley.edu/~cs61a/fa13/slides/10-Data_6pp.pdf
 
* http://inst.eecs.berkeley.edu/~cs61a/fa13/slides/10-Data_6pp.pdf

Revision as of 22:38, 2 June 2014

Data abstraction is the invention of new data types. It separates functionality from representation—we don't need to know how data is represented internally; we just need to know how to interact with it.

Data abstraction is supported by defining an abstract data type (ADT), which is a collection of constructors and selectors. Constructors create an object, bundling together different pieces of information, while selectors extract individual pieces of information from the object.

Functional ADTs

ADTs can be implemented by functions. Functional ADTs are usually immutable; that is, their fields cannot be modified.

Examples

An ADT for recursive lists:

empty_rlist = None
 
# constructor
def make_rlist(first, rest=empty_rlist):
    return first, rest
 
# selector
def first(r):
    return r[0]
 
# selector
def rest(r):
    return r[1]

An ADT for points:

# constructor
def make_point(x, y):
    return x, y
 
# selector
def x_coord(point):
    return point[0]
 
# selector
def y_coord(point):
    return point[1]
and a function that uses it:
def slope(point1, point2):
    """Return the slope of the line that connects POINT1 and POINT2."""
    return (y_coord(point1) - y_coord(point2)) / (x_coord(point1) - x_coord(point2))

An ADT for rational numbers (in dispatch function style) that supports mutation:

# constructor
def rational(x, y):
    """
    >>> rat = rational(1, 3)
    >>> print_rational(rat)
    1 / 3
    >>> set_numer(rat, 2)
    >>> print_rational(rat)
    2 / 3
    """
    def put(field, value):
        nonlocal x, y
        if field == 'numer':
            x = value
        elif field == 'denom':
            y = value
 
    def get(field):
        if field == 'numer':
            return x
        elif field == 'denom':
            return y
    return put, get
 
# selector
def numer(rat):
    return rat[1]('numer')
 
# selector
def denom(rat):
    return rat[1]('denom')
 
# mutator
def set_numer(rat, value):
    rat[0]('numer', value)
 
# mutator
def set_denom(rat, value):
    rat[0]('denom', value)
 
def print_rational(rat):
    print("{0} / {1}".format(numer(rat), denom(rat)))

OOP ADTs

ADTs can also be implemented through object-oriented programming as Python objects. OOP ADTs are mutable and have built-in constructors (__init__) and selectors (dot notation).

Example

An ADT for a recursive list:

class Rlist:
    class EmptyList:
        pass
 
    empty = EmptyList()
 
    # constructor
    def __init__(self, first, rest=empty):
        self.first = first
        self.rest = rest

Data abstraction violation

A data abstraction violation occurs when you assume some representation of the ADT, bypassing the constructors and selectors. For example, we are committing a data abstraction violation when we write the following slope function using the point ADT above:

def slope(point1, point2):
    """Return the slope of the line that connects POINT1 and POINT2."""
    return (point1[1] - point2[1]) / (point1[0] - point2[0])
We assume that the internal representation of a point is a tuple. Instead, we should have used the selectors x_coord and y_coord.

Data abstraction violations limit flexibility because if you want to change the representation of the ADT, you must change all the functions that use the ADT. If you maintain data abstraction, you need only change the constructors and selectors.

Sources