OOP University: oop

Showing posts with label oop. Show all posts

Saturday, June 11, 2016

Basics: Just what IS an object, anyway?

This is a pretty long one by the standards of this blog. Try to stay with it, though, the concepts are crucial.

Whether people are experienced developers used to procedural languages or newcomers to programming in general, really understanding objects tends to be a bit of a sticking point. It's odd, because once you start to get it, everything seems quite natural.

An object is a representation of... something. Sometimes they represent fairly nebulous concepts, sometimes they represent very real physical things, but in all cases they're a kind of model. Objects have properties, just like a flower has a color or a bee has [6] legs. Objects have methods, which are really just things you can ask them to do, like 'release pollen' or 'sting someone'. Finally, objects have events, or at least they can generate events. That could be 'I've been pollinated' or 'I left my stinger in someone'. Any particular object may make use of one, two or all three of these constructs.

Objects are based on classes in Java. There is an all too human tendency to use the terms interchangeably and I've probably been guilty of doing that, too. Technically, a class defines how an object should be built and an object is an instance of such a class. Basically, after you write your code you compile it to create a .class file, and then when you run your code you can make a new object from said class. Making objects is easy enough, most of them get created (or instantiated) by using the new keyword like this:

SimpleObject myObject = new SimpleObject();

Let's break that down:

First, we have declared that we're interested in working with a variable of type SimpleObject. In other words, someone out there has written a SimpleObject.java file that defined a SimpleObject class and you're going to make an object from that definition.

That variable will be called 'myObject'. We have to name our variables or we'd have a really hard time referring to them in our code!

We're not referring to a previously existing SimpleObject, we're going to make a completely new one. The actual creation is handled by a constructor inside of SimpleObject.java, and we need to rely on that constructor doing its job, correctly setting up anything within the new object that needs to be in place.

Some objects don't appear to have constructors at all if you read the code, but that just means that there is no need for a constructor to do any setup work, so the programmer was able to rely on Java creating a default constructor for them. The code for SimpleObject may have been written either way. We don't care at this point, we just know we can call it. I'll get back to constructors a little later when we talk about actually writing a class of your own.

Defining an class in Java is straightforward enough. You don't really have to do any more than create a .java file with a bit of correct syntax. The following example is enough to make a SimpleObject class (which really can't do anything but exist):

package com.oopuniversity.simpleobject;

public class SimpleObject {
}

Of course, a class that doesn't do anything isn't very useful, but I think it's good to have a picture in your head of all the 'extra' stuff that absolutely needs to be in place. Code can look a bit busy to new developers, and its best to know what is basically template stuff that you should make sure is there and then ignore.

Just to break it down, that 'package' statement up at the top tells the compiler where the generated class file should go. It's basically specifying an output directory, but using periods instead of slashes or backslashes. Packaging is primarily an organizational tool and it turns out to be an important one later on.

Then we have 'public class SimpleObject' which tells us we're defining a class called SimpleObject. That public keyword is important, it controls whether other objects in a larger program are able to create SimpleObjects or even refer to them at all. For now, just use 'public'. The day will come when you start to use other modifiers for specific reasons, but if you're reading this to learn you don't have those reasons yet.

Then we have some curly braces. Those are ubiquitous in Java programs, and basically set boundaries for chunks of code. In this case, they are setting the boundary for the beginning and end of the class, although they don't actually contain anything. Anything between those brackets will be considered an attempt at having something be a part of SimpleObject.

Man, four paragraphs to describe three lines... I guess a fair amount of information is consolidated down into even that useless bit of code! Fortunately, that stuff always stays pretty much the same. Once you understand that structure, you can kind of stop worrying about it and move on.

I mentioned above that I would talk about constructors. Well, that time has come. The following code is (aside from being in a different package) precisely identical to the previous code:

package com.oopuniversity.simpleobjectwithconstructor;

public class SimpleObject {
    public SimpleObject() {
    }
}

The only differences are:

We changed the package definition, which lets us have this version of SimpleObject sit in the same project as the previous version without any conflicts.
Now we have something new inside the braces that define the class.

The package definition is needed because I'm keeping everything inside one big project. Just like you can't have two files with the same name in one directory, you can't have two classes with the same name in one package.

The new stuff inside the braces is defining a default constructor for SimpleObject. It's public which means other objects can use it to make a new SimpleObject. It has nothing between the parentheses, which means you can create a SimpleObject without having to give it any parameters, which are nothing more than pieces of information you give it (we'll talk about those soon). Then it has some more of those fun curly braces, which again define boundaries. Anything inside this particular pair of braces belongs not just to the class, but to the constructor itself.

Why would you want to write a constructor like this, making your class busier? Well, you wouldn't, and that's why you get this for free with any class you write that doesn't bother defining a constructor of its own. However, you DO need to know about and understand this for one simple reason: It is also possible to define a class using a constructor (or a whole bunch of them if you like) that has parameters. If you do this, the compiler will *not* create a default constructor for you and you won't be able to use a default constructor unless you explicitly write one.

https://github.com/OOPUniversity/OOP_Basics

Monday, December 15, 2014

Inheritance Part 2: That's Rather Abstract, Isn't it?

I hope you've been browsing through Java source code outside of your own and the small samples I've provided on this site. Reading and trying to understand code can only help you in the long run. You will encounter different code styles, you will see interesting data structures, API calls you didn't know existed, all sorts of stuff.

Today I'm going to talk about something you might have seen, but if you haven't, it's no big deal. It is the keyword abstract.

We use abstract as part of the definition of a class, which indicates that while it is in fact a valid class file, nobody is allowed to create an object of that type. That doesn't mean that it's useless though. Remember, we have inheritance on our side. You can create concrete classes (concrete is just the opposite of abstract, indicating a class from which you can create objects. No special keyword is needed for that) that extend abstract classes all you want.

We also use abstract as part of method signatures when we want to create the idea of a particular method call but wish to defer the actual implementation of said method to our concrete classes. This is similar in concept to interfaces, and sometimes it's more of a personal decision as to which you will use.

You see a lot of abstract classes in use when you look through frameworks, which are basically bodies of code designed to accomplish some function but with the details left up to the final developer. There are business frameworks, logging frameworks, GUI frameworks, reporting frameworks, the list is very long. Usually the trigger for creating a framework is when someone realizes they've just written the same thing for the fifth or tenth time, and that the differences between the various programs are relatively minor. Maybe you query a database, format a report and write it to a file, and it's always the same except for the details of which fields you look for and print.

This is a great application for abstract classes. Let's take a look at a simple one:

public abstract class Report {
}

Well, that's about as simple as it gets. To be fair it's also about as useless as it gets but never fear, we'll fill in some details later. Actually it's not entirely useless. You might want to have a set of classes that all share a common base class so that you can collect them into a Collection of some sort, although you could also make this work with an interface. Abstract classes really come into their own when you include some code within them. So let's write something.

public abstract class Report {
public abstract void GenerateOutputLine(String dataLine);
public abstract String ReadData();

public void runReport() {

String dataLine = null;
while( null != ( dataLine = ReadData() ) ) {
GenerateOutputLine(dataLine);
}
}

}

OK, that was simple enough. We have defined not only our abstract Report class, but made it do a little bit of work for us. It calls ReadData until that method returns a null, and then takes the value returned from ReadData and passes it to GenerateOutputLine. Of course, you can't actually run this report, you need to create at least one concrete class based on it. That could be a simple test driver designed to make absolutely certain that your framework behaves properly with known data. It could be something that reads from one file and writes to another, which would basically give you a 'copy file' command. It could be something that reads one file, does some calculations or modifications, and then spits out the changed data. The options are nearly endless, despite the fact that this is a highly simplified example.

Let's write a simple test driver:

public class ReportTester extends Report {

public static void main(String[] args) {
ReportTester rt = new ReportTester();
rt.runReport();
}

String [] dataLines = {
"one", "two", "three", "four", "five"
};
int lineNumber = 0;

public void GenerateOutputLine(String dataLine) {
System.out.println(dataLine);
}

public String ReadData() {
String toReturn = null;
if (lineNumber < dataLines.length) {
toReturn = dataLines[ lineNumber++ ] ;
}
return toReturn;
}
}

Now if we were to run 'ReportTester' we'd get the output:

one
two
three
four
five

The main driver of the program is still in that 'Report' superclass, but the implementation details have been kept in ReportTester. We run the code in Report, and it calls the methods it finds in ReportTester to do the work. ReportTester itself is quite simple. In addition to a simple main that does nothing more that create ReportTester and run it (actually running that method from the superclass Report), it defines the specific implementations for our two abstract methods. One method does nothing more than write to the console and I hope I don't need to explain that. The other goes through an array of Strings, returning the next one in the array and increasing the index value each time it's invoked until it reaches the end of the list, at which time it returns a null which triggers the end of the loop in 'runReport'.

This is of course tremendous overkill for a program of this size, but as your projects get more involved, the ratio of abstract code to concrete code is likely to change quite a bit. System maintenance gets easier and faster, and your software gets more robust. After all, a bug fix to a framework may take care of issues across several dozen different specific implementations of different reports.

I will of course revisit this topic later, because we've only scratched the surface of the possibilities brought on by object orientation. With enough small components intelligently built, one can assemble new programs almost like putting together Tinker Toys or Lego. One's systems can be defined in large part by configuration files, and changed easily at will, all without writing, testing or deploying new code.

Thursday, December 11, 2014

Good Practices: Coding to the Interface

You will sometimes hear someone recommend that you 'code to the interface'. What the heck does that mean?

Well, I'm going to get deeper into just what interfaces provide for you, and why you should both create and use them in a later post, but a simple example can be shown with the very common Collections interface "List".

Using List is very common, although a lot of people will begin using ArrayList directly. It's not that there's anything precisely wrong with doing this, In fact, probably nine or more out of ten times the only thing it will cost you is a few extra keystrokes. However, it serves as a good example in a section of code with which many of us get familiar early.

Suppose I wanted to have a list of active elements for a game. I could write this code to have it available within my GameData class:

private ArrayList<GameElement> gameElements = new ArrayList<GameElement>();

And then I could have a getter method:

public Arraylist<GameElement> getGameElements() {
return gameElements;
}

That's simple enough, right? So I go ahead and write a bunch of code within the rest of my game, happily accessing all the elements they need to function:

public void addElementToGame(GameElement newElement) {
ArrayList<GameElement> allElements = myGameData.getGameElements();
if (false == allElements.contains(newElement) {
allElements.add(newElement);
}
}
...
public void walkAllElements(ElementChecker myChecker) {
ArrayList<GameElement> allElements = myGameData.getGameElements();

for(GameElement element : allElements ) {
myChecker.checkElement(element);
}
}

etc. etc. etc.

It's not that there is anything wrong with this. In fact, you're feeling pretty good, you even made a method that accepts a worker object as a parameter and invokes some code on that worker object for each item in the list.

Then the reports start coming in: Sometimes the game gets very slow. It's generally after someone has been playing for a long time. Eventually, you pin it down to your element management code. It's fine with a few elements but once you get to having hundreds, certain operations get bogged down. At the same time, you realize that you'd love to prioritize these game elements by allowing inserts to happen at the start of the list for faster processing. That's got you down all weekend as you try to figure out what to do.

The light at the end of the tunnel comes when you review the documentation for the Collections framework and discover LinkedList. It might be a bit slower with smaller amounts of data, but it sounds like it will behave better for you with the large data sets you're processing and it offers the side benefit of being able to easily handle quick inserts at the start of the list.

That's when you suddenly get annoyed. You make the change in GameData and from all the red markers that show up on your screen you realize that you've got seven hundred and fifty places in your code base where you have to do nothing more than change the word 'ArrayList' to the word 'LinkedList'. This is all fixable, but it's going to take up a significant chunk of your time. Most of the code does not even have to change in any other way, maybe you've got two or five places where you will need to explicitly refer to methods that are specific to LinkedList, bu the rest are just mechanical updates of the code.

This is where coding to the interface could have saved you some time and effort. If, instead of referring to 'ArrayList' everywhere, you had just used 'List', all or at least most of your code would have otherwise remained exactly the same.

The declaration of gameElements could have been written this way:

private List<GameElement> gameElements = new ArrayList<GameElement>();

Your walkAllElements could have looked like this instead:

public void walkAllElements(ElementChecker myChecker) {
List<GameElement> allElements = myGameData.getGameElements();

for(GameElement element : allElements ) {
myChecker.checkElement(element);
}

Note that nothing has really changed here except for changing 'ArrayList' to 'List' in some places. This is because 'List' is an interface, which is really a kind of contract. It forces an object that implements it to support a set of defined methods, and both ArrayList and LinkedList do this.

public class ArrayList implements List {

public class LinkedList implements List {

Now this just scratches the surface of interface usage. Later on we'll get into creating and using your own, and how that can save you even more time and energy, and bring your coding to a new level. For now though, just be aware that classes can implement interfaces, and that whenever possible, it's better to write your code to care only about said interfaces, and not to give a hang about the specific object type it's dealing with. Get used to it, let it become your natural way of operating. It will serve you well later on. Eventually you will be creating your own interfaces, which can serve as the control points for widely varying objects. I can drive a car from any manufacturer, because they all offer me the same set of controls. I do not need to be licensed to drive a Subaru, separately licensed to drive a Volvo, and have a learner's permit for a Ford. Interfaces are like specifying 'gas pedal, brake pedal, steering wheel'. Oh, and objects can support arbitrary numbers of interfaces, so in reality my car is defined something like this:

public class S60 implements Driveable, GasPowered, Geartronic, PushbuttonStart, KeylessEntry {

There would no doubt be a lot more if I sat down to think about it all. The point is, once I understand how to drive a car, I can drive pretty much any car, because they ALL implement Driveable. If I go to my local Avis and rent something, I may not know about all the features, the navigation system may be a complete mystery to me, but man, I know how to work a steering wheel.

Friday, December 5, 2014

Encapsulation - Protecting You From Yourself

Encapsulation sounds like the central plot point of some weird science fiction movie, but actually it's a fundamental technique for improving the reliability and predictability of object oriented programs. Let's pull out our old friend PersonData and see what's ailing him today:

public class PersonData {
public String givenName;
public String surName;

public PersonData() { }

public PersonData(String given, String sur) {
givenName = given;
surName = sur;
}

public String toString() {
return givenName + " " + surName;
}
}

Let me start by saying that there is nothing fundamentally wrong with this class. It is properly formed, is created with a valid constructor, and has a fully functional toString override method that formats the name is a reasonable manner. Is it object oriented? Well, it's an object at any rate, but we can do better.

Here's the key issue I'd like to address today. Suppose we had a method that we wanted to use to ensure that a person's name begins with a capital letter. The logic for this is simple enough, but I'm going to deliberately break it for this example, so don't use this method as-is:

public static String ensureFirstCharacterIsUpperCase(String input) {
String upperCaseString = null;
if (Character.isUpperCase(input.charAt(0))) {
return input;
}
upperCaseString = input.substring(0,1).toUpperCase() + input.substring(1);
return input; //Bad programmer! No Pizza!
}

Ignoring for the moment the fact that this method will not quite do what we actually want, how would one go about using it? Well, we could go ahead and write something like this:

PersonData joeBlow = new PersonData(); joeBlow.givenName=ensureFirstCharacterIsUpperCase("joe"); joeBlow.surName=ensureFirstCharacterIsUpperCase("blow");
System.out.println(joeBlow);

That would work just fine, the first, second, and eight hundredth time you do it. But why would you want to? What if you decided that you wanted to capitalize all surnames? What if you wanted to plug in some library that understands how to capitalize special names like MacNeil or something? Do you really want to search out every instance of setting the last name and change it?

There is a better way, my friend, and it gets to one of the other key aspects of object oriented programming. Encapsulation is basically protecting your data from your own programs by making it inaccessible except through a very clearly defined path that you strictly control.

We start by making the fields surName and givenName private. This indicates that they cannot be changed by code in any class except PersonData.

private String givenName;
private String surName;

Of course, we aren't quite finished at this point, because we can't change or even see these values. That could make for a remarkably useless class if we did not find a way around it. The way around it is by creating 'accessor' and 'mutator' methods. Those are horrible names so we usually just call them 'getters' and 'setters'. These labels actually make a lot more sense, because the naming convention is to prefix the field name with 'get' and 'set'.

The required methods for PersonData could look like this:

public void setGivenName(String newGivenName) {
givenName = newGivenName;
}

public String getGivenName() {
return givenName;
}

public void setSurName(String newSurName) {
surName = newSurName;
}

public String getSurName() {
return surName;
}

With the fields marked private, and with our getters and setters in place, we would now write the code above more like this:

PersonData joeBlow = new PersonData();
joeBlow.setGivenName (ensureFirstCharacterIsUpperCase("joe"));
joeBlow.setSurName (ensureFirstCharacterIsUpperCase("blow"));
System.out.println(joeBlow);

But wait, there's more!

Why should we take a chance that we (or some other developer) fails to use that method to make sure our first character is correct? We can do better than that, and make sure it always happens no matter what kind of night someone had. For now, we'll move 'ensureFirstCharacterIsUpperCase' into PersonData (we'll talk about a better way later) and we'll change our setters a bit:

public void setGivenName(String newGivenName) {
givenName = ensureFirstCharacterIsUpperCase(newGivenName);
}

public void setSurName(String newSurName) {
surName = ensureFirstCharacterIsUpperCase(newSurName);
}

With this done, we can now do this instead:

PersonData joeBlow = new PersonData();
joeBlow.setGivenName ("joe");
joeBlow.setSurName ("blow");
System.out.println(joeBlow);

And still get everything looking the way we expect. We've minimized the amount of code we need to write, and we've encapsulated both field access and a bit of business logic in our PersonData class. Malicious or misinformed programmers will not be able to bypass our hard and fast rules. IDEs won't even let you see the fields in their helpful pop-up dialogs. Dogs will love us.

Of course, we can (and really should) do more than this. First off, we have a constructor that accepts the two names, and we need to make sure it's also performing this task. That's probably most easily done by also having it call the setters. Second, we can think about other business rules we can enforce, like "A user's first name cannot be null or zero length". The setter can check for this and refuse to accept a bad value. Of course, if it does so it also needs to complain to the caller that they've done it wrong. This would be done via an Exception, which we'll need to talk about soon.

Get used to writing your code this way. Protect your data from yourself. Use accessor and mutator methods regularly and uniformly. It's a small amount of extra work, but you will thank yourself later. It's actually a VERY small amount of extra work, given that any decent IDE will create the methods for you if you just ask, and it will certainly save you time and energy later on.

I'll update this post later with a link to a gist containing the fully updated PersonData Object.

Wednesday, December 3, 2014

Diamonds are not an Object's Best Friend

Once again, I turn to a post I made on reddit for inspiration.

Some people wonder why inheritance branch but never join. Why not tie two different families of objects together and extend both of 'em?

I can see the point of that, one could conceivably build objects that join several disparate branches of inheritance into large and versatile classes. Conceivably. One could also get oneself into a world of trouble, though, and this is most easily explained via the so-called diamond inheritance problem.

Please note that you cannot do this and expect your code to compile, it's just a conceptual demonstration.

Diamond.java
public class Diamond extends Class1, Class2 {
}

Class1.java
public class Class1 {
    public void doSomething() {
         System.out.println("Class1.doSomething()");
    }
}

Class2.java
public class Class2 {
    public void doSomething() {
         System.out.println("Class2.doSomething()");
    }
}

Now you have a main that uses the above classes:

public static void main(String[] args) {
    Diamond diamond = new Diamond();
    diamond.doSomething();
}

So, when you run main, what happens? This is the essence of the diamond problem. You wind up having two lines of inheritance converging on one and run into a quite serious issue.

This basically just scratches the surface of the problem, of course. We can also think about what happens when you try to invoke something in super. The question would have to become 'which super'? There are many potential negative ramifications from the interactions that could be created by allowing this kind of inheritance, and the creators of Java decided that the best way to avoid them was by preventing the issue altogether.

One can achieve most of the positive effects of multiple inheritance via Interfaces and a few helpful design patterns such as Adapter, Composite and Decorator.

Inheritance Part 1 - That Class is Super!

I would like to discuss the concept of inheritance, and no, I don't mean the vase that Aunt Gladys left you in her will.

When we talk about inheritance in object oriented programming languages, we are talking about how one object can be based on another object. In Java, we use the keyword extends to do this.

However, in order to make this as simple as possible, let's ignore even that keyword for a moment. EVERY single Java class automatically extends the class Object without any effort at all on your part.

Let's look at an example Person data object, which I will continue to use as an example for future posts.

public class Person {
    private String firstName;
    private String lastName;

    public Person(String newFirstName, String newLastName) {
        firstName = newFirstName;
        lastName = newLastName;
    }
    ...
}

As I mentioned above, Person automatically extends Object, so in reality the class definition is really more like this:

    public class Person extends Object {

But we just don't bother writing it. We call Object the superclass of Person and Person is called a subclass of Object.

As far as what this brings to the table, let's use what is perhaps simultaneously one of the most common features and one of the most common issues people have with objects when they are starting out. Printing them out.

If I create a new Person object and want to see what it looks like, I might be tempted to do this:

Person joe = new Person("Joe", "Blow");

System.out.println(joe);

If I do this, I will be sorely disappointed when I run my program, because while I might expect to see this:

Joe Blow

I will instead see something like this:

Person@4f1d0d

At this point, I begin to tremble and sweat. In a panic I turn to stackoverflow.com or /r/javahelp and complain that my object is broken and my printouts are garbled. The regulars there probably make fun of me for not actually posting my code correctly or something. My dog bites me and my girlfriend refuses to go out with me. OK, I may have exaggerated a bit.

But what's happening here is that when you call 'System.out.println(joe)' you are (again, invisibly) actually in effect calling 'System.out.println(joe.toString())'. After all, you need to turn your object into something that can be displayed on the console.

You might at this point be saying to yourself "But wait, I don't have a 'toString' in Person" and in a sense you're right. You certainly didn't create such a thing. But you did in fact inherit a toString method from Object. Object is pretty ignorant about your code, though, and has no idea how you intend to format your printouts, so it just tells you the type of object and where in your JVMs memory that object is stored. In general, if a superclass contains a method, a subclass can call it as though it were a part of the subclass itself.

This is all inheritance really is, in the end. If we try to call a method or refer to a variable in an object, and it isn't mentioned in that object, the compiler tries to see if it's available via inheritance. If so, you avoid compile errors and things 'work'. They may not work the way you want, but at least something interesting happens.

Let's fix the problem now. I could take the cheap and easy way out like this:

System.out.println(joe.firstName + " " + joe.lastName);

And that would work. But honestly, all we really need to do is create our own 'toString' method in Person that does things the way we want. This will save us the effort of having to write the above code everywhere that wants to just display someone's name. You'll really appreciate that when you decide that you'd rather display that guy as "Blow, Joe" instead and have to change code in 23 different places.

Add the following block of code to Person:

public String toString() {
return firstName + " " + lastName;
}

And just like that, the original program will print the name out the way you wanted it to in the first place.

There, you've done it. You've made use of an inherited method, and then you created an override for it. An override is nothing more or less than specifying that instead of using the method found somewhere up the line of inheritance, you want to use this specific method instead.

When it comes down to it, overriding toString is just the tip of the iceberg, and I will of course be revisiting this later on. There are some extremely interesting things that you can do with inheritance, and it forms the basis for many important language structures. For now though, just remember that one class can extend another one, and that when it does so it has the option of replacing methods in its superclass with its own more specific version. There are a few other rules you'll need to follow, but most of them are not too horrible to deal with.

View Code

Monday, November 24, 2014

Collections Part 2: Map

It's time to talk about the Map.

Sometimes we need to store and retrieve information based on its relationship to other information in a keyed fashion. There are a lot of reasons for this, but for the purpose of this post we'll pretend that we want to store product descriptions based on item numbers.

So I have a store, where I sell an eclectic group of items:

Item #	Item Name
100001	Tooth straightener
100002	Dust Collection
100101	Pickled Snail Shell
100201	Odor Vent
200001	Checkerboard Paint
200101	Toenail Fastener

Oddly, business has not been very good lately.

Now, if I wanted to computerize my system, I might decide that I wanted a way to have the system display the description of an item when the item number was entered (or selected from a list, or something).

Let me start by saying it would be entirely possible to create two parallel arrays of String variables, such that itemNumbers[0] contains "100001" and itemNames[0] contains "Tooth straightener". One could search through the first array for the item number for which you were looking, take the index value you found and display the value from the second array. This can get a bit clumsy, and it would take a lot of work to keep everything in sync, but it could work just fine (albeit a bit slowly when the number of items gets large). Adding items would be relatively easy. Deleting them would be annoying, because you'd have to move all the items down. There are other ways you could do it, too.

But doesn't the following seem quite natural and easy?

//Get name of item based on item number
String itemName = products.get(itemNumber);

//Add new item type to product list
products.put(newItemNumber, newItemName);

//Remove obsolete item type from product list
products.remove(oldItemNumber);

Enter the Map.

A Map is a Collections framework interface designed for just this purpose. Keyed lookups like this are very handy, and the Map makes it easy to set up, too.

Like List, Map is just an interface. You will need to create an actual object that implements the Map interface in order to use one, and the most likely candidate is the HashMap.

So your code would look like this:

In the old days, before generics:

Map products = new HashMap();

Or if you're working in this century:

Map<String, String> products = new HashMap<String, String>();

The two object types are the key and the value data types, respectively.

Friday, November 21, 2014

Collections Part 1: List

Let's talk about Collections. I don't mean baseball cards... or dust. I'm talking about the Collections framework, which is a powerful set of classes included with your Java distribution.

This is not meant to be an in depth tutorial. I'm merely going to scratch the surface and give a few guidelines regarding basic use of the framework.

The Collections framework is incredibly important to the Java developer. You will spend a great deal of time working with the data structures within it. The two that you will most likely use more than any other are List and Map.

List and Map are not classes. They are interfaces. An interface can be thought of as the set of controls for a range of classes. Much like you might have a gas powered or electric powered oven, you still turn a dial and set a desired temperature. The interface is the same, although the implementation is different. This is a key concept: Code to the Interface is commonly said, and slightly less commonly actually done. I will do other posts on this idea, it's very important.

Let's start with List. We'll do Map in another post.

Suppose I wanted to keep track of a few numbers read in from a file. Each line contains one number, but I don't know how big the file is. While it would be possible to load the numbers into an array, it would be a bit messy, as I would have to manage the size of the array. Enter the 'ArrayList'. This is a Collections based object that works very much like an Array with a turbocharger on it. I don't have any hard numbers on this, but my suspicion is that it is the most commonly used Collections class out there.

Let's say my pseudocode looks something like this:

Open a file

For each line in the file:

Read an integer number from the file
Store the integer in a list for later use

Generate a sum of all the integers in the list as SumTotal
Display SUM to the user

There are multiple paths to success of course, and it's hard to say that any given solution is exactly 'wrong'. However, unless we know in advance how many values we're going to store, or we wildly oversize the initial allocation, using an array would require us to modify the length of the array regularly. It's not that this is particularly hard, but it's even easier not to do it.

And let's face it: One of the secrets of successful programming is doing things the easy way. It's a hard enough job, there's no reason to add unnecessary bells, whistles and epicycles to our code.

I'm not going to address the bits about reading the file right now, that will be in another post. I just want to concentrate on 'Store the integer in a list for later use' and 'Generate a sum of all the integers in the list as SumTotal'.

First, we have to create the ArrayList object. I'm going to do so in a way I haven't shown you before, but it's not too awful to understand:

List<Integer> numberList = new ArrayList<Integer>();

What?

Well, honestly Java can deal with this without having those '<Integer>' parts, but I thought you ought to get used to using them. They're really handy for preventing certain classes of errors when programs get bigger.

What this is doing is creating a variable called 'numberList'. It is an object of type 'List', and that 'List' is constrained to hold only Integer values. That's the reason for the syntax, it makes sure we don't do something silly like try to put a String or a PersonData in there.

List is NOT a class. You cannot create a 'new List()' directly. It is what's known as an interface, and essentially describes how to use a category of classes, rather than one single class.

ArrayList IS a class. You could do this if you so choose:

ArrayList<Integer> sillyNumberList = new ArrayList<Integer>();

This is roughly akin to adding the name of the manufacturer before mentioning an appliance. Every time. You wouldn't tell someone to go get you a beer from the Acme refrigerator, you'd just say refrigerator. The fact that it's an Acme was only important when you bought it (or when you need to have it repaired. Don't ask me about refrigerator repair, it's a sore spot). The fact that it's a refrigerator means that you will grab a handle, pull it, and find something yummy and cold inside. The specific manufacturer does not matter. Similarly, a List allows you to add items, remove items, look at items and such, all without having to know that it is specifically an ArrayList.

Let's imagine that we've now got a value we want to add as an Integer, in a variable called 'numberToAddToList' (imaginative, huh?) We'd just do this:

numberList.add(numberToAddToList);

No muss, no fuss, no checking capacity. Set it and forget it. You can do this once, or a thousand times or more. That one line is all you'll need to use.

The other bit we need to work on is generating a sum of the numbers. The way to do this, of course, is to take all the numbers, one by one, and add them to a variable that started at zero. I'm going to do this three times here. The first time using a for loop the way we've already done. Next, I'll do so using an 'Iterator', which is a helper type designed to go through all the items in a Collection one by one, has been around forever, and is oftentimes a better choice than a standard for loop. The last is using the 'enhanced' for loops that have been in the Java spec for a few years now. None of these ways are strictly wrong, but if I was reviewing your code I'd want to see or hear a good reason for not doing it the third way.

Integer sumTotal = 0;

1)
for (int i = 0; i < numberList.size(); i++ ) {
sumTotal = sumTotal = numberlist.get(i);
}

2)
Iterator<Integer> iter = numberList.iterator();
while(iter.hasNext()) {
sumTotal = sumTotal + iter.next();
}

3)
for(Integer number : numberList) {
sumTotal = sumTotal + number;
}

In all cases, we'd just then write:

System.out.println("The total is " + sumTotal);

As I think you can see, the new enhanced for loop makes life pretty easy. You declare a variable and state where it comes from, and then just use it in the loop.

The iterator probably looks pretty awful to you, and to be fair, that's the cleaned up version available to us since we got to start using Generics. Back before we could add that <Integer> tag it would have looked like this:
Iterator iter = numberList.Iterator();
while(iter.hasNext()) {
sumTotal = sumTotal + (Integer)(iter.next());
}

But it was still a useful construct quite often. Naturally in small examples like these the full impact does not show itself.

List can be used for much more than numbers of course. Any sort of Object can be stored in a list. You can have a List of Strings, or a List of Lists, or a List of Maps...

Next time, we'll discuss Maps, which instead of sequential access specialize in looking up specific objects based on a key value.

OOP University