Where to find the example code
Our example is a set of command-line applications for managing an address book data file, encoded using protocol buffers. The command add_person_go
adds a new entry to the data file. The command list_people_go
parses the data file and prints the data to the console.
You can find the complete example in the examples directory of the GitHub repository.
Defining your protocol format
To create your address book application, you’ll need to start with a .proto
file. The definitions in a .proto
file are simple: you add a message for each data structure you want to serialize, then specify a name and a type for each field in the message. In our example, the .proto
file that defines the messages is addressbook.proto
.
The .proto
file starts with a package declaration, which helps to prevent naming conflicts between different projects.
syntax = "proto3";
package tutorial;
import "google/protobuf/timestamp.proto";
The go_package
option defines the import path of the package which will contain all the generated code for this file. The Go package name will be the last path component of the import path. For example, our example will use a package name of “tutorialpb”.
option go_package = "github.com/protocolbuffers/protobuf/examples/go/tutorialpb";
Next, you have your message definitions. A message is just an aggregate containing a set of typed fields. Many standard simple data types are available as field types, including bool
, int32
, float
, double
, and string
. You can also add further structure to your messages by using other message types as field types.
message Person {
string name = 1;
int32 id = 2; // Unique ID number for this person.
string email = 3;
enum PhoneType {
MOBILE = 0;
HOME = 1;
WORK = 2;
}
message PhoneNumber {
string number = 1;
PhoneType type = 2;
}
repeated PhoneNumber phones = 4;
google.protobuf.Timestamp last_updated = 5;
}
// Our address book file is just one of these.
message AddressBook {
repeated Person people = 1;
}
In the above example, the Person
message contains PhoneNumber
messages, while the AddressBook
message contains Person
messages. You can even define message types nested inside other messages – as you can see, the PhoneNumber
type is defined inside Person
. You can also define enum
types if you want one of your fields to have one of a predefined list of values – here you want to specify that a phone number can be one of MOBILE
, HOME
, or WORK
.
The " = 1", " = 2" markers on each element identify the unique “tag” that field uses in the binary encoding. Tag numbers 1-15 require one less byte to encode than higher numbers, so as an optimization you can decide to use those tags for the commonly used or repeated elements, leaving tags 16 and higher for less-commonly used optional elements. Each element in a repeated field requires re-encoding the tag number, so repeated fields are particularly good candidates for this optimization.
If a field value isn’t set, a default value is used: zero for numeric types, the empty string for strings, false for bools. For embedded messages, the default value is always the “default instance” or “prototype” of the message, which has none of its fields set. Calling the accessor to get the value of a field which has not been explicitly set always returns that field’s default value.
If a field is repeated
, the field may be repeated any number of times (including zero). The order of the repeated values will be preserved in the protocol buffer. Think of repeated fields as dynamically sized arrays.
You’ll find a complete guide to writing .proto
files – including all the possible field types – in the Protocol Buffer Language Guide. Don’t go looking for facilities similar to class inheritance, though – protocol buffers don’t do that.
Compiling your protocol buffers
Now that you have a .proto
, the next thing you need to do is generate the classes you’ll need to read and write AddressBook
(and hence Person
and PhoneNumber
) messages. To do this, you need to run the protocol buffer compiler protoc
on your .proto
:
-
If you haven’t installed the compiler, download the package and follow the instructions in the README.
-
Run the following command to install the Go protocol buffers plugin:
$ go install google.golang.org/protobuf/cmd/protoc-gen-go
The compiler plugin
protoc-gen-go
will be installed in$GOBIN
, defaulting to$GOPATH/bin
.It must be in your
$PATH
for the protocol compilerprotoc
to find it. -
Now run the compiler, specifying the source directory (where your application’s source code lives – the current directory is used if you don’t provide a value), the destination directory (where you want the generated code to go; often the same as
$SRC_DIR
), and the path to your.proto
. In this case, you would invoke:$ protoc -I=$SRC_DIR --go_out=$DST_DIR $SRC_DIR/addressbook.proto
Because you want Go code, you use the
--go_out
option – similar options are provided for other supported languages.# e.g., $ protoc -I=./proto --go_out=gen_proto proto/addressbook.proto
This generates github.com/protocolbuffers/protobuf/examples/go/tutorialpb/addressbook.pb.go
in your specified destination directory.
The Protocol Buffer API
Generating addressbook.pb.go
gives you the following useful types:
- An
AddressBook
structure with aPeople
field. - A
Person
structure with fields forName
,Id
,Email
andPhones
. - A
Person_PhoneNumber
structure, with fields forNumber
andType
. - The type
Person_PhoneType
and a value defined for each value in thePerson.PhoneType
enum.
You can read more about the details of exactly what’s generated in the Go Generated Code guide, but for the most part you can treat these as perfectly ordinary Go types.
Here’s an example from the list_people
command’s unit tests of how you might create an instance of Person:
p := pb.Person{
Id: 1234,
Name: "John Doe",
Email: "jdoe@example.com",
Phones: []*pb.Person_PhoneNumber{
{Number: "555-4321", Type: pb.Person_HOME},
},
}
Writing a Message
The whole purpose of using protocol buffers is to serialize your data so that it can be parsed elsewhere. In Go, you use the proto
library’s Marshal function to serialize your protocol buffer data. A pointer to a protocol buffer message’s struct
implements the proto.Message
interface. Calling proto.Marshal
returns the protocol buffer, encoded in its wire format. For example, we use this function in the add_person
command:
book := &pb.AddressBook{}
// ...
// Write the new address book back to disk.
out, err := proto.Marshal(book)
if err != nil {
log.Fatalln("Failed to encode address book:", err)
}
if err := ioutil.WriteFile(fname, out, 0644); err != nil {
log.Fatalln("Failed to write address book:", err)
}
Reading a Message
To parse an encoded message, you use the proto
library’s Unmarshal function. Calling this parses the data in buf
as a protocol buffer and places the result in pb
. So to parse the file in the list_people
command, we use:
// Read the existing address book.
in, err := ioutil.ReadFile(fname)
if err != nil {
log.Fatalln("Error reading file:", err)
}
book := &pb.AddressBook{}
if err := proto.Unmarshal(in, book); err != nil {
log.Fatalln("Failed to parse address book:", err)
}
My Demo
https://github.com/swsmile/protobuf_demo/tree/main