The Northeastern Luzon Subgroup of
Philippine Languages
Laura C. Robinson and Jason William Lobel
UNIVERSITY OF CALIFORNIA, SANTA BARBARA
AND UNIVERSITY OF HAWAI‘I AT M NOA
This paper presents a survey of the languages of the northeastern part of the
large northern Philippine island of Luzon—Dupaningan Agta, Pahanan Agta,
Casiguran Agta, Nagtipunan Agta, Dinapigue Agta, Paranan, and Kasiguranin—the first five of which are spoken by Negrito Filipino groups. With the
exception of Kasiguranin, these languages compose a subgroup called Northeastern Luzon. Evidence is presented to determine the internal and external
relationships of these languages, including historical phonology, functors,
and lexicon. It is argued that they are not members of the Northern Cordilleran subgroup, as has been previously suggested, but instead form a primary
branch of the Northern Luzon (Cordilleran) subgroup.
1. INTRODUCTION.1 The eastern coast of northern Luzon, from Casiguran,
Aurora to the small island of Palaui off the northern tip of Luzon near Santa Ana,
Cagayan, is home to a world rather different from the rest of the modern Philippines.
Although Ilokano migrants settled in this area during the twentieth century, the narrow
strip of lowlands sandwiched between the Sierra Madre mountains and the sea has historically been home to a unique mix of Agta (that is, phenotypically Negrito2) and a
minority of non-Agta Filipinos who have apparently borrowed some or most of the
forms in their languages from their Agta neighbors. All of these groups speak Austronesian languages today, although the Agta must have spoken non-Austronesian languages
at some point in the past, since their ancestors were present in the Philippines probably
tens of thousands of years before the arrival of Austronesian-speaking peoples (cf. Reid
1987, 1994; Bellwood 1997; Blust 2005). Considering the isolation of this part of the
Philippines, it should come as no surprise that—with the exception of Kasiguranin
1. Many thanks to all of our language consultants and those who facilitated our research in this
region, as well as to Dr. Robert Blust and two anonymous referees whose feedback contributed to the revision of this paper. Any errors are ours alone. A complete list of abbreviations
may be found in appendix 1. ISO 639-3 codes are given in square brackets immediately following the first mention of their language names.
2. The term Negrito is used primarily in western academic literature, and we will use it here,
although we would have preferred to promote the much more neutral term “Black Filipino” to
avoid preconceived notions associated with the diminutive “Negrito” (literally ‘small black
person’ < Spanish), and to remind readers that, as Thomas Headland states, “these are people
who have evolved right along with the rest of us into the 20th century” (1997:607).
Oceanic Linguistics, Volume 52, no. 1 (June 2013)
© by University of Hawai‘i Press. All rights reserved.
126
OCEANIC LINGUISTICS, VOL.
52, NO. 1
(KAS) [ksn], the descendant of an early Tagalog dialect that borrowed heavily from Agta
languages—the languages spoken in this area form a genetic subgroup, called Northeastern Luzon (NELUZ), with at least five members: Dupaningan Agta (DUP), Pahanan
Agta (PAH, sometimes referred to as Palanan Dumagat or Palanan Agta), Dinapigue
Agta (DIN), Casiguran Agta (CAS, sometimes called Casiguran Dumagat), and Nagtipunan Agta (NAG).3 A sixth language, Paranan (PAR), spoken by non-Agta, is tentatively
included in the NELUZ subgroup due to the overwhelming lexical similarities to Pahanan Agta, although key differences in its functors point to a possible origin outside of
the subgroup.
1.1 RELATIONSHIPS TO OTHER NEGRITO FILIPINO GROUPS.
A number of other Negrito Filipino groups exist in areas adjacent to where NELUZ languages are spoken (see map 1), yet none of their languages appear to have any immediate relationship to NELUZ. These neighboring languages include Central Cagayan Agta
(or “Labin” Agta, CCAGTA), which belongs to the Cagayan Valley (CV) subgroup.4
To the immediate south are the languages Northern and Southern Alta [aqn and agy,
respectively], as well as Umiray Dumaget [due]: all are spoken by Negrito Filipino populations, but none shares much with NELUZ save for some lexical items that most likely
were borrowed, and a couple of phonological innovations: Low Vowel Fronting in
Umiray Dumaget (with a similar phenomenon in Northern and Southern Alta), which is
also found in Inagta Alabat [dul] in Quezon Province and Manide [abd] in the northern
Bikol Peninsula (Lobel 2010); and *j > d in Northern and Southern Alta, Arta, Ilokano,
Umiray Dumaget, Manide, Inagta Alabat, and the Greater Central Philippine languages.
A chain of Negrito Filipino groups actually continues almost uninterrupted from
Dupaningan Agta in the north through Alta, Arta [atz],5 Umiray Dumaget, and Remontado Dumagat [agv], down through Inagta Alabat in Quezon Province, and Manide in
Camarines Norte. These latter groups do not appear to share any innovations with the former groups, yet it is likely that they were at least in casual contact at some early point, as—
with the mountain range serving as a formidable obstacle to land travel—travel along the
eastern coast of Luzon would have had to be primarily by sea (or at least along the coast,
possibly by walking), and there would have been few other groups for residents of eastern
Luzon to interact with other than other Negrito Filipinos. This has been the case at least
since the time of the Spanish occupation of the Philippines: “By the time of the Spanish
3. Two or three other Agta dialects are reported to exist in the inland areas of Palanan,
Dinapigue, and San Mariano towns, but we were unable to reach those areas. It is expected
that they are closely related to the languages covered in this study. Note that the abbreviations
given in this paragraph are the ones used throughout this paper. The ISO 639-3 codes for these
languages are: Dupaningan Agta [duo], Pahanan Agta [apf], Casiguran Agta [dgc], and
Paranan [prf]. Dinapigue Agta and Nagtipunan Agta do not have ISO 639-3 codes but are
closely related to Pahanan Agta and Casiguran Agta, respectively.
4. The Cagayan Valley subgroup includes Gaddang [gad], Itawis [itv], Central Cagayan Agta
[agt], Ibanag [ibg], Atta [att], Yogad [yog], and Isnag [isd], and possibly Malaweg (no ISO
639-3 code, erroneously conflated with Itawis), with different authors varying slightly as to
which languages they include.
5. Although we were unable to locate any Arta in their former site in Aglipay and Maddela
towns, Quirino Province, Lawrence Reid (pers. comm., October 10, 2010) reports that he has
recently discovered another group of Arta speakers in the same general area.
THE NORTHEASTERN LUZON SUBGROUP
127
arrival in the archipelago, most of the Philippine Negrito groups had already been ‘minoritized’ and driven into remote areas by the Malay ethnic groups. By contrast, in the southeastern region of Luzon (present Quezon), the Aetas [that is, Negrito Filipinos] and other
Negrito groups were still a majority compared to the Malay people6 when the Spanish first
came to the area (c. 1571, according to a Spanish document)” (Goda 2003:183‒84).
MAP 1. NEGRITO FILIPINO GROUPS IN AND AROUND
NORTHEASTERN LUZON
6. The use of the term “Malay people” to refer to non-Negritos reflects a widespread misunderstanding in the Philippines, dating from early twentieth-century wave migration theories that
claimed that the ancestors of most non-Negrito Filipinos came from Malaysia and Indonesia.
Although these theories have long since been debunked by historians (for example, Scott
1984, 1992) and are contradicted by the linguistic and archaeological evidence, they persist in
the Philippines and, in fact, are still being taught in the Philippines’ national elementary and
high school curriculum.
128
OCEANIC LINGUISTICS, VOL.
52, NO. 1
1.2 HISTORICAL LINGUISTIC GEOGRAPHY OF EAST COAST
NORTHERN LUZON. The NELUZ languages occupy the area between the small
island of Palaui, just off the northeastern coast of Luzon, and Dinalungan town (see map
1). Until recently, this area served as a buffer zone on the east coast of northern Luzon
between Tagalog and Ilokano (or earlier, Ibanag), as illustrated in map 2. Assuming that
the east coast of northern Luzon was traditionally home mainly to Agta who spoke
NELUZ languages, we can formulate a hypothesis about what must have taken place
MAP 2. NORTHEASTERN LUZON AS A BUFFER ZONE
BETWEEN TAGALOG AND ILOKANO
THE NORTHEASTERN LUZON SUBGROUP
129
with the early arrival of non-Agta in the area. In the northernmost area of northeastern
Luzon, there do not appear to be any long-standing non-Agta communities. To the south,
people on the east coast of Luzon east from Laguna de Bay (that is, the towns of General
Nakar, Infanta, Real, and Mauban) speak one of the Quezon dialects of Southern Tagalog. North of this area, however, the oldest non-Agta town is Kasiguranin-speaking
Casiguran, founded in 1607. The oldest true Tagalog-speaking town in this area, Baler,
was reportedly founded even more recently than Casiguran.
At present, Tagalog is spoken as the majority language as far north as Baler, Dipaculao,
and Dinalungan. The native language in the town center of Casiguran is Kasiguranin (the
descendant of an early Tagalog dialect that borrowed lexicon heavily from NELUZ Agta
languages). To the north of Casiguran, other than Palanan town, are four small, primarily
Ilokano-speaking towns—Dilasag, Dinapigue, Maconacon, and Divilacan—which do
not appear to be particularly old as non-Agta settlements (and which, with the exception of
Dilasag, have less than 5,000 residents each, making them the three least-populated coastal
towns on any major island in the entire Philippines).7 The town of Casiguran is probably
one of the oldest non-Negrito Filipino settlements on the entire east coast of northern
Luzon. Kasiguranin is the result of an early Tagalog dialect that borrowed a large amount
of lexicon from the surrounding Agta languages. Paranan, the language of the non-Agta
residents of Palanan town, is similar to the Pahanan language spoken by the Agta of Palanan town, save for the *r > /h/ change in Pahanan Agta (an areal feature in the Agta languages of northeastern Luzon), a small percentage of lexical differences, and a non-Agta
substratum in Paranan.8
It appears that the first non-Agta Filipinos to permanently settle in the Casiguran area
were Tagalogs migrating northward from Quezon Province,9 who traveled up the coast,
finding Casiguran, likely already a locally important, relatively large Agta settlement
whose bay is conveniently protected from storms by the San Ildefonso Peninsula. With
relatively few other non-Agta to interact with in their new home, the early Casiguran
Tagalogs began to borrow heavily from the language of the Casiguran Agta (which itself
was the descendant of a language that had been adopted from non-Agta Austronesians
centuries or millennia earlier). As they still would have had some communication with
Tagalogs to the south, the Casiguran Tagalogs had enough need for Tagalog that they
retained a significant portion of their native language, while borrowing considerably from
Casiguran Agta and the other local Agta languages. The resulting Kasiguranin language
is a unique mixture that, after several generations, became incomprehensible to Tagalogs
and, until recently, was reportedly incomprehensible even to the Casiguran Agta whose
ancestors contributed approximately half of the Kasiguranin lexicon. The first author calculated 68 percent cognates between Casiguran Agta and Kasiguranin based on a 2807. Note, however, that the ancestors of the modern Agta groups have almost certainly been in the
region for much longer, and the fact that the names of many of these towns, as well as of the
barangays thereof, begin with the Agta locative marker di suggests that they were probably
founded on the sites of earlier Agta settlements.
8. Less than 2 percent of non-Tagalog, non-Agta lexicon is shared exclusively between Kasiguranin and Paranan, implying that there was little if any significant contact between the nonAgta residents of Paranan town and those of Casiguran town that did not also include considerable numbers of Agta.
9. Until 1979, Aurora Province was a part of Quezon Province, which was previously called Tayabas.
OCEANIC LINGUISTICS, VOL.
130
52, NO. 1
word list (see table 1), while Headland (1975) calculated 77 percent cognates. Kasiguranin data are included in this paper for the sake of comparison, even though it is not a
member of the NELUZ subgroup. The authors are preparing a separate paper on this subject. (Lobel and Robinson n.d.). Like Kasiguranin, the Paranan language is spoken by
non-Agta, but Paranan is a genetic member of the NELUZ subgroup along with the five
Agta languages. The first author calculated 81 percent cognates between Pahanan Agta
and Paranan (see table 1), while Headland (1975) found 85 percent cognates.
TABLE 1. COGNATE PERCENTAGES
PAH
PAR
CAS
NAG
KAS
65%
54%
60%
58%
49%
81%
78%
72%
72%
71%
60%
78%
77%
68%
55%
DUP
PAH
PAR
CAS
NAG
1.3 PREVIOUS REFERENCES TO THE NORTHEASTERN LUZON
LANGUAGES. A number of previous works have investigated the subgrouping of
one or more of the NELUZ languages, but often with very little data, and with as few as
only one of these languages included.
Tharp (1974a) classifies Casiguran Agta as a primary branch of Northern Cordilleran,
but does not include any of the other NELUZ languages in his study. Headland (1975)
considers Casiguran Agta, Kasiguranin, Pahanan Agta, Paranan, and Dupaningan Agta
on the bases of shared cognate percentages and intelligibility testing, and is thus the first
author to define a NELUZ subgroup similar to the one we are proposing. However, the
methodology used is problematic, as (i) lexicostatistics does not differentiate retentions,
borrowings, and shared innovations, which is especially important in the case of contact
languages like Kasiguranin and Paranan; and (ii) intelligibility testing has never been
demonstrated to be a reliable basis for subgrouping, and can give misleading results when
used to determine the relationships between the languages of two communities that have
always historically interacted and traded with one another, as is the situation with Kasiguranin and Casiguran Agta. Furthermore, as Blust (2000:327) points out, “we cannot tell
when [lexicostatistics] gives valid results and when it does not. … Since we now know
that languages vary widely in retention rate of basic vocabulary over lengthy intervals of
time, lexicostatistics must be seen as an unreliable foundation for subgrouping hypotheses
that are not independently confirmed by the evidence of exclusively shared innovations.”
Most modern works (for example, Reid 2006, Lewis 2009) assume that the NELUZ languages form a subgroup within Northern Cordilleran, which is in turn a subgroup of Northern
Luzon (Cordilleran),10 but there are no published works justifying this subgrouping.
Of the NELUZ languages described in this paper, only Casiguran Agta (Headland
and Headland 1974, Headland and Healey 1974, Headland and Wolfenden 1967) and
Dupaningan Agta (Robinson 2011) have been described in any detail. Vanoverbergh
10. This subgroup was previously referred to as “Cordilleran,” but we have adopted the name
“Northern Luzon” to reflect the current usage of Lawrence Reid, the leading scholar on the
languages of the northern Philippines.
131
THE NORTHEASTERN LUZON SUBGROUP
(1937) includes wordlists of Kasiguranin (his “Casiguran”) and Casiguran Agta (his
“Casiguran Negrito”). Pahanan Agta and Paranan are included in Ethnologue and
other surveys, although there are no published data available on either. The Nagtipunan
Agta and Dinapigue Agta varieties discussed here have not been previously mentioned
in the literature.
1.4 THE DATA AND ELICITATIONS. The data used in this study were collected in September and October of 2006 by the current authors. For each speech variety, a
1,000-item wordlist was elicited along with between 100 and 200 sentences, usually with a
small group of native speakers. Much less data were elicited for Dinapigue Agta, for
which the authors were only able to spend one afternoon with a single native speaker. Data
were usually elicited via Tagalog (which was said to be better understood by the residents
of these areas than Ilokano), except for Dupaningan Agta, where the medium was Ilokano,
and Nagtipunan Agta, where a mixture of Tagalog and Ilokano was used (since Ilokano is
much more prevalent on the western side of these mountains than on the eastern side).
2. THE EVIDENCE FOR A NORTHEASTERN LUZON SUBGROUP.
As mentioned above, previous authors have argued or assumed that most or all of the languages of northeastern Luzon form a single subgroup, but without presenting phonological, morphological, or lexical evidence to support this subgrouping. The goal of this
section is to present such evidence.
2.1 PHONOLOGICAL EVIDENCE. This section will outline the various phonological shifts that have occurred in the NELUZ languages. When they are similar to the
NELUZ forms, Kasiguranin forms are given in parentheses for the sake of comparison.
2.1.1 Consonant reflexes. Table 2 illustrates the phoneme inventory of ProtoNortheastern Luzon as we reconstruct it.
TABLE 2. PROTO-NELUZ PHONEME INVENTORY
CONSONANTS
*p
*b
*m
*w
*t
*d
*s
*n
*l
*r
*y
*k
*g
VOWELS
*ʔ
*h
*i
*e
*ə
*a
*u
*o
*ŋ
2.1.1.1 Reflexes of PMP *R. PMP *R is reflected as /g/ in the NELUZ languages, as
illustrated in forms (1)‒(9).11
(1) *qaRta ‘outsiders, alien people’ > DUP, PAR, CAS, NAG ágtaʔ (KAS ágtaʔ)
‘Negrito Filipino person’12
11. Reconstructions are from Blust (1972, 1983‒84, 1987, 1999, 2006, 2009) and Blust and Trussell
(in progress) unless otherwise noted. In all numbered examples in this paper (except for
[95]‒[121]), all protoforms are Proto‒Malayo-Polynesian (PMP) unless otherwise specified.
OCEANIC LINGUISTICS, VOL.
132
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
52, NO. 1
*qabaRa ‘shoulder’ > DUP abigíʔ, PAR abagáʔ (KAS abagáʔ)
*daRaq ‘blood’ > PAH, CAS digéʔ, DUP, NAG digíʔ (KAS digíʔ)
*daRat ‘littoral sea’ > DUP, PAR, CAS digɛ́t, PAH, NAG digit (KAS digɛ́t) ‘sea’
*baqəRu ‘new’ > DUP, PAH, PAR, CAS, NAG bigúʔ (KAS bigúʔ)
*diRus ‘bathe’ > DUP dégus, PAH, PAR, CAS, NAG dígus (KAS dígus)
*hulaR ‘snake’ > DUP, PAH, CAS, NAG ulág (KAS ulág)
*niuR ‘coconut’ > DUP, PAH, PAR, DIN, CAS, NAG niyóg (KAS niyóg)
*bəŋəR ‘deaf’ > DUP baŋŋág, PAH, PAR, DIN bəŋŋə́g
The /g/ reflex of PMP *R in both the Cagayan Valley and Northeastern Luzon languages is one reason why previous authors have suggested that they subgroup together.
The NELUZ languages share the *R > /g/ shift with the Cagayan Valley languages, but it
is worth noting that languages further south on the Pacific coast of Luzon also reflect *R
as /g/, including Manide (Lobel 2010), Inagta Alabat (Lobel 2011), and Central Philippine languages like Tagalog and Bikol.
2.1.1.2 Reflexes of PMP *j. PMP *j is reflected as /d/ in the NELUZ languages, with
some exceptions. Among the other Northern Luzon languages, a /d/ reflex of *j only
occurs in Arta, Northern and Southern Alta, and Ilokano (Reid 2006:5‒6). Elsewhere in
the Northern Luzon subgroup—that is, Cagayan Valley and South-Central Cordilleran—
PMP *j is reflected as /g/. Forms (10)‒(18) illustrate the /d/ reflex of *j in the Northeastern Luzon languages.
(10) *pusəj ‘navel’> DUP pusád, PAH, PAR, CAS, NAG pusə́d (KAS pusə́d)
(11) *huaji ‘younger sibling’ > DUP, PAH, PAR, CAS, NAG wadíʔ (KAS wadíʔ)
(12) *qapəju ‘bile, gall’> DUP, PAH, PAR, CAS ápduʔ, NAG apdúʔ
(13) *ə[n]juŋ ‘nose’13 > PAR əddúŋ (KAS əddúŋ)
(14) *qaləjaw ‘day’ > DUP, PAR, CAS, NAG aldéw DIN, PAH áldew (KAS aldéw)
(also ‘sun’ in DUP, PAH, PAR, DIN, KAS)
(15) *quləj ‘maggot’ > DUP, PAR urád, PAH, CAS uhə́d (KAS urə́d) ‘worm’
(PNELUZ *urəd)14
(16) PAN *maja ‘dry’ > DUP, PAH, PAR, DIN, NAG madíʔ
(17) *Rawǝj ‘betel leaf’ > DUP gíwad, PAH, PAR giwə́d, DIN, CAS, NAG gawə́d
(KAS gawə́d)
(18) *laja ‘weave’ > DUP, PAH, PAR, CAS ladíʔ (KAS ladíʔ)
In several forms, such as (19)‒(23), *j is reflected as /r/ in the NELUZ languages, or
as /r/ in some languages and as /g/ in others. This could reflect a sporadic shift of *j > *d
> /r/, but independent evidence for the intermediate step of *d > /r/ is lacking.
12. Glottal stop is not written word-initially because it is not phonemic in this position in any of
the NELUZ languages.
13. This etymology was attributed to Proto-Western Malayo-Polynesian (PWMP), but since this
is not a widely accepted subgroup, we have regularized this and other similar etymologies in
this paper to PMP.
14. Note that the Agta languages (except Dupaningan) reflect the shift *r > h.
THE NORTHEASTERN LUZON SUBGROUP
133
(19) *ŋajan ‘name’ > DUP nagen, PAH ŋahán, PAR ŋarán, CAS, NAG ŋahɛ́n (KAS
ŋarán)
(20) *sujud ‘fine-toothed comb for delousing’ > DUP, NAG súgod, DUP, PAH,
DIN, CAS suród, PAR surúd (KAS suród) (note DUP suród ‘comb’ vs. súgod
‘fine-toothed comb for delousing’)
(21) *pajay ‘rice in field’ > DUP, PAR paráy, PAH, CAS, NAG paháy (KAS paráy)
(22) *qikəj ‘cough’ > DUP ikár, PAH, DIN, CAS, NAG ikə́h, PAR ikə́r (KAS ikə́r)
(23) PNLUZ *pajǝs ‘wind’ > DUP parás, PAH pahás, PAR parə́s, CAS, NAG pahə́s
(KAS parə́s)
There is at least one item, (24), in which *j appears to be reflected as /g/ in all of the
NELUZ languages for which we have data.
(24) *qujiŋ ‘charcoal’ > DUP, PAH, PAR, CAS, NAG ugíŋ (KAS ugíŋ)
However, David Zorc (pers. comm., August 31, 2011) suggests that there is evidence for
a northern Philippine doublet *u:Riŋ alongside PMP/PPH *qujiŋ, so these reflexes would
be regular.
Although the Cagayan Valley languages generally have a /g/ reflex of *j, there are also a
few exceptions in which these languages have a /d/ or /r/ reflex, such as forms (25)‒(30).
(25) *palaj ‘palm (of hand)’ > Ilokano palad, Gaddang pallad
(26) *huaji ‘younger sibling’ > Ilokano adi
(27) *pusəj ‘navel’ > Central Cagayan Agta pusad (cf. DUP pusád)
(28) *qapəju ‘bile, gall’ > Central Cagayan Agta apdu (cf. DUP apdúʔ)
(29) *qikəj ‘cough’ > Central Cagayan Agta ikar (cf. DUP ikár)
(30) *qaləjaw ‘day’ > Central Cagayan Agta araw (cf. Tagalog araw)15
Reid (2006:9) notes that the reflex of *palaj has a final /d/ in all of the languages of the
northern Philippines, so it seems likely that this was either a borrowing or a sporadic shift
in the protolanguage. Central Cagayan Agta, especially the Gattaran dialect from which
forms (27)‒(30) are taken, has been in extensive contact with Dupaningan Agta, so it is
likely that forms (27)‒(29) have been borrowed from the latter. Form (30) looks like a
borrowing from Tagalog.
Thus, all of the NELUZ languages share the shifts *R > /g/ and *j > /d/, a combination
of changes that is not found in any other language in the northern Philippines (Ilokano
and Arta have /d/ reflexes of *j, but *R split to /g/ and /r/ without apparent conditioning in
both languages; see section 3), but which is found further south in Manide, Inagta Alabat,
and the Greater Central Philippine languages. Given the importance previous authors
have placed on *R and *j for subgrouping purposes, this combination of phonological
innovations is considered strong evidence supporting the existence of a NELUZ subgroup
distinct from the other branches of Northern Luzon.
15. Thanks to an anonymous reviewer for bringing to our attention these forms from Mayfield’s
(n.d.) unpublished dictionary of the Gattaran dialect of Central Cagayan Agta, which has been
in close contact with Dupaningan Agta.
134
OCEANIC LINGUISTICS, VOL.
52, NO. 1
2.1.1.3 Reflexes of PMP *z. PMP *z is regularly reflected as /d/ in the NELUZ languages, as illustrated in forms (31)‒(35). Thus, PMP *z, *j, and *d all merged as /d/ in
these languages.
(31) *zalan ‘road, path’ > DUP, PAH, PAR, CAS, NAG dilán (KAS dalán)
(32) *zaRum ‘needle’ > DUP dágum, PAH, PAR, CAS, NAG digúm (KAS digúm)
(33) *tazəm ‘sharp’ > DUP tadám, PAH, CAS, NAG tadə́m
(34) *quzan ‘rain’ > DUP, PAR, NAG udén, PAH udín, CAS udɛ́n (KAS udén)
(35) *azani ‘near’ > PAH, PAR, adɛ́niʔ (KAS adɛ́ne)
2.1.1.4 Reflexes of PMP *r. PMP *r is reflected as /l/ or /r/ in the NELUZ languages,
without apparent conditioning, much like the Cagayan Valley languages, which also
reflect a sporadic *r > l shift (Tharp 1974a). Evidence to support this is limited, however,
because most of the PMP etymologies with *r have relatively obscure meanings that are
not included on the second author’s 1,000-item wordlist. Therefore, we have largely had
to rely on the more extensive lexical data available for Dupaningan Agta (Robinson
2011) and Casiguran Agta (Headland and Headland 1974). Forms (36)‒(40) illustrate the
/r/ reflex of *r:
(36) *rəbaq ‘to collapse (house)’ > PAH nahabáʔ, PAR narbáʔ (from *marəbbaʔ)
(37) *rəpuk ~ *rəput ‘rotten to brittleness, of wood’ > DUP rappótak ‘rotten, of
wood’
(38) *burik ‘speckled, of hens, etc.’ > DUP burék ‘spotted, striped, speckled,
multi-colored’
(39) *qarimaw > DUP sarimaw ‘civet cat’
(40) PPH *warak > DUP warék ‘scatter’; CAS wahak-wahak ‘to be continually
dropping large pieces of things being carried over the trail (e.g., sweet potatoes, pencils, books)’ (Headland and Headland 1974)
Compare these with forms (41) and (42), which illustrate the /l/ reflex:
(41) *rəpag > DUP lappág ‘slap’
(42) PPH *haprus > DUP aplos ‘massage, rub’
2.1.1.5 Reflexes of PMP *q. PMP *q was usually lost in NELUZ, as illustrated in
forms (43)‒(46).
(43) *baqəRu ‘new’ > DUP, PAH, PAR, CAS, NAG bigúʔ (KAS bigúʔ)
(44) *taqi ‘feces’ > DUP, PAH attáy, PAR, NAG əttáy, CAS ətáy ~ əttáy (KAS
əttáy)16
(45) *tuqəlaŋ ‘bone’ > DUP, PAH, PAR, CAS, NAG tuláŋ (KAS tuláŋ)
(46) *bituqən ‘star’ > DUP, PAH, PAR, CAS bitón, NAG bitún
In at least one case, (47), some of the languages do show a medial glottal stop. More
data are needed to determine whether this is an exception or reflects some as yet undetermined regularity.
16. After the loss of medial *q in this form, a schwa was epenthesized to preserve the preferred
disyllabic syllable structure.
135
THE NORTHEASTERN LUZON SUBGROUP
(47) *liqəR ‘neck’ > DUP leːg, PAH laɁég, PAR allíg, DIN liɁég, CAS, NAG liːg
(KAS əllég)
2.1.1.6 Reflexes of PMP *h. PMP *h was lost in initial position, as illustrated in
forms (48)‒(51).
(48) *hadiRi ‘house post’ > DUP, PAH, PAR, CAS, NAG adigíʔ (KAS adigíʔ)
(49) *huRas ‘wash’ > DUP, PAH, CAS, NAG ugés, PAR ugás (KAS ugás) ‘wash
(general, or of hands)’
(50) *hulaR ‘snake’ > DUP, PAH, CAS, NAG ulág (KAS ulág)
(51) *huaji ‘younger sibling’ > DUP, PAH, PAR, CAS, NAG wadíʔ (KAS wadíʔ)
In medial position, however, PMP *h is sometimes reflected as a glottal stop in PAH,
PAR, DIN, and CAS, while it disappears in NAG and DUP, as illustrated in forms (52)‒(54).
(52) *buhək ‘hair’ > DUP, NAG buːk, PAH, PAR buɁók, DIN, CAS buɁúk (note
that KAS buhók continues the Tagalog form)
(53) *dahun ‘leaf’ > DUP, NAG doːn, PAH dáɁon, PAR dúɁun, CAS dəɁón (KAS
duɁón)
(54) *bahaR ‘loincloth’ > DUP beːg, PAH, PAR, DIN baɁeg, CAS, NAG biːg (KAS
baɁeg)
On the other hand, there are quite a few words in which *h was lost in all of the
NELUZ languages, such as (55)‒(61).
(55) *kahiw ‘wood’ > DUP, PAH, PAR, CAS, NAG, kayúʔ (KAS kayóʔ)
(56) *qihu ‘shark’ > DUP, NAG iyúʔ, PAH ayyúʔ, CAS iyóʔ (KAS iyúʔ)
(57) *duha ‘two’ > DUP, PAH, PAR, DIN dúwa, CAS əduwá (KAS dúwa)
(58) *unahik ‘climb’ > DUP, PAH, PAR, DIN, CAS, NAG unék (KAS unék)
(59) *anahaw ‘palm tree’ > DUP, PAH, PAR, DIN, CAS anáw (KAS anáw)
(60) *di lahud ‘downstream’ > DUP, PAH, CAS, NAG dilód (KAS dilód)
(61) *luhəq ‘teardrop’ > DUP, PAH, PAR luwáʔ, CAS, NAG lə́wa
2.1.1.7 *s > /h/. The sporadic and unconditioned shift of *s to /h/ is found in all members of the NELUZ subgroup, except Casiguran Agta.17 Table 3 lists selected forms in
which there is variation between /s/ and /h/ in the Northeastern Luzon languages. The *s
> /h/ forms are shaded, and dashes indicate that no cognate was found in that language.
TABLE 3. VARIATION BETWEEN /s/ AND /h/
‘thick bamboo’
‘lie on stomach’
‘close eyes’
3PL.NOM
1INCL.DUAL.NOM
‘hiccup’
DUP
—
hakab
kisap
hidi
hikitá
hálduʔ
PAH
buwaŋhina
—
kihəp
hide
hikitá
sə́lduʔ
PAR
buwaŋhina
—
kisəp
hidi
sikitá
sálduʔ
DIN
—
hakəb
kihəp
hide
hikitá
sə́lduʔ
CAS
buwaŋsina
sakəb
kisəp
side
sikitá
sə́lduʔ
NAG
—
hakəb
—
sidi
sikitá
sə́lduʔ
17. Since most of Kasiguranin’s non-Tagalog vocabulary is from Casiguran Agta, this shift is not
found in Kasiguranin either.
136
OCEANIC LINGUISTICS, VOL.
52, NO. 1
2.1.2 Vowe l reflexes. PNELUZ had a six-vowel system /a e i o u ə/, as illustrated in
table 2 above. PMP *i, *u, and *ə were reflected as *i, *u, and *ə, respectively, in ProtoNortheastern Luzon. PMP *a was reflected as PNELUZ *a in most environments, but as
*e after a voiced stop /b d g/, due to Low Vowel Fronting (cf. 2.1.2.2). The source of
PNELUZ *o is unclear. This section will outline the various shifts affecting vowels in the
NELUZ languages.
2.1.2.1 Schwa and gemination. In all of the NELUZ languages except Casiguran
Agta, consonants geminate after a schwa, which has been characterized as “inherently
short” (Blust 2009:548), as illustrated in forms (62)‒(68). We do not have enough data to
determine whether this gemination also occurred in Nagtipunan Agta.
(62) *təlu ‘three’ > DUP, PAH, PAR, DIN talló, CAS ətəlu (KAS təlló)
(63) *əpat ‘four’ > DUP, PAR appát, PAH, DIN əppát, CAS əpát (KAS əppát)
(64) *ənəm ‘six’ > DUP ánnam, PAH ənnəm, PAR annə́m, CAS ənəm (KAS
ənnəm)
(65) *bəŋəR ‘deaf’ > DUP baŋŋág, DIN, PAH, PAR bəŋŋə́g
(66) *qəlad ‘wing’ > DUP, PAH, NAG allád ‘feather’
(67) *ləsuŋ ‘mortar’ > DUP, PAH, PAR lassóŋ, CAS lúsoŋ, NAG lasúŋ (KAS ləssóŋ)
(68) *təkən ‘pole, usually of bamboo, used to propel a boat or raft’ > DUP takkán,
PAH, PAR təkkə́n, CAS təkə́n (KAS təkkə́n)
Occasionally, gemination does not occur where it is expected, as in the Dupaningan
Agta reflex of PMP *bəRas ‘uncooked rice’ and the Pahanan Agta and Paranan reflexes
of PMP *kəzut ‘pinch’, examples (69) and (70), respectively.
(69) *bəRas ‘uncooked rice’ > DUP bagáh, PAH, PAR baggés, DIN baggés, CAS,
NAG bəgís, (KAS bəgés ~ bəggés)
(70) *kəzut ‘pinch’ > DUP kaddút, PAH, CAS kədút, PAR, NAG kadút
Gemination after the schwa also occurs in the Cagayan Valley languages and Ilokano
(see, for example, Tharp 1974a), as well as in a number of other Philippine languages (for
example, some Manobo and Sama-Bajaw languages, and phonetically in Maranao). It is
likely that this was also a phonetic feature of PNELUZ but was subsequently lost in Casiguran Agta. There is even some evidence that gemination after the schwa was still present
in early twentieth-century Casiguran Agta, as it appears that Vanoverbergh (1937) documented geminate variability in Casiguran Agta on his trip through northern Luzon in
1936 and 1937, such as his kəttíhek ‘small’ and ənnə́m ‘six’, which we recorded as kətihə́k
and ənəm, respectively. In most cases, however, Vanoverbergh recorded a singleton following a schwa. In our own data, there was at least one instance of a geminate alternating
with a singleton in Casiguran Agta, ətáy ~ əttáy ‘feces’. Note also that gemination after
the schwa does occur in the Kasiguranin forms, suggesting that Kasiguranin borrowed
these forms from Casiguran Agta before the loss of gemination in the latter language.
2.1.2.2 Low Vowel Fronting. Low Vowel Fronting (LVF), the shift of /a/ to a mid or
high front vowel after a voiced stop, occurs sporadically in all of the NELUZ languages.18
18. See Blust (2000) for a discussion of this phenomenon in Sarawak.
THE NORTHEASTERN LUZON SUBGROUP
137
This process is found in all of the NELUZ languages, as well as in Southern Alta (Reid
1991),19 Umiray Dumaget (Himes 2002, Lobel 2012), Manide (Lobel 2010), and Inagta
Alabat (Lobel 2011). Interestingly, this shift is not found in the same words in all languages, and there is no apparent conditioning to explain which words will have LVF and
which will not. For some forms, LVF is found in all of the NELUZ languages, while for
others, no language reflects LVF. For many other forms, LVF is found in some languages
but not others. Note that all the LVF forms in KAS are also found in CAS, suggesting that
LVF may not have ever existed as a process in KAS, but was simply the result of lexical
borrowing. All but one of the LVF forms in PAR are also found in PAH, suggesting that
LVF in PAR is also primarily the result of lexical borrowing, but also reflecting the greater
influence of Agta languages on PAR than on KAS. Examples of LVF in NELUZ languages are given in table 4, where the cells containing forms reflecting LVF are shaded.
Kasiguranin forms are given in the last column for comparison. Since we have much less
data for Dinapigue Agta, we have included it in the Pahanan Agta column and noted
where the Dinapigue forms are the same (=D) or different. Where no Dinapigue form is
listed, we lack data.
2.2 MORPHOLOGICAL EVIDENCE. This section presents an analysis of the
pronouns and case markers of the NELUZ languages and discusses their implications for
subgrouping. Pronouns and case markers, as well as other types of functors, often serve
as strong evidence for subgrouping (cf. Zorc 1977, 1978; McFarland 1974; Lobel 2012),
especially where lexical evidence is ambiguous or even misleading due to heavy borrowing from more prestigious languages. For example, Zorc (1978:510) argues that “a language is more readily defined by its grammar than by its lexicon,” and that functors have
“obvious importance within any given speech variety” due to their “high text frequency”
and a “tendency towards stability and a low rate of replacement.” In most instances, the
Kasiguranin forms derive primarily from the Tagalog substratum, and so were not used
for reconstructing PNELUZ. Note that the full sets of pronouns, case markers, and other
functors are presented in appendix 2.
2.2.1 Pronouns. Table 5 presents the reconstructions of the PNELUZ pronouns. A few
general comments are in order. First, note that the bases are the same for the Topicalized
Nominative and the Oblique sets, with the PMP Nominative bases having replaced the
PMP Oblique bases, which is not at all uncommon in Philippine-type languages (see
Lobel 2012). The Topicalized Nominatives are formed by attaching *si- to the base, while
the shorter Nominative pronouns are usually an enclitic form of the base, with the exception of 1SG and 2SG, which have different forms, and also 3SG and 3PL, which are identical
to the Topicalized forms and do not appear to be enclitic. The Genitive set is composed of
enclitic monosyllabic pronouns largely identical to the reconstructed PPH set.
Various phonological shifts have affected the pronouns. The sporadic shift of *s > h in
some of the languages was discussed in 2.1.1.7. Second, Low Vowel Fronting has
affected the 3PL form, resulting in the base *-di from earlier *-da. Third, monophthongization has taken place in the 2SG.TOP/OBL base *-kaw, resulting in the base *-ko in all of
the languages except Dupaningan Agta. Fourth, the *a of the base *akən ‘1SG’ is lost in
19. In Northern Alta, *a became /ə/ rather than /i/ or /e/ in this environment.
OCEANIC LINGUISTICS, VOL.
138
52, NO. 1
all but the southern languages (Casiguran Agta and Nagtipunan Agta), resulting in the
form hikən from PNELUZ *siyakən. In CAS and NAG, on the other hand, the vowel of
Topicalized formative *si- is lost in the 1SG.TOP form, resulting in the form sakən. In the
Oblique form, however, neither segment is lost in CAS and NAG, resulting in the form
diyakən. It should also be noted that reduction in the 1SG.TOP/NOM form and not in the
other persons is common throughout the Philippines (cf. Lobel 2012).
TABLE 4. LOW VOWEL FRONTING IN NELUZ LANGUAGES
PMP
*baqəRu
*quzan
*daRaq
*daRat
*qaləjaw
*haRəzan
*tabəq
*laja
*azani
*maja
*balay
‘new’
‘rain’
‘blood’
‘sea’
‘day, sun’
‘ladder, stairs’
‘fat’
‘weave’
‘near’
‘dry’
‘house’
DUP
bigú
udén
digíʔ
digét
aldéw
ágden
tabíʔ
ladíʔ
—
madíʔ
biláy
PAR
bigú
udén
digí
digét
aldéw
agdénan
tabíʔ
ladíʔ
adéniʔ
madíʔ
biláy
CAS
NAG
bigú
bigú
udén
udén
digéʔ
digíʔ
digét
digít
aldéw
aldéw
agdénan ágen
tabíʔ
tabíʔ
ladíʔ
—
—
—
—
madíʔ
bilɛ
—
KAS
bigú
udén
digí
digét
aldéw
agdénan
tabíʔ
ladíʔ
adéne
—
baláy
dinóm
dilán
bilúʔ
daggíʔ
PAH, DIN
bigú
udín
digéʔ
digít
áldew
agdénan
tabíʔ
ladíʔ
adéniʔ
madíʔ
biláy
(=D)
dinúm
dilán
bilóʔ
daggíʔ
*danum
*zalan
*balu
PNELUZ
*madəggáʔ
*basəq
*sida
‘water’
‘road’
‘widow’
‘heavy’
dinóm
dilán
bilú
dəggí
dinúm
dilán
bilóʔ
dəgíʔ
dinúm
dilán
bilúʔ
dəggí
danóm
dalán
balóʔ
dəggá
‘wet’
‘3PL.NOM’
bésa
hidí
bisáʔ
hide
bisáʔ
hidiʔ
bisáʔ
side
bisáʔ
sidiʔ
basá
siláʔ
dikə́p
dikə́p
dikəp
dikəp
dakə́p
digúm
abéŋ
baggés
dikə́l
digúm
abéŋ
bəgís
díkkəl
digúm
abíŋ
bəgís
dikkə́l
digúm
abéŋ
bəg(g)és
díkkəl
ugás
gawəd
batú
ugés
gawəd
bitúʔ
ugás
gawəd
bitúʔ
ugás
gawəd
báto
balón
bilón
bilón
balón
gamót
gimót
gimút
gamót
*dakəp
‘catch’
dakkap
*zaRum
*qabaŋ
*bəRas
*dakəl
‘needle’
‘boat’
‘uncooked rice’
‘large’
*huRas
*Rawəd
*batuh
‘wash’
‘betel leaf’
‘stone’
*balun
‘provisions’
*Ramut
‘root’
*bayu
‘pound rice’
báyo
biyúʔ
biyúʔ
bayó
*gatəq
*bahaR
*qabaRa
*dahun
*baRiuh
‘coconut milk’
‘loincloth’
‘shoulder’
‘leaf’
‘storm’
digúm
abíŋ
baggés
dakə́l
(=D)
ugés
ugés
giwad
giwəd
bitúʔ
bitúʔ
(D bitóʔ)
bilón
bilún
(D bilón)
ramót (< gimút
ILK)
(=D)
biyuʔ
báyyuʔ
(D bayúʔ)
gittáʔ
gatáʔ
beeg
baʔeg
abigíʔ
—
doon
dáʔon
bágyo
bágyo
gatáʔ
baʔeg
abagáʔ
dúʔun
bágyu
gatáʔ
biig
—
dəʔón
bágyo
gatáʔ
biig
—
doon
bágyo
gatáʔ
baʔeg
abagáʔ
duʔón
bágyo
hidá
sidá
sidá
sidá
sidá
PPH *sidaq ‘main course’
dágum
abáŋ
bagah
dakal
sidá
139
THE NORTHEASTERN LUZON SUBGROUP
TABLE 5. PROTO-NORTHEASTERN LUZON PRONOUNS
1SG
2SG
3SG
1EXCL
1INCL.DU
1INCL.PL
2PL
3PL
*
TOP
NOM
GEN
OBL
*si-akən
*si-kaw
*si-ya
*si-kami
*si-kita
*si-kitam
*si-kam
*si-di
*=ək
*=ka
*si-ya
*=kami
*=kita
*=kitam
*=kam
*si-di
*=ku
*=mu
*=na
*=mi
*=ta
*=tam
*=muy
*=di
*ni/di-akən
*ni/di-kaw (>*-ko)
*di-ya; *ni/di-ko-na*
*ni/di-kami
*ni/di-kita
*ni/di-kitam
*ni/di-kam
*ni/di-di
& OBL BASE
*-akən
*-[i]kaw
*-iya
*-kami
*-kita
*-kitam
*-kam
*-[i]di
NOM
CAS and NAG retain PPH *diya.
DUP, PAH, and DIN innovated the 3SG.OBL form nikuna, which appears to be analyzable as Oblique formative *ni-, 3SG.GEN base *-na, and a medial segment *-ku-. The
exact origin of this medial segment *-ku- is unclear, but a similar medial segment *-kofollowed by the genitive base is found in the oblique pronouns of Paranan and one of the
three competing oblique pronoun sets of Kasiguranin (see table 6). Dupaningan Agta further innovates the Topicalized Nominative form hikuna ‘3SG.NOM’, consisting of the
Nominative formative *hi- (< earlier *si-) plus the same base *-kuna consisting of the
combination *-ku- + *-na ‘3SG.GEN’ as found in the oblique form nikuna. 20
Paranan has a quite different oblique set analyzable as oblique formative *di-, followed by *-ko- plus the genitive base (see tables 5 and 6).
Tables 7a‒d list the PNELUZ, Proto-Northern Luzon (Reid 1979a), and Proto-Northern
Cordilleran21 pronouns (PNCORD, Tharp 1974a). The Proto-Central Cordilleran
(PCCORD, Reid 1974, 1979a) and Proto-Southern Cordilleran (PSCORD, Reid 1979a,
Reid 2009) forms are also included for comparison, where available.
As can be seen in tables 7a‒d, the reconstructed PNELUZ pronouns have changed relatively little from the Proto-Northern Luzon forms reconstructed by Reid (1979a), and
there are very few if any exclusively shared innovations with Proto-Northern Cordilleran
(that is, Proto-Cagayan Valley, if the NELUZ languages are not included). If the PNELUZ
TABLE 6. OBLIQUE PRONOUNS IN PARANAN AND KASIGURANIN
PARANAN
1SG
2SG
3SG
dikókuʔ
dikómuʔ
dikónaʔ
“Tagalog” set
sa ákin
sa iyúʔ
sa kanyáʔ
1EXCL
1INCL.DU
1INCL.PL
2PL
3PL
dikómiʔ
dikótaʔ
dikótam
dikómoy
dikódiʔ
sa ámin
sa átaʔ
sa átam
sa ínyo
sa kanilá
KASIGURANIN
*-ko- set
sakókoʔ
kómoʔ
kónya, koniyáʔ
PNELUZ
*kaʔo- set
GEN
kaóko
*=ku
kaómoʔ
*=mu
—
*=na
sakómeʔ
sakótaʔ
sa kótam
sa ómoy/ komóy
—
—
kootáʔ
—
—
kaónila,
sakaódeʔ
*=mi
*=ta
*=tam
*=muy
*=di
PNELUZ
OBL
*ni/di-akən
*ni/di-kaw (>*-ko)
*ni/di-ko-na,
SNEL *diya
*ni/di-kami
*ni/di-kita
*ni/di-kitam
*ni/di-kam
*ni/di-di
20. Note that Yogad (cf. fig. 2) has takuna ‘3SG.OBL’, takura ‘3PL.OBL’, akuna ‘3SG.POSS’, and
akura ‘3PL.POSS’.
21. Note that Tharp included Casiguran Agta in his Northern Cordilleran subgroup, which likely
influenced his reconstructions.
OCEANIC LINGUISTICS, VOL.
140
52, NO. 1
pronouns do derive from PNLUZ forms as reconstructed by Reid (1979a), then there
appears to have been a reduction in the topicalized formative of *siqi- to *si-, which,
according to Tharp (1974a), also took place in NCORD. Note, however, that Topicalized
Nominatives beginning with a simple *si- (and, therefore, identical to the reconstructed
PMP nominative case marker *si) are found in many Philippine subgroups, and Reid’s
PNLUZ reconstructions appear to have double marking, with *si- followed by *qi-. It
TABLE 7. PRONOUN FORMS
PNELUZ
PNCORD
PNLUZ
a. LONG-FORM NOMINATIVE PRONOUNS
1SG
*si-akən
*si akən
*siyakən
2SG
*si-kaw (>*-ko) *si kaw
*siqikaw
3SG
*siya
*iya, *V[n]su *siya
1EXCL
*si-kami
*si kami
*siqikami
1INCL.DU *si-kita
*si kita
*siqikita
1INCL.PL *si-kitam
*siqikitam
PSCORD
*siyakǝn
*siqika ~ *sikqa
*siya
*dakami ~ *dikami
*daqita ~ *dita ~
*data
*datakayú ~ *ditakayú*
*dakayu ~ *dikayu
*siyak
*siqika
*siya
*siqikami
*siqikita
*si kamu,
*siqikamuyu
*si kayu
3PL
*sidi
*ida
*siqida
*daqida ~ *dida
b. SHORT-FORM NOMINATIVE PRONOUNS
1SG
*=ək
*ak
*-ak
*-ak
2SG
*=ka
*ka
*-ka
*-ka
3SG
*si-ya
*Ø
*Ø
*Ø
1EXCL
*=kami
*kami
*-kami
*-kami
1INCL.DU *=kita
*kita
*-kita
*-ta
1INCL.PL *=kitam
*kitam
*-kitam
*-takayú
2PL
*=kam
*kamu, *kayu *-kamuyu
*-kayu
3PL
*si-di
*ida
*-da
*-da
c. GENITIVE PRONOUNS
1SG
*=ku
*ku ~ *-k
*-ku ~ *-k, *-ta *-ku ~ *-k
2SG
*=mu
*mu ~ *-m
*-mu ~ *-m
*-mu ~ *-m
3SG
*=na
*na
*-na
*-na
1EXCL
*=mi
*mi
*-mi
*-mi
1INCL.DU *=ta
*ta
*-ta
*-ta
1INCL.PL *=tam
*tam
*-tam
*-taku
2PL
*=muy
*muy, *yu
*-muyu
*-yu
3PL
*=di
*da
*-da
*-da
d. OBLIQUE PRONOUNS
1SG
*ni/di-akən
*kani akən
*kanyaken
2SG
*ni/di-kaw (>*-ko) *kani kaw
*kanikaw
3SG
*diya, *ni/di-ko-na *kani kua na *kanya
1EXCL
*ni/di-kami
*kani kami
*kanikami
1INCL.DU *ni/di-kita
*kani kita
*kanikita
1INCL.PL *ni/di-kitam
*kani kitam *kanikitam
2PL
*ni/di-kam
*kani kamu, *kanikamuyu
*kani kayu
3PL
*ni/di-di
*kani kua da *kanida
2PL
*
*si-kam
*si kitam
PCCORD
*siqikitayu
*siqikayu
*siqida
Reid (1974, 1979a) reconstructs PCCORD *-taku for the 1INCL.PL nominative pronouns, but
this is revised to *-takayú with final stress in Reid (2009), and we have updated the tables
here to reflect that latter source.
THE NORTHEASTERN LUZON SUBGROUP
141
therefore seems likely that, even if there was a set of PNLUZ pronouns marked by the formative *siqi-, there was also a simpler set with the formative *si-.
There has also been a reduction of the 2PL.TOP/OBL base from *-kamuyu (or likely a
simpler form like *-kamu, as Tharp reconstructs for PNCORD, and which can also be
reconstructed for PMP) to *-kam. The use of the 3rd person Topicalized forms *siya and
*sidi has been extended to the Nominative set. In Dupaningan Agta, the 3SG short-form
nominative is actually null, but the topicalized nominative is used when an overt pronoun
is needed for disambiguation (Robinson 2011:81‒83). Since the Proto-Northern Luzon
3SG short-form nominative is also reconstructed as null, it is likely that PNELUZ
extended use of the topicalized pronouns in much the same way as DUP does. We reconstruct PNELUZ *siya ‘3SG.NOM’, however, because there is no evidence that a null is
used synchronically in any of the languages except DUP.
PNELUZ languages do not reflect the shortened monosegmental enclitic forms *=k
‘1SG.GEN’ and *=m ‘2SG.GEN’, which Reid (1979a) reconstructs as PNLUZ allomorphs
of *=ku and *=mu, respectively, following vowel-final forms. These allomorphs do
sometimes occur in Dupaningan Agta, but appear to be loans from Ilokano, since they
rarely if ever occur when speakers are consciously trying to speak “pure” Dupaningan
Agta, as opposed to the colloquial way of speaking that includes a great deal of codeswitching with Ilokano (cf. Robinson 2011).
There has been a reduction of PNLUZ *=muyu ‘2PL.GEN’ to PNELUZ *=muy. A
reduction of the oblique formative *kani- to *ni- has taken place in DUP, PAH, and DIN,
which also occurs in some Batanic/Bashiic languages. Note that the presence of the
oblique formative *di- in PAR, CAS, and NAG, also found in Batanic/Bashiic, Sabahan,
and even some Central Philippine languages, suggests that the rest of the Northern Luzon
languages lost *di- as an oblique pronoun formative, just as the vast majority of Central
Philippine languages did.
2.2.2 Case markers. Table 8 lists the case markers for the languages of Northeastern Luzon.
The Paranan case markers pose a particular problem. Most of the Paranan forms are
similar to forms found in the Agta languages (for example, *i NOM, *ti GEN/OBL), usually
Pahanan Agta, but a few of the forms (en ‘NOM.DEF’, nen ‘GEN.DEF’, and ten ‘OBL’) do
not appear to have an origin in PNELUZ. Taking into consideration that en, nen, and ten
are likely from earlier *in, *nin, and *tin (since Paranan [e] often corresponds to Central
Philippine *i in closed syllables), the first two forms (*in and *nin) both have cognates in
Central Philippine languages: Old Bikol had both *in ‘NOM.NONREF ’ and *nin
‘GEN.NONREF’, while *in also has cognates in Waray-Waray and other Warayan languages, in Tausug, and in the Kamayo dialect of Barobo town. Genitive *nin has cognates
in most Bikol languages and in Romblomanon. The *tin form could be from Pahanan
Agta ti, with the final *-n being the result of analogy with the *in and *nin forms. Note
that the *ʔ- : *n- : *t- contrast (where *ʔ- corresponds to the phonemically vowel-initial
form) is also found in Southern Ibanag, whose case markers are iC ‘NOM’, nəC ‘GEN’,
and təC ‘OBL’ (with the final segment being a copy of the first consonant of the following
word). However, the vowels do not match, as Paranan /e/ is not cognate with Ibanag /ə/.
OCEANIC LINGUISTICS, VOL.
142
52, NO. 1
Table 9 compares the reconstructed PNELUZ case markers with those reconstructed
for Proto-Northern Cordilleran (Tharp 1974a), Proto-Philippines (Reid 1979b), and
Proto-Central Cordilleran (Reid 2006). No reconstruction of the Proto-Northern Luzon
case markers is available. The PNELUZ case markers are markedly different from those
reconstructed for the other protolanguages. The nominative common case marker *i
reflects the protoform rather unproblematically, but of the reconstructed PNELUZ genitive common case markers (indefinite *ti and *ta, and definite *nu and *tu), only *nu is
reconstructed elsewhere (for PPH), and none of these is reconstructed for Proto-Northern
Cordilleran. On the other hand, in Dupaningan Agta and Casiguran Agta (the two lanTABLE 8. CASE MARKERS
DUP
COMMON, SG NOM (INDEF) ø
(DEF) i
GEN (INDEF) (na/di)
(DEF)
OBL
ha
COMMON, PL NOM (INDEF)
(DEF)
GEN (INDEF)
(DEF)
PAH
—
u, tu
ti
nu
ti
DIN
i
u
ta
nu, tu
ta
PAR
i
en
ti, nen
nen
ti, ten
NOM
GEN
OBL
PERS, PL
NOM
GEN
OBL
*
PROTO-NELUZ
*i
(*u)
*ta, *ti
*nu, *tu
*ta
du, di
OBL
PERS, SG
NAG
i
—
no, na —
na, tu
to, ta tu, ta
du, di
CAS*
i, tu
ni
ti
(na) ni ni
ha ni ni
ti
ni
ni
di
di
(na) di di
ha di kad
di
—
kəd
ti
ni
kənni,
kən
di ~ de
di
kəndi
du, di
ti
ni
ni
ti
ni
ni
*ti
*ni
*[ka]ni
de
de
de
—
—
de
*di
*di
*ka[n]di
From Headland and Healey (1974), reprinted in Headland and Headland (1974).
TABLE 9. COMPARISON OF RECONSTRUCTED CASE MARKERS
PNELUZ
*i
DEF.
(*u)
GEN INDEF. *ta, *ti
DEF.
*nu, *tu
OBL
*ta (PPAR *ti)
PERS., SG. NOM
*ti
GEN
*ni
OBL
*[ka]ni
PERS., PL. NOM
*di (<*da)
GEN
*di (<*da)
OBL
*ka[n]di (<*-da)
COMMON
*
NOM INDEF.
PNCORD
*i, *ia, *iu
—
*na
—
*sa
*si
*ni
*kani
*da
*da
*kada
PCCORD
*=y, *ø
*nan
*=n, *ø
—
*=s, *si*
*=s, *si
—
—
*da
—
—
PPH
*ʔi, *su, *ʔu
—
*na, *nu
—
*di, *sa
*si
*ni
*ka ni, *kay (<**ka ʔi)
—
—
—
Reid (2006) also includes topic (*sa), dative (*=n/*=y, *kan/*kay) and locative (*=d, *ʔidi/
*ʔudi/*di).
THE NORTHEASTERN LUZON SUBGROUP
143
guages for which we have more extensive data), na—which is reconstructed for both
PPH and PNCORD—is used for at least some of the genitives.22
The southern NELUZ languages use ta for the oblique, while PAR and PAH have ti,
and DUP retains a reflex ha of the earlier *sa. The form ta also occurs in Atta, Central
Cagayan Agta, and Ibanag (Tharp 1974a), where the change of *s > t is regular. However, Tharp reconstructs *sa for PNCORD based on external evidence. Therefore, the
form ta in the southern NELUZ languages could be a borrowing from one of these languages, or could reflect an inherited *sa having undergone a shift of *s > /t/ limited to
the case markers.
For the singular nominative personal case marker, the NELUZ languages all have *ti,
except DUP, which has ni. The latter appears to be an extension of the genitive personal
case marker, where all of the languages reflect the widespread *ni, which is reconstructed
for PPH, PMP, and PAN. Although most of the languages seem to reflect ni for singular
personal oblique as well, we tentatively reconstruct *[ka]ni on the basis of the Paranan
reflex, since the PPH form was *kani. It is, therefore, assumed that the *ka- element was
lost in the other NELUZ languages. It is also possible that PAR kənni was a borrowing from
Ilokano or one of the Cagayan Valley languages. For the personal plural oblique form, we
have more evidence for a *ka- formative, as it is reconstructed for Proto-Northern Cordilleran and would be an expected form for PPH based on the other reconstructions given by
Reid (1979b). Finally, the personal plural for both the nominative and genitive is *di,
which is a regular reflex of the reconstructed *da with Low Vowel Fronting.
2.3 LEXICAL EVIDENCE FOR A NELUZ GROUP. There are a number of
lexical innovations that appear to be NELUZ innovations, for which we have not found
cognates after searching the lexical resources available for other languages. It is likely,
however, that some of these forms will turn up in other languages as the documentation of
northern Philippine languages becomes more widespread. In order to minimize this problem, we have excluded any forms with meanings that do not generally appear on Swadesh
lists. If we had included the more rare forms, this list of proposed innovations would have
been much longer, but it would have increased the likelihood that many of those proposed
innovations would have undocumented cognates outside the NELUZ subgroup.
The following forms (71)‒(76) are proposed unique PNELUZ lexical innovations that
occur in all of the NELUZ speech varieties we surveyed.23
(71) PNELUZ *ləbbút ‘boil water’ > DUP labbút, PAH, PAR, KAS ləbbút, CAS,
NAG ləbút
(72) PNELUZ *ladúʔ ‘fever’ > DUP, PAH, PAR, CAS, KAS, NAG ladúʔ (also ‘sick’
in PAH, KAS, CAS, NAG)
(73) PNELUZ *putát ‘full’ > DUP, PAH, PAR, CAS, KAS, NAG putát
22. In Dupaningan, na is a pronoun that agrees with singular genitive noun phrases (Robinson
2011:55). The Dupaningan case marking system may be in transition, as Liao (2005)
describes for Central Cagayan Agta (a Cagayan Valley language), in which the genitive pronominal forms are in the process of losing their status as clitics and becoming agreement features instead.
23. Since our lexical survey of Dinapigue Agta was far less complete than for the other languages,
we lack a DIN cognate for many of these forms.
OCEANIC LINGUISTICS, VOL.
144
52, NO. 1
(74) PNELUZ *madəggáʔ ‘heavy’ > DUP, PAH madaggíʔ, PAR, NAG madəggíʔ,
CAS madəgíʔ, KAS madəggá
(75) PNELUZ *démət ‘arrive’ > DUP démat, PAH, PAR, CAS, KAS, NAG démət
(76) PNELUZ *pilás ‘muscle’ > DUP, PAH, CAS, KAS, NAG pilás, PAR pilá24
Item (77) reflects a unique semantic shift from ‘buttocks’ to ‘vulva’.
(77) PNELUZ *ubə́t ‘vulva’ > DUP, PAR ubát, PAH ubbát, CAS, KAS, NAG ubə́t
Items (78)‒(80) seem to be unique, but there are suspiciously similar-looking forms in
other languages in the Philippines.
(78) PNELUZ *lupə́s ‘rice husk’ > DUP lupás, PAH, PAR, CAS, KAS, NAG lupə́s;
cf. Ilianen Manobo qupis (Reid 1971); Western Subanon pis, Molbog upis
(Yap 1977) 25
(79) PNELUZ *[k]e[n]nam ‘taste, try’ > DUP énnam, PAH ínnam, PAR ennəm,
DIN ennám, CAS, KAS kɛ́nam, NAG kennám; cf. Central Cagayan Agta naanaamam (Oates and Oates 1955)
(80) PNELUZ *masanikíʔ ‘shy, ashamed’ > DUP, PAH, NAG masanikíʔ, PAR
mansə́ŋkiʔ, CAS, KAS masanikéʔ; cf. Ivatan masnɨk (Reid 1971, Yap 1977)
The following five items (81)‒(85) are innovations that we tentatively reconstruct for
PNELUZ, but which are not found in all of the languages. Based on our subgrouping
argument presented in section 3, we only list words that are found in DUP, at least one of
the other northern languages (PAH, PAR, DIN), and at least one of the southern languages
(KAS, CAS, NAG).
(81) PNELUZ *sánig ‘hear, listen’ > DUP, PAH, PAR, DIN, KAS sánig (*na-sánig
‘hear’, *mag-sánig ‘listen’)
(82) PNELUZ *tóglad ‘push (to transport)’ > DUP, CAS, KAS tóglad, PAR túglad
(83) PNELUZ *bakál ‘stab’ > DUP bakál, PAH, DIN, CAS26 bikál
(84) PNELUZ *réktat > DUP huméktat ~ ruméktat ~ ruméttat ‘start a journey,
commence’, PAH, CAS huméktat ‘leave’27
(85) PNELUZ *ləddís ‘crush lice’ > DUP laddís, PAH ləddís, CAS, NAG lədís
3. INTERNAL SUBGROUPING OF THE NORTHEASTERN LUZON
LANGUAGES. This section presents the evidence for the internal subgrouping of the
Northeastern Luzon languages from phonology (3.1), morphology (3.2), and lexicon (3.3).
3.1 PHONOLOGICAL EVIDENCE
3.1.1 Metathesis. Casiguran Agta and Nagtipunan Agta share an unusual metathesis
involving the secondary patient voice verbs, in which the first consonant and vowel of the
24. Possibly PAR *pilás > *piláh > pilá.
25. Yap (1977) also lists the initial /l/ in CAS as optional, making the cognates stronger, but there
is no evidence for a morpheme boundary after the /l/ in CAS.
26. This CAS form is from Headland and Headland (1974). Our field notes contain disón.
27. The CAS gloss ‘leave, depart from a place’ (Headland and Headland 1974) suggests that the
definitions for all three languages are probably more similar than is suggested by ‘leave’.
THE NORTHEASTERN LUZON SUBGROUP
145
root metathesize if the first vowel of the root is a schwa (that is, i- + Cə- > iyɛC-), as illustrated in forms (86)‒(88). Headland and Healey (1974) note that this is a synchronic process in Casiguran Agta, and that it only occurs when the first syllable of the root is open.
Since we generally elicited only one form for each verb in our own data, we can only
assume that the Nagtipunan Agta process is also synchronic.
(86) PNELUZ *lə[b]bəŋ ‘to bury’ > CAS, NAG iyɛlbə́ŋ ‘bury (PV2.INF)’ (cf. CAS
root ləbəŋ)
(87) PNELUZ *dəttón ‘to put, place’ > CAS iyɛgton, NAG iyɛdtón ‘put, place
(PV2.INF)’ (cf. CAS root dəton)
(88) PNELUZ *bəttén ‘hang by rope’ > CAS niyɛ́btɛn ‘hang by rope (PV2.PAST)’
3.1.2 Monophthongization. Casiguran Agta and Nagtipunan Agta also share a
monophthongization rule, in which *aw and *ay became /o/ and /e/, respectively, as illustrated in examples (89)‒(94). There are no diphthongs in penultimate syllables in these
languages, so all examples occur in the ultima. Note that a glottal stop is added phonetically after what would otherwise be a word-final vowel, a development shared by all of
the NELUZ languages.
(89) *anay ‘termite’ > CAS, NAG anéʔ (cf. PAH, PAR, DIN, KAS anáy)
(90) *balay ‘public building’ ‘house’ > CAS, NAG biléʔ (cf. DUP, PAH, PAR, DIN
biláy, KAS baláy)
(91) *piray ‘crippled’ > CAS, NAG piléʔ (cf. DUP, PAH, PAR, DIN, KAS piláy)
(92) *takaw ‘steal’ > CAS, NAG takóʔ (cf. DUP, PAH, PAR, DIN, KAS takáw)
(93) *laŋaw ‘housefly’ > CAS, NAG laŋóʔ (cf. DUP, PAH, PAR, DIN, KAS laŋáw)
(94) PPH *buŋaw ‘testicles’ > CAS, NAG buŋóʔ (cf. DUP, PAH, PAR, KAS buŋáw)
Headland and Healey (1974) describe this monophthongization rule for CAS as
involving lax mid vowels, that is, *aw and *ay shift to /ɛ/ and /ɔ/, respectively. Because of
the short amount of time that we worked on CAS and NAG, we can only assume that
forms (89)‒(94) should have lax vowels rather than tense ones.
3.1.3 *s > /h/. PNELUZ *s is reflected sporadically as /h/ in all of the NELUZ languages except CAS, which would seem to suggest that CAS split off from the other languages before this innovation occurred. However, as we will see below, there is stronger
evidence that DUP is a first-order subgroup of NELUZ, and very strong evidence that CAS
should subgroup with NAG. Therefore, it is assumed that the sporadic *s > /h/ shift spread
according to the wave model after the breakup of PNELUZ.
3.2 MORPHOLOGICAL EVIDENCE. All of the NELUZ languages except
DUP innovated a shift of *aw > /o/ in the 2SG topicalized and oblique pronouns. Note that
this is an expected shift in CAS and NAG, but not in DIN, PAH, or PAR, which also reflect
it. Since this shift is limited to a single pronominal base, it is unclear whether this was a
separate innovation in DIN, PAH, and PAR, or if this occurred under the influence of CAS
and/or NAG.
Pahanan Agta and Paranan are unique in having ti as a genitive and oblique case
marker for common nouns, whereas the other NELUZ languages (except DUP) use ti as a
146
OCEANIC LINGUISTICS, VOL.
52, NO. 1
singular nominative case marker for personal names. It is difficult to determine whether
this is an innovation, however, since ti is also the common genitive case marker in Ilokano, and the Ilokano common oblique case marker iti is also often shortened to ti.
Therefore, this could be a borrowing from Ilokano or, more likely, a functional shift of the
singular personal nominative case marker to instead mark the genitive and oblique of
common nouns.
The fact that all of the languages except DUP use ti as the singular personal nominative marker is also evidence for separating Dupaningan from the other Northeastern
Luzon languages.
DUP, PAH, PAR, and DIN share the innovated 3SG.OBL pronoun *ni/di-ku-na, while
Casiguran Agta and Nagtipunan Agta both reflect *diya, a form that is widespread in the
Philippines. It is likely that this innovation spread according to the wave model after the
breakup of NELUZ.
3.3 LEXICAL EVIDENCE FOR INTERNAL SUBGROUPING. This
section presents lexical innovations shared by two or more of the NELUZ languages. As
our goal here is to determine the internal subgrouping of NELUZ, we restrict ourselves to
presenting clusters of languages (although not necessarily genetic subgroups) that share
at least two lexical innovations. All reconstructions in this section (that is, [95]‒[121]) are
for unspecified nodes below Proto-Northeastern Luzon. Lacking further evidence, we
assume only that they belong to a node that includes all of the languages in which the
forms are reflected.
Six innovations (95)‒(100) were found in all of the NELUZ languages except DUP.
(95) *apérit ‘short (length)’ > PAH, CAS apéhit, PAR, KAS apérit, NAG apíhit
(96) *hágad ‘chase’ > PAH, PAR, CAS, KAS, NAG hágad
(97) *táblal ‘tasteless’ > PAH, PAR, CAS, KAS, NAG táblal
(98) *talád ‘sugarcane’ > PAH, PAR, CAS, KAS, NAG talád
(99) *burə́k ‘blind’ > PAH, CAS buhə́k, PAR, KAS burə́k ‘blind’; KAS naburə́k,
NAG nabuhə́k ‘having dirt in the eye’
(100) *tukóy ‘know (facts)’ > PAH, PAR, CAS, KAS, NAG tukóy (also CAS, NAG
‘know how’; KAS, CAS, PAR, DIN (na)tukuy(an) ‘learn’; PAH, CAS, KAS,
NAG ‘remember’)
Nine lexical innovations (101)‒(109) were found that are unique to CAS and NAG.
(101) *dulóy ‘lungs’ > CAS, NAG dulóy
(102) *bidút ‘deer’ > CAS, NAG bidút
(103) *mediŋát ‘red’ > CAS, NAG mediŋát (cf. DUP digkat)
(104) *kəbíl ‘carry, bring, hold in hand’ > CAS, NAG kəbíl
(105) *ləbúg ‘fight’ > CAS, NAG ləbúg (cf. Yogad labu, Tiruray lifut) (Davis and
Mesa 2000; Yap 1977)
(106) *lisóʔ ‘hide’ > CAS, NAG lisóʔ (cf. Isneg siru, Yap 1977)
(107) *mag-ayág ‘play’ > CAS mag-íyag (with LVF from the prefix), NAG magayág (cf. DUP kayag, Waray uyag) (Yap 1977)
THE NORTHEASTERN LUZON SUBGROUP
147
(108) *sahát ‘happy’ > CAS, NAG sahát
(109) *dinsón ‘stab’ > CAS, NAG dinsón
There were four innovations (110)‒(113) unique to PAH and PAR, although in this
case, one language could have easily borrowed the term from the other.
(110) *dukót ‘cook rice’ > PAH, PAR dukót (cf. DUP ‘build a fire’; semantic shift
from PPH[?] ‘burn’)
(111) *masibə́t ‘tight’ > PAH, PAR masibə́t (cf. Yogad siggat ~ sigat ‘tighten’)
(Davis and Mesa 2000)
(112) *mad(iə)ŋə́t ‘dirty’ > PAH madəŋə́t, PAR madiŋə́t
(113) *habág ‘have mercy’ > PAH nahabág, PAR kahabágan
There were three innovations (114)‒(116) unique to PAH, PAR, CAS, and KAS (but
surprisingly absent from NAG).
(114) *bulóŋ ‘heel’ > PAH, PAR, CAS, KAS bulóŋ (cf. Manide, Inagta Alabat
bu-lúng ‘knee’)
(115) *bulibuli ‘lie, untruth’ > PAH, PAR, CAS bulibulíʔ, KAS bulibuléʔ
(116) *sapsap ‘nipa tree (living, as opposed to processed)’ > PAH, PAR, CAS, KAS
sapsap
There were three innovations (117)‒(119) unique to CAS and KAS.
(117) *pəknit ‘rip, tear’ > CAS pəknit, KAS pə́knet
(118) *mapérəŋ ‘noisy’ > CAS mepéhəŋ, KAS mapérəŋ
(119) *ma-kelagíp ‘ask’ > CAS məg-pa-kelágip, KAS na-kélagip (cf. Ilokano lagip
‘remember’)
There were two innovations (120) and (121) unique to DIN, CAS, and NAG.
(120) *mudít ‘face’ > DIN, CAS, NAG mudít
(121) *mag-eplək ‘thirsty’ > DIN məg-íplək, CAS məg-eplək, NAG mag-eplək
3.4 SUMMARY OF EVIDENCE FOR INTERNAL SUBGROUPING.
All of the NELUZ languages except Dupaningan Agta share the singular personal case
marker ti and the monophthongization of the diphthong in the second person singular
topicalized and oblique pronouns. This contributes to the evidence for placing Dupaningan Agta as a primary branch of NELUZ.
Casiguran Agta and Nagtipunan Agta share a unique metathesis in the secondary
patient voice, and the monophthongization of diphthongs. We calculated 77 percent similarity between the two varieties on a 200-word Swadesh list (cf. table 1), very close to the
threshold for considering them to be dialects of a single language. Moreover, it is unlikely
that the similarities between Casiguran Agta and Nagtipunan Agta are due to recent borrowing or influence from common trade languages, for although the languages are separated by relatively few miles, the mountains separating the two are formidable, and both
groups claimed not to travel across them. Additionally, while Casiguran Agta is most
influenced by Tagalog and Kasiguranin, Nagtipunan Agta is surrounded by Ilokano,
which likely accounts for many of its lexical differences with the other more coastal
OCEANIC LINGUISTICS, VOL.
148
52, NO. 1
NELUZ languages. We, therefore, conclude that Casiguran Agta and Nagtipunan Agta
are very closely related, if not dialects of a single language.
Pahanan and Paranan share the innovation of using the case marker ti for the genitive
and oblique of common noun phrases. The first author calculated the lexicon of Pahanan
Agta and Paranan to be 81 percent cognate (cf. table 1).
Evidence from lexical innovations as described in 3.3 also suggests that Dupaningan
forms a primary branch of the Northeastern Luzon subgroup vis-à-vis the other five languages. Of the remaining five languages, Casiguran Agta and Nagtipunan Agta form one
branch together, and Pahanan Agta and Paranan form another branch together, while the
evidence for the exact position of Dinapigue Agta is lacking, due to the smaller amount
of data available.
Figure 1 illustrates the proposed internal subgrouping of the Northeastern Luzon languages. Based on the data presented here, the position of Dinapigue Agta remains unresolved, as it shares some features with Pahanan Agta and Paranan, and others with
Casiguran Agta and Nagtipunan Agta. Given the limited amount of data on this variety, it
is not surprising that it is difficult to resolve its place within the tree. Kasiguranin is not
included in this tree, even though it shares several lexical innovations with Casiguran Agta,
which are most likely early borrowings, as is much of the non-CPH lexicon of Kasiguranin. However, Kasiguranin’s Tagalog-heavy substrata clearly point to its genetic relationship with Tagalog (Lobel and Robinson 2012), and the similarities to Casiguran Agta can
be explained as the result of borrowing after it separated from the core Tagalog dialects.
4. EXTERNAL RELATIONSHIPS OF THE NORTHEASTERN
LUZON LANGUAGES. Having discussed the unity of the NELUZ subgroup
and its internal structure, we will now address the question of how this subgroup is related
to other languages in the northern Philippines, considering the evidence for or against
each of the various possibilities.
To date, it has been argued that the NELUZ languages subgroup with the Cagayan
Valley languages, as two coordinate branches of a Northern Cordilleran subgroup, which
is in turn one of the primary branches of the Northern Luzon group. Figure 2 illustrates
the internal structure of the Northern Luzon subgroup according to Reid (2010). In order
to support this hypothesis, it would be necessary to find exclusive innovations shared by
both Proto-Cagayan Valley and Proto-NELUZ. However, we find no innovations shared
exclusively by these two subgroups, as will be discussed below.
FIGURE 1. PROPOSED NELUZ INTERNAL SUBGROUPING
Northeastern Luzon
Dupaningan
Agta
Pahanan
Agta
Paranan
Dinapigue
Agta
Casiguran
Agta
Nagtipunan
Agta
149
THE NORTHEASTERN LUZON SUBGROUP
FIGURE 2. THE NORTHERN LUZON LANGUAGES (REID 2010)
Northern Luzon (= Cordilleran)
Meso-Cordilleran
Ikokano
Arta
Northern Cordilleran
Cagayan Valley
Northeastern Luzon
Gaddang
Itawis
C. Cagayan Agta
Ibanag
Atta
Dupaningan Kasiguranin Casiguran Palanan
Paranan
Yogad
Dumagat Dumagat
Isnag
(Agta)
(Pahanan)
If an exclusive link with the Cagayan Valley languages cannot be demonstrated, then
the next possibility is that NELUZ still belongs in the Northern Luzon subgroup, but is not
a part of NCORD, in which case NELUZ would be coordinate with NCORD, C/SCORD,
Ilokano, and Arta. To prove this, we would need evidence that the NELUZ languages
share exclusive innovations with the other Northern Luzon languages. The evidence for
this possibility is stronger than the evidence for a link with the Cagayan Valley languages.
A third possibility is that the NELUZ languages are coordinate with Northern Luzon,
in a higher subgroup. Any similarities shared exclusively with Northern Luzon would be
due to a heavy Northern Luzon overlay from early contact. Since very few innovations
for Northern Luzon have been identified, this latter hypothesis awaits further study.
4.1 PHONOLOGICAL EVIDENCE. This section will discuss whether the
phonological innovations shared by the NELUZ languages (cf. section 2) can be used to
subgroup the NELUZ languages vis-à-vis the other languages of the northern Philippines.
4.1.1 *R > /g/. As in the Cagayan Valley languages, the NELUZ languages all reflect
the shift of *R > g. The reflexes of *R have generally been considered quite important in
subgrouping the languages of the Philippines, and the current hypotheses for subgrouping the languages of the northern Philippines are no exception. Figure 3 presents a tree of
the Northern Luzon languages taking into consideration only the *R > g shift.
FIGURE 3. INTERNAL STRUCTURE OF NORTHERN LUZON BASED ON *R
Northern Luzon
*R > g, r
*R > g
*R > l
Meso-Cordilleran Ilokano
Arta
Cagayan Valley
Northeastern Luzon
OCEANIC LINGUISTICS, VOL.
150
52, NO. 1
Note that this tree is nearly identical to the one proposed by Reid (2010), except that it
subgroups Ilokano and Arta together, which Reid (1989) leaves open as a possibility.
NELUZ and Cagayan Valley are the only subgroups within Northern Luzon to reflect an
unconditioned *R > g shift. In Ilokano and Arta, *R split to /g/ and /r/ without clear conditioning (Tharp 1974b, Reid 1989).
4.1.2 The reflexes of *j and *d. In the NELUZ languages, *j merged with *d as /d/.
Within Northern Luzon, only NELUZ, Arta, Ilokano, and Northern and Southern Alta
reflect the merger of *j with *d, where the other Northern Luzon languages have merged
*j with *g instead (Reid 2006:5‒6). Problematically, this suggests a different subgrouping than the one suggested by the reflexes of *R, as illustrated in figure 3. If we were to
base the subgrouping of the NELUZ languages primarily on the reflexes of *j, we would
get the tree presented in figure 4.
The reflexes of *R are also indicated for the sake of comparison. Note that if this is the
correct subgrouping, then the mergers of *R and *g in NELUZ and the CV languages
were independent of one another, as was the shift of *R > l in Northern and Southern
Alta, on the one hand, and South-Central Cordilleran, on the other. Certainly, it would be
ideal to minimize the instances of convergence in any subgrouping hypothesis, so this is
not an ideal tree. On the other hand, if we revisit the reflexes of *R in figure 3, this time
including the *j reflexes, we also run into problems, as can be seen in figure 5 below.
To reconcile the reflexes of *j with the tree based on *R, more convergent changes are
required. The shift of *j > d must have occurred independently three separate times, while
the *j > g shift must have occurred twice. In terms of parsimony, then, the tree based on the
reflexes of *j is preferable. But in order to support the *j tree, we would need exclusively
shared innovations between NELUZ, Ilokano, and Arta on the one hand, and between CV
FIGURE 4. INTERNAL STRUCTURE OF NORTHERN LUZON BASED ON *j
Northern Luzon
*j > d
*R > g, r
*j > g
*R > l
*R > g
NE Luzon
Ilokano
Arta
N & S Alta
Cagayan Valley
SC-Cord
FIGURE 5. SUBGROUPING OF NORTHERN LUZON WITH *R AND *j
Northern Luzon
*R > g, r
*j > d
*R > l
*j > g
Meso-Cord.
*R > g
N & S Alta
Ilokano
Arta
Cagayan Valley
NE Luzon
THE NORTHEASTERN LUZON SUBGROUP
151
and Meso-Cordilleran on the other. We have not specifically sought out such evidence
because we believe that there is a simpler explanation for the reflexes of *j and *R.
It is important to note that there are very few modern reflexes of *j and *R in the Philippines. In all cases, both *j and *R have merged with some other phoneme in all known
Philippine languages, “making none of them what Dempwolff called ‘Test-Sprachen’ for
*j or *R” (Bob Blust, pers. comm., February 13, 2013). Furthermore, none of the modern
reflexes of *j or *R continue the reconstructed phonetic values attributed to PMP: *j as a
“palatalized velar stop” (Blust 1991a:134) or a velar fricative (Ross 1992), and *R as a
uvular trill or fricative (see Blust 1991a and Wolff 2003, among others). Indeed, throughout the Philippines, there is a surprising lack of diversity in the reflexes of both *R and *j,
likely due to the contact-based influence of a handful of powerful languages in relatively
recent times, something that has been alluded to in various contact and leveling hypotheses presented by Reid (1987, 1994) and Blust (1991b, 1999).
It should also be noted that, although the NELUZ languages are the only Northern
Luzon languages to reflect both *R > g and *j > d, this same pair of shifts is found in the
Greater Central Philippines subgroup. We suggest, then, that the phonological shifts of *R
> /g/ and perhaps also *j > /d/ in PNELUZ happened under influence of speakers of more
prestigious Greater Central Philippine languages who were sailing along the east coast of
northern Luzon, where the NELUZ languages are spoken. The various northern Philippine
languages may have retained a distinct *R phoneme until relatively recently, but then
came under the influence of languages whose populations were much more powerful or at
least much more advanced in terms of trade and regional connections, so their pronunciation of *R and *j assimilated to that of the prestige language. If they were under the
influence of a GCPH language such as Tagalog or Bikol, this would have been *R > /g/.
This possibility is consistent with the idea of the “stereotyped Philippine /g/ reflex” of
*R (Conant 1910), which was revisited by Blust (1991b) who proposed that the /g/ reflex
of *R in Ilokano, and the presence of mag- < *maR- in northern Philippine languages
that do not otherwise reflect *R as /g/, may have happened under the influence of a prestigious Greater Central Philippine language.
This may not be such a stretch after all, since recent historical research suggests that
the most important centers of long-distance trade in the Philippines were Ma-i (Mindoro)
in the late tenth century AD (Scott 1984), Butuan in northeastern Mindanao at the opening
of the eleventh century AD (Scott 1984, Hontiveros 2000), and then Jolo in the Southwest
Philippines beginning in the 1300s, all areas where Greater Central Philippine languages
are spoken (Butuanon in Butuan, Tausug in Jolo, and either Tagalog or a Bisayan or
Mangyan language in Mindoro).
It is also possible that *R > /g/ and *j > /d/ developed in the NELUZ languages under
the influence of both Ilokano to the north and Greater Central Philippine languages to the
south. If speakers of NELUZ languages were surrounded by two very different languages
that both happened to share these reflexes, it is not difficult to believe that they would
have adjusted their pronunciation of *R and *j to sound similar to the two surrounding,
more prestigious, groups.
OCEANIC LINGUISTICS, VOL.
152
52, NO. 1
In short, then, while the reflexes of *R and *j are particularly salient and have been
frequently used for subgrouping, they constitute fairly weak evidence for classifying the
NELUZ languages.
4.1.3 t /s metathesis. The main feature defining Proto-Northern Luzon is t /s metathesis, which is also found in NELUZ. The sequence *tVs metathesized to *sVt in the Northern Luzon languages. There is also the long-distance metathesis of *tVCVs to *sVCVt.
These processes were first noted by Dyen (1972), and both are reconstructed by Reid
(2006) for Proto-Northern Luzon. In our data, the metathesis appears to be sporadic for
all of the languages in which it occurs. The Cagayan Valley languages (except Malaweg)
reflect the *s > /t/ shift, so the metathesis, assuming it occurred in the ancestor of the modern Cagayan Valley languages, is obscured. Based on our data, Umiray Dumaget does
not reflect this metathesis. Forms (122)‒(128) illustrate the t/s metathesis.
*Ratas ‘milk’ > DUP, PAH, CAS, NAG gisát ‘breast’28
*taŋis ‘cry’ > DUP, PAR saŋɛ́t PAH, CAS, NAG saŋít (KAS saŋɛ́t)
*tədis ‘crush lice’ > DUP saddit, PAR səddít
*bitiəs ‘calf of leg’ > DUP bisat
*di-taqas > DUP disat ‘high ground’, CAS disat ‘term for the upper part of a
hillside’ (Headland and Headland 1974)
(127) *tiRis ‘decant’29 > DUP sigit ‘dip for liquid with a small bucket’, CAS sigit ‘to
pour water into a cup (from another container, or from a faucet)’ (Headland
and Headland 1974)
(128) *təRas ‘hardwood, hard’ > DUP sagget ‘narra (a kind of valuable hardwood)’
(122)
(123)
(124)
(125)
(126)
In form (129), the NELUZ reflexes do not reflect metathesis.
(129) *taqəmis > PAH, PAR, CAS, NAG támʔis (KAS tamís) ‘sweet’ (DUP sámɁit is
clearly an Ilokano loan, since Dupaningan Agta does not retain medial glottal
stop in the native stratum)
Moreover, the process is no longer productive, as there are words with the sequences
tVs and tVCVs, such as (130).
(130) PNELUZ *tulós ‘continue’ > DUP, PAH, CAS tulós, NAG tulús (KAS tulós)
4.1.4 *r > /l/. As mentioned in 2.1.1.3, the NELUZ languages reflect the split of *r > /r/
and /l/ without any apparent conditioning. Tharp (1974a) notes that the Cagayan Valley
languages and Ilokano also reflect sporadic *r > /l/. Note that the *r > /l/ shift is fairly common in the Philippines, and is also reflected in Tagalog, Cebuano, Ilonggo, and many
other Central Philippine and Greater Central Philippine languages.
4.1.5 Schwa and gemination. As mentioned earlier, all of the NELUZ languages
except Casiguran Agta geminate consonants after a schwa. This innovation is also found
28. The semantic shift here is not unique to the NELUZ languages. It also occurs in the CV languages, NALTA, and Umiray. It is unclear if the semantic shift is more widespread than that.
29. This and the following reconstructions are from Dyen (1972), who cites them as PAN, but since
he does not use any Formosan evidence, we have changed this to PMP. Dyen does not reconstruct
meanings for his protoforms, but rather lists the meanings in the various daughter languages.
THE NORTHEASTERN LUZON SUBGROUP
153
in the Cagayan Valley languages and Ilokano (Tharp 1974a). This is weak evidence,
however, as many other Philippine and non-Philippine Austronesian languages also
geminate consonants after schwa (for example, some Manobo and Sama-Bajaw languages, phonetically in Maranao, and evidence thereof in North Borneo, cf. Blust 2010),
which is hypothesized to have been phonetically shorter than the other three reconstructed PAN vowels (Blust 2009:548).
4.1.6 Low Vowel Fronting (LVF). All of the NELUZ languages have Low Vowel
Fronting (discussed in 2.1.2.2 above), as do Southern Alta (Reid 1991, also citing a similar
process in Northern Alta), Umiray Dumaget (Himes 2002), Manide (Lobel 2010), and
Inagta Alabat (Lobel 2011), all of which are spoken by Negrito Filipino populations along
the eastern coast of Luzon. Since there is no other evidence that these languages form a
linguistic subgroup, we consider this an areal feature of Negrito Filipino languages that has
spread from Casiguran Agta and Pahanan Agta to the non-Agta languages Kasiguranin
and Paranan, respectively, through borrowing of individual lexical items. It is also worth
noting that there are key differences in the implementation of LVF in the Northeastern
Luzon languages, in Manide and Inagta Alabat, and in Umiray Dumaget (Lobel 2012). In
the NELUZ languages, LVF occurs only sporadically, and only after voiced stops (/b d g/).
In Manide and Inagta Alabat, LVF is similarly sporadic, but occurs after glides /w y/ as
well as after voiced stops /b d g/. Umiray Dumaget is unique among these languages
because its LVF process (which occurs after voiced stops /b d g/ and glides /w y/) is completely productive, including synchronic alternations resulting from prefixation, infixation,
and suffixation.
4.1.7 /r/ ~ /h/. All of the Agta languages of northeastern Luzon have an /h/ variant of
PNELUZ *r, as illustrated in forms (131)‒(134). This is not found in the non-Agta languages Paranan and Kasiguranin. In Dupaningan Agta, for which the first author has much
more comprehensive data, we found that /r/ and /h/ are in dialectal variation, with /r/ most
commonly found in the northern dialects, and /h/ increasingly prevalent in southern dialects. Frequent movement of individuals, however, makes it difficult to define geographically based dialects for this semi-nomadic group. The Dupaningan Agta data here
generally reflect the northern dialect documented in Robinson (2011) and thus preserve /r/.
(131) PNELUZ *barə́k ‘piglet’30 > DUP barák, PAH bahə́k, PAR barə́k, CAS, NAG
bəhə́k (KAS barə́k) (cf. CCAGTA bahák)
(132) PNELUZ *uráy ‘wait’ > DUP, PAR uráy, PAH, CAS, NAG uháy (KAS uráy)
(133) PNELUZ *ikə́r31 ‘cough’ > DUP ikár, PAH, DIN, CAS, NAG ikə́h, PAR ikə́r
(KAS ikə́r) (cf. CCAGTA ikár)
(134) PNELUZ *karamáy ‘centipede’ > DUP, PAR karamáy, PAH, DIN kahamáy,
CAS, NAG kahaméʔ (KAS karamáy)
Interestingly, this feature is also found in Central Cagayan Agta, which belongs to the
Cagayan Valley subgroup. Oates and Oates (1958) note that although /h/ is more frequent
stem-initially, and /r/ more frequent stem-finally, /r/ alternates freely with /h/ in stem30. This is from PAN *bǝRǝk and the expected PNELUZ reflex is thus *bǝgǝk.
31. This is from PMP *ikǝj and the expected PNELUZ reflex is thus *ikǝd.
154
OCEANIC LINGUISTICS, VOL.
52, NO. 1
medial position in about half of the words in which the two phonemes occur, such as in
items (135)‒(138).
(135) CCAGTA sihat ~ sirat ‘stomach’
(136) CCAGTA fuhaw ~ furaw ‘white’
(137) CCAGTA gahut ~ garut ‘roofing grass’
(138) CCAGTA pahiŋŋil ~ pariŋŋil ‘cheek’ (Oates and Oates 1958)
In other cases, CCAGTA /h/ is clearly derived from earlier *r, but the /r/ reflex does not
occur synchronically, such as in form (139).
(139) CCAGTA huplaano < Spanish eroplano ‘airplane’
The alternation of /r/ and /h/, and the replacement of *r with /h/, are areal features of
the Agta languages of Cagayan, Isabela, and northern Aurora provinces, rather than an
inherited feature in a particular innovation-defined subgroup.
4.1.8 Glottal stop. All of the Agta languages of northern Luzon underwent a process
of final glottal stop epenthesis (Vanoverbergh 1937:15, Tharp 1974a:69), but while most
historically vowel-final forms contain a word-final glottal stop, there are also forms today
that do not. Due to its distribution, this is considered an areal feature in the Agta languages of northern Luzon.32
The loss of word-final glottal stop is one innovation that has been considered defining
of Proto-Northern Luzon (Reid 2006), but because of the aforementioned glottal stop
epenthesis, any evidence of loss of word-final glottal stop has been obscured in the NELUZ
languages (that is, it is unclear whether the inherited glottal stop was dropped prior to the
generalization of the word-final glottal stop on all historically vowel-final roots).
4.1.9 *s > /h/. As discussed in 2.1.1.7 above, *s sporadically becomes /h/ in the
NELUZ languages (though not in Casiguran Agta). This shift does not occur in the
Cagayan Valley languages (Tharp 1974a), nor in Arta (Reid 1989), and is only found in
one language in each of the Central Cordilleran and Southern Cordilleran subgroups:
Balangaw in the former (Reid 1974) and Kalanguya in the latter (Himes 1998). Only a
single form in Ilokano reflects an *s > /h/ shift: the negator saɁan, which has the alternate
pronunciation haɁan.33 Otherwise, Ilokano lacks an /h/ phoneme in the native stratum
(although speakers now freely borrow Tagalog and English words with /h/).
4.1.10 Monophthongization of *aw and *ay. All of the Cagayan Valley languages share a vowel reduction rule in which *aw and *ay monophthongize to /o/ and /e/,
respectively, when followed by another consonant (Tharp 1974a). Casiguran Agta and
32. Note that Manide and Inagta Alabat also reflect sporadic glottal stop epenthesis, but that
word-final glottal stop remains phonemic in these languages, as it is unpredictable.
33. Note that independent *s > /h/ shifts are found in functors sporadically throughout the Philippines,
including Manide, Inagta Alabat, most Waray-Waray dialects, Butuanon, Tausug, and even in a
number of Sabah’s Dusunic languages, including Kujau, Dusun Kuala Monsok, Dusun Tindal,
Dusun Tambunan, Coastal Kadazan, and Sonsogon. Bob Blust (pers. comm., February 13, 2013)
also points out that “*S > *h > zero also happened between PAN and PMP in high-frequency morphemes, such as *Səpat > *əpat ‘four’, *Si- > *i- ‘instrumental/benefactive voice’, or *Sika- >
*ika- ‘ordinal numeral prefix’, as we would normally expect all of the PMP forms here to begin
with *h. This appears to be a frequency effect in historical change.”
155
THE NORTHEASTERN LUZON SUBGROUP
Nagtipunan Agta share a very similar shift, in which *aw and *ay monophthongize to /ɔ/
and /ɛ/, respectively, as discussed in 3.1.2. We consider the shifts in each of these two subgroups to be independent of one another, as this is an exceedingly common innovation.
4.1.11 Merger of *ə with *a. All of the Cagayan Valley languages merged *ə and
*a as /a/, but this change did not occur in Proto-Northeastern Luzon, as the only NELUZ
language that consistently merges *ə and *a is Dupaningan Agta, or certain dialects
thereof. In other Dupaningan Agta dialects, however, /a/ and /ə/ remain separate phonemes (Robinson 2011:4). The shift of *ə > /a/ in the northern dialects of Dupaningan
Agta likely occurred under the influence of Ibanag or some other CV language, prior to
the Ilokano expansion into the area.
4.1.12 *s > /t/ and *ti > /s/. Tharp (1974a) lists a set of ordered innovations (1) *s > t,
(2) *ti > s /__ {V, y} that have taken place in the Cagayan Valley languages (cf. table 10)
and also in Ilongot (Himes 1998:138).34 The NELUZ languages do not share either change.
4.1.13 Other Cagayan Valley innovations. There are a number of other innovations in the Cagayan Valley languages (Tharp 1974a) that are not found in any of the
NELUZ languages: (1) the *p > f / _u shift found in Yogad, Ibanag, Gaddang, Central
Cagayan Agta, and Itawis (note that there is no [f] phone in any of the NELUZ languages);
(2) the *b > h /_u shift in Central Cagayan Agta and Itawis; (3) the intervocalic rhoticization
of *d (*d > r / V_V), which is found in Atta, Gaddang, Ibanag, Itawis, and Yogad (although
note that the instances of *j > r in the NELUZ languages could be via an intermediate step *j
> *d > r, cf. 2.1.1.2); (4) the palatalization of *d > /j/, /z/, or /h/ before /i/, which is found in
TABLE 10. *s > t AND *ti > s IN THE NORTHEASTERN LUZON
AND CAGAYAN VALLEY LANGUAGES*
PMP *asuk PMP *pusuq PMP *diRus PMP *uRsa PMP *tian
‘heart’
‘bathe’
‘deer’
‘stomach’
‘smoke’
Northeastern Luzon
Dupaningan
Pahanan
Paranan
Casiguran
Kasiguranin
Cagayan Valley
CC Agta
Ibanag
Atta
Isneg
Itawit
Malaweg
*
asók
asók
asók
asók
asók
pusú
pusóʔ
pusú
pusóʔ
pusóʔ
dégus
dígus
dígus
dígus
dígus
ógsa
úgsa
úgsa
ógsa
ogsa
tíyan
tíyan
tíyan
tiyán
tiyán
atúʔ
atúʔ
atu
atúʔ
atúk
asúʔ
futu
futú
puttu
(púso)
futúʔ
púsu
zigut
zigúʔ
jiguʔ
díxut
zihut
digus
úgtaʔ
uttá
—
ugtá
uttá
úgsa
—
sa:ŋ
sa:n
—
—
—
Data are from our field notes, except Isneg (from Vanoverbergh 1972), the Central
Cagayan Agta and Atta reflexes of ‘heart’ (from Tharp 1974), and the Central
Cagayan Agta, Atta, and Itawit reflexes of ‘bathe’ (also from Tharp 1974).
34. According to the first author’s field notes, these innovations are also curiously absent from
Malaweg, which is supposedly a member of the Cagayan Valley subgroup, and which Ethnologue considers a dialect of Itawit. Note, however, that the most recent edition (Lewis 2009)
includes the comment “Malaweg may possibly be reclassified as a separate language.”
OCEANIC LINGUISTICS, VOL.
156
52, NO. 1
CCAGTA, Atta, Ibanag, and Itawis; (5) the assimilation of the final /g/ of the prefix *maR(*mag- > maCi / _-Ci), which is found in Atta, Gaddang, Ibanag, Itawis, and Yogad; (6) the
monophthongization of *uy in final position (*uy > i / _#), which is found in Atta, Ibanag,
and Itawis; and (7) the lenition of *g > Ø in Gaddang, and of *g > /h/ in Itawis.
4.1.14 Summary of phonological evidence. Table 11 summarizes the phonological evidence discussed in this section. Of the twenty aforementioned phonological innovations, four are shared by the NELUZ languages and the Cagayan Valley (CV)
languages, but all four are also found in other languages of the Philippines, which suggests that there is very little to link the NELUZ languages exclusively with the Cagayan
Valley languages. The first of these four shared innovations is *R > g, which is quite
common and found elsewhere in the Philippines, including (sporadically) in Ilokano and
Arta. The second shared innovation is t/s metathesis, which has been reconstructed for
Proto-Northern Luzon (Reid 2006). The third shared innovation is sporadic *r > l, which
is also found in many CPH and even GCPH languages. The fourth shared innovation is
gemination after schwa, which is also found in Ilokano and a number of other languages
in the Philippines including some Manobo languages and Sama-Bajaw, as well as phonetically in Maranao (cf. Lobel and Riwarung 2011), and historically in the North Borneo
languages (cf. Blust 2010). Based on phonological evidence, then, it would seem that the
NELUZ languages likely belong to the Northern Luzon subgroup, but probably should
not be placed in a subgroup with the Cagayan Valley languages.
TABLE 11. SUMMARY OF PHONOLOGICAL EVIDENCE
Others
GCPH; Ilokano and Arta
have /g/ and /r/
2) t/s metathesis
YES (sporadic) YES (sporadic)
All Northern Luzon
3) Sporadic *r > /l/
YES (sporadic) YES (sporadic)
CPH, GCPH
4) Gemination after schwa
YES
YES
Ilokano, others
5) *j and *d merger
YES
NO
Arta and Ilokano
6) LVF
YES
NO
Other Negrito Filipino
languages
7) *r > /h/
Agta only
NO (only CCAGTA) NO
8) Sporadic *s > /h/
YES (not in
NO
VARIOUS: Kalanguya
CAS)
(SCORD), Balangaw
(CCORD)
9) Loss of contrastive stress YES
NO (only CCAGTA) Pangasinan (Zorc 1979)
and SCORD (Himes 1998)
10) Diph. > Monoph.
NO (only CAS YES
Not in N. Phil.
and NAG)
11) *a and *ə merger
NO (only DUP) YES
Not in N. Phil.
12) *s > /t/, *ti >/s/
NO
YES
Not in N. Phil.
13) *p > f / _u
NO
YES
Not in N. Phil.
14) *b > h / _u
NO
YES
Not in N. Phil.
15) *-d- > -r- / V_V
NO
YES
Tagalog and others
16) *d > j,z,h / _i
NO
YES
Not in N. Phil.
17) *maR- > *maC1 / _C1
NO
YES
Not in N. Phil.
18) *uy > i / _#
NO
YES
Not in N. Phil.
19) Lenition of *g,*j,*R/ V_V NO
YES
Not in N. Phil.
20) -ʔ# epenthesis
YES
NO (only CCAGTA) NO
1) *R > /g/
NE Luzon
YES
Cagayan Valley
YES
THE NORTHEASTERN LUZON SUBGROUP
157
4.2 MORPHOLOGICAL EVIDENCE. Just as there is little phonological evidence for subgrouping the NELUZ languages with the Cagayan Valley languages, there is
likewise little unambiguous morphological evidence for such a relationship.
The NELUZ oblique case marker *ta is probably borrowed from a Cagayan Valley
language in which *s is regularly reflected as /t/. Although Tharp (1974a) reconstructs the
oblique as *sa for the Cagayan Valley languages, the form is ta in Central Cagayan Agta,
Atta, and Ibanag. In fact, none of the Cagayan Valley languages in Tharp’s study have a
synchronic form sa, but Tharp reconstructs *sa based on external evidence and because
ta in these languages derives regularly from *sa. However, *ta should probably be reconstructed for PNCORD and *sa for pre-NCORD.
The NELUZ languages share the reduction of the 2PL.GEN pronoun *=muyu to
*=muy with CCAGTA, but given the close borrowing relationship between CCAGTA
and DUP, this could easily be a borrowing into CCAGTA.
4.3 LEXICAL EVIDENCE FOR THE POSITION OF NELUZ. Himes (n.d.)
lists 326 proposed lexical innovations for Proto-Cordilleran or “Proto-Northern Philippines” (although it is not clear what the difference is between his PCORD and PNPH). After
excluding morphological innovations already discussed in the previous section, innovations that are repeated (for instance, *ʔagal ‘cry’ and *ʔagal ‘weep’), forms that we found
to occur in higher-level protolanguages, and forms for which we believe the evidence to be
very weak, we considered 273 unique pair-meaning innovations, of which 18 forms
(140)‒(157) were shared with at least two NELUZ languages. Some of these are entirely
unique lexical items, some involve unique semantic shifts, and others reflect unique phonological innovations.
(140) PCORD *ʔatəd ‘to give’ (semantic shift < PAN *SatəD ‘escort’) > DUP atád,
PAH, PAR, CAS, NAG atǝ́d, KAS átəd
(141) PCORD *dakəl ‘ancestors’ (semantic shift < PPH *dakəl ‘large’) > DUP daddákal, PAH dəddikəl, NAG dədikə́l ‘parents’ (cf. PNELUZ *da[k]kə́l, ‘large’)
(142) PCORD *dakəl ‘flood’ (semantic shift < PPH *dakəl ‘large’) > DUP dakál i
dinum, PAR dikəl a dinum (cf. PNELUZ *da[k]kə́l ‘large’ + *dinum ‘water’)
(143) PCORD (also PNCORD) *dutdut ‘feather, body hair’ > PAH dútdut ‘body
hair’, PAR, KAS dútdut ‘body hair, feather’, DIN, CAS dutdut ‘feather’
(144) PCORD *bu(st)(əu)y ‘calf of leg’ > PAH, DIN, CAS butóy, NAG butúy
(145) PCORD *baŋləs ‘rotten’ > PAH bíŋləs, NAG béŋles ‘rotten, of meat’
(146) PNPH *piklat ‘scar’ (phonological innovation < PAN *pil(ae)k) > DUP , PAR
péklat, PAH píklat, CAS, KAS peklát (cf. NAG piláʔ) (also Tagalog peklat,
probably a borrowing from a Northern Philippine source)
(147) PCORD *kəməl ‘squeeze’ > PAH kaməl-kaməl-ən, PAR kə́mməl ‘squeeze,
check for softness’
(148) PCORD *lətəg (PNCORD *ləttəg) ‘swell’ > DUP linómtag (< **l<um>attag),
NAG linúmtəg
(149) PCORD *p(əu)dəw (PNCORD *pudəw) ‘white’ > PAH, PAR, DIN, KAS pudéw
158
OCEANIC LINGUISTICS, VOL.
52, NO. 1
(150) PCORD *laman ‘wild pig’ (semantic innovation < PPH ‘flesh’) > DUP , PAH,
CAS, NAG lamán
(151) PCORD *s(iu)gəm (PNCORD *təggəm, PMP *s(iə)jəm) ‘ant’ > PAH səggə́m
‘small, red ant’; CAS səgə́m ‘ant (general)’; KAS əgə́m ‘large, red ant’
(152) PCORD *yupyup ‘blow’ (innovation < PAN *iyup) > PAR, KAS yópyop
(153) PCORD *yəgyəg ‘earthquake’ > DUP yágyag, PAH, DIN yógyog, CAS, KAS
yǝ́gyəg, ‘shake’
(154) PCORD (also PNCORD) *tukak ‘frog’ > DUP, PAH, PAR, DIN, CAS, KAS,
NAG tukák
(155) PCORD *s(ai)luk (phonological innovation < PPH *siluŋ) ‘space under
house’ > PAH sahók, KAS sarók
(156) PCORD (also PNCORD) *takdəg ‘stand’ > DUP, PAH, PAR, CAS, KAS, NAG
taknəg35
(157) PCORD *tubuŋ ‘water container’ > PAR, KAS tubúŋ ‘bamboo water scooper’
The following form (158) appears to be shared with PCORD/PNPH, but has undergone a semantic shift unique to the NELUZ languages.
(158) PCORD (also PNCORD) *ʔubət ‘anus, buttocks’36 > PNELUZ *ubə́t ‘vulva’ >
DUP, PAR ubát, PAH ubbát, CAS, KAS, NAG ubə́t
The following five PCORD/PNPH lexical innovations (159)‒(163) were shared only
with DUP, for which we have a much richer lexical database. We have eliminated a number of forms that appeared to be borrowings from a non-NELUZ language (either ILK or
CCAGTA, the two major sources of borrowing for DUP).
(159) PCORD *salug (PNCORD talug) ‘swim’ > DUP sulóg (semantic innovation <
‘river’)
(160) PCORD *sakbat ‘carry (on shoulders)’ (phonological shift < PAN *sabat) >
DUP sakbét ‘carry by slinging over the shoulder’
(161) PCORD (also PNCORD) *bukəl ‘round’ > DUP nag-bukal (cf. DUP bukal
‘seed’; < PMP *bukəl ‘seed’)
(162) PCORD *tapəw ‘float’ > DUP tappáw
(163) PCORD *gudwa (PNCORD *gədduwan) ‘split’ > DUP gaddúwa ‘half’ (based
on *dua ‘two’)
It seems, then, that there is a sufficient amount of lexical evidence to support placing
the NELUZ languages within the Northern Luzon subgroup, but it appears that they constitute a primary branch of Northern Luzon, and do not immediately subgroup with the
CV languages.
5. CONCLUSION. This paper has attempted to fill a gap in the existing literature
by presenting data on the Northeastern Luzon subgroup of languages. By examining the
phonological, lexical, and functor innovations, we have shown that these languages form
35. This is one of the handful of forms where DUP retains the schwa.
36. Himes includes ‘vagina’ as one of the meanings, but this seems to be based on the CAS reflex.
159
THE NORTHEASTERN LUZON SUBGROUP
an independent subgroup, as has been previously suggested. Moreover, although it has
long been assumed that the Northeastern Luzon languages subgroup with the primarily
non-Agta languages of the Cagayan Valley, we have shown that this is probably not the
case, due to the fact that there are no exclusively shared innovations between the NELUZ
languages and the Cagayan Valley languages. As such, there can be no “Northern Cordilleran” subgroup (traditionally assumed to consist of the Cagayan Valley and Northeastern Luzon languages). There is some evidence that the NELUZ languages belong to the
Northern Luzon (Cordilleran) subgroup, in which case both the Northeastern Luzon subgroup and the Cagayan Valley subgroup would form primary branches of Northern
Luzon. However, since relatively few innovations have been identified for Northern
Luzon (Reid 2006 notwithstanding), a more definite placement of these languages awaits
further research on the larger Northern Luzon subgroup.
APPENDIX 1. LIST OF ABBREVIATIONS
AV
CAS
CCAGTA
CCORD
CV
CONT
DEF
DIN
DU
DUP
EXC
EXT
GEN
GCPH
ILK
IMPER
INC
INCOM
INDEF
INF
KAS
LOC
LSTNR
LV
LVF
NAG
actor voice
Casiguran Agta
Central Cagayan Agta
Central Cordilleran
Cagayan Valley
continuative
definite
NCORD
NEG
NELUZ
NOM
NONREF
OBL
PAH
Dinapigue Agta
dual
Dupaningan Agta
exclusive
existential
genitive
Greater Central
Philippines
Ilokano
imperative
inclusive
incompletive
indefinite
infinitive
Kasiguranin
locative
listener
locative voice
low vowel fronting
Nagtipunan Agta
PAN
PAR
PCCORD
PERS
PMP
PNCORD
PNELUZ
PNLUZ
PNPH
POSS
PPH
PV
PV2
SCORD
SPKR
TAG
TOP
UDGT
+
Northern Cordilleran
negative
Northeastern Luzon
nominative
nonreferential
oblique
Pahanan Agta (Palanan
Dumagat)
Proto-Austronesian
Paranan
Proto-Central Cordilleran
personal name marker
Proto‒Malayo-Polynesian
Proto-Northern Cordilleran
Proto-Northeastern Luzon
Proto-Northern Luzon
Proto-Northern Philippine
possessive
Proto-Philippine
patient voice
secondary patient voice
Southern Cordilleran
speaker
Tagalog
topic
Umiray Dumaget
loanword
OCEANIC LINGUISTICS, VOL.
160
52, NO. 1
APPENDIX 2. FUNCTOR SETS
1. PRONOUNS
DUP
PAH
PAR
DIN
hikə́n
hikó
siyá
hikamí
hikitá
hikitám
hikám
hidi
hikə́n
hikó
siyá
sikamí
sikitá
sikitám
sikám
hidí
=ak
=ka
(hikúna)
=kamí
=kitá
=kitám
=kam
hidi
=ək
=ka
siyá
=kamí
=kitá
=kitám
=kam
hide
ko
mo
na
mi
ta
tam
moy
di
1SG
nikán
2SG
nikáw
3SG
nikúna
CAS
NAG
PNELUZ
KAS
hikə́n sakən
hikóʔ sikó
sya
siyá
—
sikamí
hikitá sikitá
hikitám sikitám
hikám sikám
hídi
sidí
sakə́n
sikóʔ
syá
sikamí
sikitá
sikitám
sikám
sídi
*si-akən
*si-kaw (>*-ko)
*siya
*si-kami
*si-kita
*si-kitam
*si-kam
*sidi
akú
ikáw
siyá/sya
kamíʔ
kitá
táyuʔ
kayúʔ
siláʔ
=ək
=ka
siya
=kamí
=kitá
=kitám
=kam
hidíʔ
=ək
=ka
sya
=kámi
=kitá
=kitám
=kam
=hide
=ək
=ka
siyá
=kamí
=kitá
=kitám
=kam
sidé
=ək
=ka
siyá
=kamí
=kitá
=kitám
=kam
sidíʔ
*=ək
*=ka
*siya
*=kami
*=kita
*=kitam
*=kam
*side
akú
ka
Ø/sya/siyáʔ
kamíʔ
kitá
táyuʔ
kayúʔ
siláʔ
ko
mo
na
mi
ta
tam
moy
di
ku
mu
na
mi
ta
tam
moy
di
ku
mo
na
mi
ta
tam
moy
di
ku
mu
na
mi
ta
tam
moy
di
ku
mu
na
mi
ta
tam
moy
di
*=ku
*=mu
*=na
*=mi
*=ta
*=tam
*=muy
*=di
kúʔ
múʔ
niyáʔ
méʔ
táʔ
tam
muy
déʔ/niláʔ
nikə́n
dikókuʔ nikə́n
TOPICALIZED NOMINATIVE
1SG
2SG
3SG
1EXCL
1INCL.DU
1INCL.PL
2PL
3PL
hikán
hikáw
hikúna
hikamí
hikitá
hikitám
hikám
hidí
NOMINATIVE
1SG
2SG
3SG
1EXCL
1INCL.DU
1INCL.PL
2PL
3PL
GENITIVE
1SG
2SG
3SG
1EXCL
1INCL.DU
1INCL.PL
2PL
3PL
OBLIQUE
1EXCL
nikamí
1INCL.DU nikitá
diyakə́ diyakə́n *ni/di-akən
n
nikóʔ dikómuʔ nikóʔ dikó
dikóʔ *ni/di-kaw
(>*-ko)
nikoná dikónaʔ nikuná diyá
diyáʔ
*ni/di-ko-na,
ʔ
ʔ
SNEL *diya
nikamíʔ dikómiʔ —
dikamí dikamíʔ *ni/di-kami
nikitá dikótaʔ —
dikitá —
*ni/di-kita
1INCL.PL nikitám
nikám
2PL
nikitám dikótam —
dikitám —
nikám dikómoy nikám dikám dikám
*ni/di-kitam
*ni/di-kam
3PL
nidéʔ
dikódiʔ nidéʔ
didé
didíʔ
*ni/di-de
taká
takáʔ
taká
taká
*taka
nidí
1SG>2SG* taka
*
taká
sa ákin; sakókoʔ;
kaóko
sa iyúʔ; kómoʔ;
kaómoʔ
sa kanyáʔ;
kónya; koniyáʔ
sa ámin; sakómeʔ
saátaʔ; sakótaʔ;
kootáʔ
sa átam; sa kótam
sa ínyo; sa ómoy;
sa komóy
sa kanilá, kaónila,
sakaódeʔ
—
In both DUP and CAS, taka is used for 1SG>2SG, while takám is used for 1SG >2PL (Robinson
2011, Headland and Healey 1974). We did not elicit 1SG >2PL, so we lack data for this in the
other languages of NELUZ.
161
THE NORTHEASTERN LUZON SUBGROUP
2. DEMONSTRATIVES
DUP
SG.
PAH
PAR
DIN
CAS
saiyé səyéʔ
NAG
PNELUZ KAS (TAGbased)
PL.
TOPICALIZED NOMINATIVE
SPKR/
LSTNR
idé
dagínde
saiyɛ́
(sa) iyə́n
LSTNR
idáy
idó
dagínday
dagínto
saénaʔ
—
saiyə́n(/d) —
—
—
səináʔ
—
saínaʔ *sa-ʔináʔ —
—
—
—
FAR
sayíʔ *sa-ʔiye
—
NOMINATIVE
SPKR/
LSTNR
idé
dagínde
iyé
iddi
iyé
yéʔ
yi
*ye
itó, ‘tu
LSTNR
idáy
dagínday
ináʔ
ináʔ
ináʔ
ináʔ
*inaʔ
iyán, ‘yan
FAR
idó
dagínto
itúd
yúdiʔ
<iyən>
itúd
<iyud>
itúd
yód
—
*i[t]ud
‘yun
iddi
naiyé (na)iyéʔ (na)yi *na[i]yéʔ nitú
GENITIVE
SPKR/
LSTNR
na idé
LSTNR
na idáy na dagínday nan-ináʔ hidhaə́n
na idó na dagínto ( ) itúd —
FAR
na dagínde niiyé
naináʔ (na)ináʔ naínaʔ *naináʔ niyán
—
(na)yód —
*nai[ ]úd —
OBLIQUE
SPKR/
LSTNR
ihé
—
héʔ
həddi
LSTNR
iháy
—
hináʔ
FAR
ihó
—
hud
haən, hən, hináʔ sináʔ
šiyə́n, *sinaʔ
had
sináʔ
hud, haód hud sáʔ; sod —
*sud
doón
nandíto
héʔ
séʔ
síʔ
*seʔ
ditú
diyán
LOCATIVE
SPKR/
LSTNR
(i)hé
—
—
haddi
—
taiyéʔ
—
LSTNR
(i)háy
(i)hó
—
—
—
—
haan
húdi
—
—
taináʔ
taiyód;
taiyáʔ
kaináʔ —
kayá, —
taiyáʔ
FAR
—
nandyán
nandón
3. NEGATORS
DUP
awán
IMPERATIVE awan
EQUATIONAL awán
(bakkán)
EXISTENTIAL awán
(limus)
DON’T LIKE
madi(an)
VERBAL
PAH
(a)wán
dyan
bakə́n
PAR
awán
diyán
bakə́n
awán
awán
umə́d
DIN
awán
dyán
bakə́n
CAS
awán
diyán
bəkə́n ~
bakə́n
awán awán
umə́d, umə́d idél,
sála
sála
DON’T KNOW haybas
awán ku áhay
ahə́y ahə́y
(awan ko tukóy
kantandi)
NAG
awán
diyán
bəkkə́n
PNELUZ
(*awan)
*diyan
*b(aə)kən
KAS
hindéʔ; di
wag
hindéʔ; di
awán
(*awan)
waláʔ
sála
N *uməd, ajə́k, umádi
S *sála
awán ku, *ahəy
ahə́y
tukúy
OCEANIC LINGUISTICS, VOL.
162
52, NO. 1
4. VERBAL CONJUGATIONS
DUP
PAH
<um>
INF
-um-umCOMP -inum-umm-,
-immCONT (-um-)* (-um-)
INCOM (-um-)
(-um-)
ACTIVE VOICE WITH magINF
magmag-
PAR
DIN
CAS
PNELUZ KAS
-um-inum-
-um-inum-
-um-inum-
*<um> -um*<inum> -inum-, ni-…-um-
CumV(-um-)
(-um-)
(-um-)
CumV-um-
—
*<um>
məg-CV-…-ən
-um-
mag-
mag-
mag- ~
məgnag- ~ nəg-†
(mag- ~
məg-)
(mag- ~
məg-)
*mag-
mag-
*nag—
minag—
*mag-
—
-ən
-inCV-…-ən
-ən
*-ən
*<in>
—
*-ən
-ən
-in-, niCin-; CV-…-ən
-ən
-an
ni-/-in-…-an
inii-CV—
ACTIVE VOICE WITH
CONT
nag(mag-)
nag(mag-)
nag(mag-)
nag(mag-)
INCOM
(mag-)
(mag-)
(mag-)
(mag-)
-ən
-in(-in-)
(-ən)
-ən
-inCV-…-ən
(-ən)
-ən
-in(-ən)
(-ən)
COMP
PATIENT VOICE
-an
-inCONT —
INCOM (-an)
INF
COMP
LOCATIVE VOICE
INF
COMP
CONT
INCOM
-an
-an
-an
-an
-an
-in-…-an -in-…-an -in-…-an -in-…-an -in-…-an
—
(-an)
(-an)
(-an)
CV…-an (-an)
(-an)
(-an)
CV…-an
-an
*-an
*<in>…
-an
—
*-an
iinCV(i-)
iníCiyV-/iCui-
*i*ni?
*i-
CV-…-an
-an
SECONDARY PATIENT VOICE
iniCONT —
INCOM (i-)
INF
COMP
*
†
iin(in-)
(i-)
ini(i-)
(i-)
Optional reduplication is possible to emphasize the continuative in DUP actor voice (see
Robinson 2011:126)
Headland and Healey (1974) note the existence of minag- and minəg-, but these did not
occur in our data.
5. EXISTENTIALS AND QUANTIFIERS
HAVE
DON’T
HAVE
MANY,
A LOT
FEW,
A LITTLE
DUP
atóy
PAH
tehud
PAR
tehód
DIN
te(t);
tehúd
awán
CAS
NAG
PNELUZ
teə́gseʔ əgséʔ; te N *tehud
awán
awán
awán
awán awán
(límus)
makpál makpə́l
makpál makpál meádu meádu
<makpal,
cf.
kuməppal>
ballék bádit
bádit bəllik kətihək kəttíhək
(*awan)
KAS
me, mérun
<terun, teruun>
waláʔ
N *makpal marámi
S *meʔadu
(cf. UDGT)
(ILK adu)
*b(aə)ddit sabəddít
*bəllik
163
THE NORTHEASTERN LUZON SUBGROUP
6. INTERROGATIVES
DUP
ánya, héya
héya
WHO (NOM)
WHOSE
makin;
heya i makin…
WHEN (PAST)
hángan
WHEN (FUT)
hángan
WHERE (PAST/FUT) hádya
hénan, hádya
WHERE (PRES.)
FROM WHERE
taga hadya
apay
WHY
HOW (MANNER)
panyan
HOW MUCH
kasano*
(EXTENT)
HOW MUCH ($)
hángan
HOW MANY
hángan a piráso
WHICH
(anya)
HOW MANY TIMES naminhángan
WHAT
TO WHERE
CAS
ánya
teesiyáʔ
WHAT
WHO (NOM)
PAH
ánya <ənya>
(ti)déyaʔ
kandéya
PAR
ányaʔ
tidé(ya)
kandéyaʔ
DIN
ánya
tidéya
(tidéyaʔ)
nengkán
nikán
hidyá
hadya
taghídya
bákin
kónya
sakónya…ti ka-
nikán
nikán
hadya
hadya
tagahádya
bákit
kúdya
sakúdya…ti ka-
nikasangán
nikasangán
hadya
hadya
tagahadya
bákit
(anya)
gasíno
sángan
sángan a pirásuʔ
ánya…ti duwwa
pensángan
hidyá
NAG
anya
syesiyá
tagsángan
sángan a piráso
hadya ti duwa’y
nakasángan
hadya
PNELUZ
*anya
N *tidéya
S *[ ]siya
PP *kandéya
PP = *nikan
S *nikasiya
sángan
sángan
hadyá…ta dúwwaʔ(é)
nakasángan
hadyá
KAS
ano
sínu
WHEN (PAST)
kiniasiyáʔ
nikəsiyáʔ
WHEN (FUT)
nikəsiyáʔ
WHERE (PAST/FUT)
(ta)ahé
ahé
tagaahé
bákit, ataáy
pakódya
kódya
(syesiyá)
nikasya ~
nikasye
nakasya ~
nikasye
ahé
ahé
tagaahé
ataáy, atáe
pinakódya
kódya…ka-
tigsángan
sángan
ahé ta adúwaʔ
sángan
*t(ia)gsángan
sangán a pirásuʔ *sángan
ahé
=‘where’
nakasángan
ahé
nakasángan
taahé
WHOSE
WHERE (PRES.)
FROM WHERE
WHY
HOW (MANNER)
HOW MUCH
(EXTENT)
HOW MUCH
($)
HOW MANY
WHICH
HOW MANY TIMES
TO WHERE
*
Kasano is an Ilokano loan.
kaníno
ka-ilán
ka-ilán
*hadya
*hadya
*taga-…
—
*pakodya
*[sa]kodya
*naka-sángan
=‘where’
sáan
nasaán
tagasáan
bákit
paáno
sapaáno
təgsángan
sangán a mómon
alín sa
dúwwaʔ
nakasángan
sáan
OCEANIC LINGUISTICS, VOL.
164
52, NO. 1
7. ADVERBS OF TIME
DUP
ha essa nakkápon
PAH
ten ɛssa aldɛw
PAR
tu issa a aldíw
DIN
tu essa aldíw
nakkápon
ha kalláp hé(ya)
nak(k)óya
(ni)yéʔin
ayénan, nokkán
noʔúgma
mekadúwa a pamalák
nennápon
nenggibíʔ
nemmamayáʔ
nadíd
mamaya
niiláw
puwéra niiláw
tuápon
tugibí(heʔ)
tumámayáʔ
nadíd
mamayáʔ
niiláw
pwéra niiláw
tuápon(heʔ)
—
tumámayá(héʔ)
nadíd
mamayáʔ
nugágabíʔ
nu éssa aldíw
mallédum
pamalák
ápon
kállap
lúbok na kalláp
pómsag
dímadímang
tangháliʔ
apón
gibíʔ
(gibíʔ)
(dímadímang)
gagábiʔ
tangháliʔ
apon
gibíʔ
hatinggabi
kaldiwán
gág bíʔ
tánghali
pón
kəllə́p
hatinggabi
ésding aldíw dən
pamalak, aldew
CAS
to esá a aldéw
aldɛw
NAG
tu éssa a aldóʔ
aldíw
PNELUZ
—
to apón
tukələ́p(seʔ)
tomamayaséʔ
nadíd
mamayáʔ
niiláw, tagabiyáʔ
tuápon (séʔ)
tukələ́p(séʔ)
tumámaya(séʔ)
nadíd
mamayáʔ
tagábiyáʔ
DAY AFTER
TOMORROW
(sa)péraʔ niiláw
ta íssa a gagábiʔ
aldíw
KAS
nung éssang
aldéw
*tu-ʔápon
kahápon
—
nunggibíʔ
+tu-mámayá[seʔ] kanína
*nadid
ngay-ún
+mamayáʔ
mamayá
*niʔiláw,
niiláw
S *tagabiyáʔ
+pwéra-niʔiláw
pwéra niiláw
MORNING
gagábiʔ
tangháliʔ
apón
kələ́p
hatinggabí
amulaldéw
gagábiʔ
tangháliʔ
apón
kələ́p
—
—
*gagábiʔ
+tangháliʔ
*apon
*kələp, *gabiʔ
+hatinggabi
—
umágaʔ
tangháliʔ
ápon
gibíʔ
hatínggibíʔ
amuláldew
aldéw
aldóʔ
*aldaw
aldéw
DAY BEFORE
YESTERDAY
YESTERDAY
LAST NIGHT
EARLIER
TODAY, NOW
LATER
TOMORROW
DAY AFTER
TOMORROW
MORNING
NOON
AFTERNOON
NIGHT
MIDNIGHT
EARLY
MORNING
DAY
DAY BEFORE
YESTERDAY
YESTERDAY
LAST NIGHT
EARLIER
TODAY, NOW
LATER
TOMORROW
NOON
AFTERNOON
NIGHT
MIDNIGHT
EARLY
MORNING
DAY
165
THE NORTHEASTERN LUZON SUBGROUP
8. PARTICLES
ALSO
DISCOVERY
EMPHASIS
FIRST
ONLY
POSSIBLE
REQUEST
AS IF
QUOTATIVE
ALREADY
AGAIN
EVEN
ALSO
DISCOVERY
EMPHASIS
FIRST
ONLY
POSSIBLE
REQUEST
AS IF
QUOTATIVE
ALREADY
AGAIN
EVEN
*
TAG
din
pala
ngáʔ
múna
lang, lámang
yáta
—
párang
daw
na
muli
pati
DIN
—
—
—
laʔ
laʔ
—
—
kumə́n a…
kan
dən/rən
huwáy
(kahít)
ILK
=met
=gayam
=ngarud
=pay=la(e)ng
=la(e)ng
DUP
=bi
=noman
=man
=pala*
=la
=wade(n), =wan
paki—
kas-la...
konna ha..
=kan(o)
=kan
=en, =on
=dan (‘-n, ‘-on)
manen
=*manon
uray
mensan
CAS
NAG
béman béʔ
bíʔ
baléʔ
bal
ngani
ngáni (dən)
pa san
pa san
san
san
wadíʔ
wadíʔ
paki-…pad
pad
kumə́n saʔ; kumə́naʔ koman ti…
kan
—
huwáy
páti
huwáy
patí
PAH
—
bal
—
palláʔ
—
—
(bénid)
—
—
dən
ruwáy
patí
PNELUZ
*bi
*bəl[i]
*ngani
*palla
*la
*wad[i]
*p(ae)d
S *kumə́n
*kan
*dən
*ruwáy
*pati
PAR
—
bəl
nganíʔ
palla
laʔ
(hu)wád
pɛd
—
kan
dən
huwáy
pati
KAS
din/rin
balíʔ
ngáni
muná
laang
yáta
paki-…nɛd
kumə́n
daw
na
ruwáy
patí
Although the /l/ in this form most commonly occurs as a singleton, it can occur as a geminate.
REFERENCES
Bellwood, Peter. 1997. Prehistory of the Indo-Malaysian archipelago. Honolulu:
University of Hawai‘i Press.
Blust, Robert A. 1972. Note on PAN *qa(R)(CtT)a ‘outsiders, alien people’. Oceanic
Linguistics 11:166‒71.
———. 1983‒84. Austronesian etymologies II. Oceanic Linguistics 22‒23:29‒149.
———. 1987. Lexical reconstruction and semantic reconstruction: The case of
Austronesian “house” words. Diachronica 4:79–106.
———. 1991a. Patterns of sound change in the Austronesian languages. In Patterns of
change, change of patterns: Linguistic change and reconstruction methodology,
ed. by Philip Baldi, 129‒65. Berlin: Mouton de Gruyter.
———. 1991b. The Greater Central Philippines hypothesis. Oceanic Linguistics
30:73‒129.
———. 1999. Subgrouping, circularity and extinction: Some issues in Austronesian
comparative linguistics. In Selected papers from the Eighth International Conference on Austronesian Linguistics, ed. by Elizabeth Zeitoun and Paul Jen-Kuei Li,
31‒94. Taipei: Academia Sinica.
166
OCEANIC LINGUISTICS, VOL.
52, NO. 1
———. 2000. Why lexicostatistics doesn’t work: The ‘universal constant’ hypothesis
and the Austronesian languages. In Time depth in historical linguistics, vol. 2:
Papers in the prehistory of languages, ed. by C. Renfrew, A. McMahon, and L.
Trask, 311‒31. Cambridge: The McDonald Institute for Archaeological Research.
———. 2005. The linguistic macrohistory of the Philippines: Some speculations. Current
issues in Philippine linguistics and anthropology: Parangal kay Lawrence A. Reid, ed.
by Hsiu-chuan Liao and Carl R. Galvez Rubino, 31‒68. Manila: Linguistic Society of
the Philippines and SIL Philippines.
———. 2006. Anomalous liquid: Sibilant correspondences in Western Austronesian.
Oceanic Linguistics 45:210‒16.
———. 2009. The Austronesian languages. Canberra: Pacific Linguistics.
———. 2010. The Greater North Borneo hypothesis. Oceanic Linguistics 49:44‒118.
Blust, Robert A, and Stephen Trussel. In progress. Austronesian comparative dictionary,
web edition. Available at: http://www.trussel2.com/ACD/. Accessed December 20,
2012.
Charles, Mathew. 1974. Problems in the reconstruction of Proto-Philippine phonology
and the subgrouping of the Philippine languages. Oceanic Linguistics
13:457‒509.
Conant, Carlos E. 1910. The RGH law in Philippine languages. Journal of the
American Oriental Society 31:70‒85.
Davis, Philip W., and Angela D. Mesa. 2000. A dictionary of Yogad. Munich: Lincom
Europa.
Dyen, Isidore. 1965. A lexcostatistical classification of the Austronesian languages.
New Haven: Yale University.
———. 1972. Non-gradual regular phonetic changes involving sibilants. In Langues et
techniques, nature et société, vol. 1: Approche linguistique, ed. by Jacqueline M.
C. Thomas and Lucien Bernot, 95‒99. Paris: Klincksieck
Fox, Robert B., Willis E. Sibley, and Fred Eggan. 1965. A preliminary glottochronology
for Northern Luzon. Asian Studies 3(1):103‒13.
Goda, Toh. 2003. Postcolonialism and local politics in Southeast Asia. Quezon City:
New Day Publishing.
Headland, Thomas N. 1975. Report of Eastern Luzon language survey. Philippine
Journal of Linguistics 6(2):47‒54.
———. 1997. Revisionism in ecological anthropology. Current Anthropology
38(4):605–30.
Headland, Thomas N., and Janet D. Headland. 1974. A Dumagat [Casiguran]–English
dictionary. Canberra: Pacific Linguistics.
Headland, Thomas N., and Alan Healey. 1974. Grammatical sketch of Casiguran
Dumagat. Papers in Philippine Linguistics 6:1‒54.
Headland, Thomas N., and Elmer P. Wolfenden. 1967. The vowels of Casiguran
Dumagat. In Studies in Philippine anthropology, ed. by Mario D. Zamora,
592‒96. Quezon City: Alemar Phoenix.
Himes, Ronald S. 1998. The Southern Cordilleran group of Philippine languages.
Oceanic Linguistics 37:120‒77.
———. 2002. The relationship of Umiray Dumaget to other Philippine languages.
Oceanic Linguistics 41:275–94.
———. n.d. NPH reconstructions. MS.
Hontiveros, Greg. 2000. Butuan of a thousand years. Butuan City (Philippines):
Butuan City Historical & Cultural Foundation, Inc.
Lewis, M. Paul, ed. 2009. Ethnologue: Languages of the world. 16th ed. Dallas: SIL
International.
Liao, Hsiu-chuan. 2005. Pronominal forms in Central Cagayan Agta: Clitics or agreement
features? In Current issues in Philippine linguistics and anthropology: Parangal kay
THE NORTHEASTERN LUZON SUBGROUP
167
Lawrence A. Reid, ed. by Hsiu-chuan Liao and Carl R. Galvez Rubino, 346‒60.
Manila: Linguistic Society of the Philippines and SIL Philippines.
Lobel, Jason William. 2010. Manide: An undescribed Philippine language. Oceanic
Linguistics 49:480‒512.
———. 2011. Inagta Alabat: Another undescribed Philippine language. MS.
———. 2012. Philippine and North Bornean languages: Issues in description, subgrouping, and reconstruction. PhD diss., University of Hawai‘i.
Lobel, Jason William, and Labi Hadji Sarip Riwarung. 2011. Maranao: A preliminary
phonological sketch with supporting audio. Language Documentation and Conservation 5: 31‒59.
Lobel, Jason William, and Laura C. Robinson. n.d. Kasiguranin: Notes on the position of
a Philippine contact language. MS.
Mayfield, Roy. n.d. Gattaran Agta dictionary. Typescript. 350 pp.
McFarland, Curtis D. 1974. The dialects of the Bikol area. PhD diss., Yale University.
———. 1980. A linguistic atlas of the Philippines. Tokyo: Institute for the Study of
Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies.
Nickell, Thomas L. 1985. A partial stratificational analysis of Eastern Cagayan Agta language. In The Agta of northeastern Luzon: Recent studies, ed. by P. Bion Griffin and
Agnes Estioko-Griffin, 119‒46. Cebu City: University of San Carlos Publications.
Nickell, Thomas L., and Kristy Nickell. 1987. Eastern Agta dictionary. Unpublished ms.
Oates, William J., and Lynette F. Oates. 1955. A vocabulary of Central Cagayan
Negrito. Manila: Summer Institute of Linguistics.
———. 1958. The phonemes of Central Cagayan Negrito. In Oceania Linguistic
Monographs 3, ed. by A. Capell and S. A. Wurm, 48‒66. Sydney: University of
Sydney.
Reid, Lawrence A. 1974. The Central Cordilleran subgroup of Philippine languages.
Oceanic Linguistics 13:511‒60.
———. 1979a. Towards a reconstruction of the pronominal systems of Proto-Cordilleran,
Philippines. In South-East Asian linguistic studies 3, ed. by Nguyen Dang Liem,
259‒75. Canberra: Pacific Linguistics.
———. 1979b. Evidence for Proto-Philippine nominative marking. Philippine Journal
of Linguistics 10(1‒2):1‒20.
———. 1982. The demise of Proto-Philippines. In Papers from the Third International
Conference on Austronesian Linguistics, vol. 2: Tracking the travellers, ed. by Amran
Halim, Lois Carrington, and S. A. Wurm, 201‒16. Canberra: Pacific Linguistics.
———. 1987. The early switch hypothesis: Linguistic evidence for contact between
Negritos and Austronesians. Man and Culture in Oceania 3:41–59.
———. 1989. Arta, another Philippine Negrito language. Oceanic Linguistics
28:47‒74.
———. 1991. The Alta languages of the Philippines. In VICAL 2, Western Austronesian
and contact languages: Papers from the Fifth International Conference on Austronesian Linguistics, ed. by Ray Harlow, 265‒97. Te Reo Special Publication. Auckland:
Linguistic Society of New Zealand.
———. 1994. Unravelling the linguistic histories of the Philippine Negritos. In
Language contact and change in the Austronesian world, ed. by Tom Dutton
and Darrell T. Tryon, 443‒75. Berlin: Mouton de Gruyter.
———. 2006. On reconstructing the morphosyntax of Proto-Northern Luzon. Philippine
Journal of Linguistics 37(2):1‒63.
———. 2009. The reconstruction of a dual pronoun to Proto Malayo-Polynesian. In
Discovering history through language: Papers in honour of Malcolm Ross, ed. by
Bethwyn Evans, 461‒77. Canberra: Pacific Linguistics.
168
OCEANIC LINGUISTICS, VOL.
52, NO. 1
———. 2010. Historical linguistics and Philippine hunter-gatherers. In Piakandatu
ami Dr. Howard P. McKaughan, ed. by Loren Billings and Nelleke Goudswaard,
234‒60. Manila: Linguistic Society of the Philippines and SIL Philippines.
Reid, Lawrence A., ed. 1971. Philippine minor languages: Word lists and phonologies.
Oceanic Linguistics Special Publication No. 8. Honolulu: University of Hawai‘i Press.
Robinson, Laura C. 2011. Dupaningan Agta: Grammar, vocabulary, and texts. Canberra:
Pacific Linguistics.
Ross, Malcolm D. 1992. The sound of Proto-Austronesian: An outsider’s view of the
Formosan evidence. Oceanic Linguistics 31:23‒64.
Rubino, Carl Ralph Galvez. 2000. Ilocano dictionary and grammar: PALI Language
Texts. Honolulu: University of Hawai‘i Press.
Scott, William Henry. 1984. Prehispanic source materials for the study of Philippine
history. Quezon City: New Day Publishers.
———. 1992. Looking for the prehispanic Filipino and other essays in Philippine history.
Quezon City: New Day Publishers.
Tharp, James A. 1974a. The Northern Cordilleran subgroup of Philippine languages.
University of Hawai‘i Working Papers in Linguistics 6(6):53‒114.
———. 1974b. Notes on the Ilokano reflexes of Proto-Ausronesian *R. University of
Hawai‘i Working Papers in Linguistics 6(6):47‒51.
Thomas, David, and Alan Healey. 1962. Some Philippine language subgroupings: A
lexicostatistical study. Anthropological Linguistics 4(9): 21‒33.
Vanoverbergh, Morice. 1937. Some undescribed languages of Luzon. Nijmegen:
Dekker and van de Vegt.
———. 1972. Isneg‒English vocabulary. Honolulu: University of Hawai‘i Press.
Walton, Charles. 1979. A Philippine language tree. Anthropological Linguistics
21(2):70‒98.
Wolff, John U. 1973. Verbal inflection in Proto-Austronesian. In Parangal kay Cecilio
Lopez: Essays in honor of Cecilio Lopez on his seventy-fifth birthday, ed. by
Andrew B. Gonzalez, 71‒91. Quezon City: Linguistic Society of the Philippines.
———. 2003. The sounds of Proto-Austronesian. In Issues in Austronesian historical
linguistics, ed. by John Lynch, 1‒14. Canberra: Pacific Linguistics.
Yap, Fe Aldave. 1977. A comparative study of Philippine lexicons. Manila: Institute of
National Language.
Zorc, R. David. 1977. The Bisayan dialects of the Philippines: Subgrouping and
reconstruction. Canberra: Pacific Linguistics.
———. 1978. Functor analysis: A method of quantifying function words for comparing and classifying languages. The Fifth LACUS Forum, 510‒21. Columbia, SC:
Hornbeam Press.
———. 1979. On the development of contrastive word accent: Pangasinan, a case in
point. In Southeast Asian linguistic studies, vol. 3, ed. by Nguyen Dang Liem,
241‒58. Canberra: Pacific Linguistics.
lrobinson@linguistics.ucsb.edu
jasonlobel@yahoo.com