Channel: Statalist

Combining two datasets

Dear Statalisters,

I am currently working on a thesis on strategic alliances. For this thesis, I work with both patent data and alliance data. First, let me show a sample of each dataset.

Patent data, structured as follows: gyear = grant year, cat = main patent category, subcat = subcategory, subclass = subclass, gvkey = company identifier, patentsperyearpercompany = cumulative patents for the company (based on the gvkey).

gyear, cat, subcat, subclass, gvkey, patentsperyearpercompany
1976 1 19 173 5229 127
1976 1 19 565 4503 344
1976 3 31 427 1359 62
1976 1 19 11 7114 333
1976 6 63 190 3558 8
1976 1 14 289 8530 160
1976 1 19 290.12 7946 698
1980 1 15 751 4087 770
1980 5 53 188.8 5073 565
1981 2 21 309 1786 18
1981 4 46 796 1300 134
1981 1 11 653 7471 39
1981 3 32 6.14 4503 344
1982 6 69 4.5 1349 5
1982 6 61 590 8762 220
1985 6 69 212 2745 29
1988 4 46 70 6008 30
1990 4 42 414 5650 503
1991 5 55 120 4839 220
1991 4 42 481 4961 210
1991 4 42 413 7946 698
1991 2 22 103 4839 220
1991 6 69 599 4839 220
1991 5 55 110 4672 112
1991 1 19 86 2593 28
1991 1 19 41.5 1010 40
1991 4 46 260 7946 698
1991 1 14 59 10066 82
1991 1 19 4 3532 193
1991 4 44 227.24 5046 58
1992 4 45 106 1344 5
1993 3 31 16 9967 191
1994 4 49 11 2504 331
1994 1 15 555 4060 418
1995 2 22 614 8663 5
1995 1 14 17 5696 130
1996 1 19 476 2910 2
1997 1 19 345 9653 282
1998 1 19 232 9236 112

Alliance data:
Structured as follows: ID = unique ID of the particular alliance, year = year of the alliance (the second column), ParticipantNumber = which company is the focal firm (P1 in this case), Name = name of the company, GVKEY = company identifier.

ID, year, ParticipantNumber, Name, GVKEY
51 "1996" "P1" "DSP Group Inc" 29722
51 "1996" "P2" "NEC Corp" 7652
52 "1996" "P1" "Lucent Technologies Inc" 62599
52 "1996" "P2" "GTE Corp" 4961
53 "1996" "P1" "Silicon Graphics Inc" 12679
53 "1996" "P2" "Pixar Inc" 61627
54 "1996" "P1" "Intel Corp" 6008
54 "1996" "P2" "Lockheed Martin Corp" 6774
55 "1996" "P1" "Hewlett-Packard Co" 5606
55 "1996" "P2" "Fluke Corp" 4817
56 "1996" "P1" "Qualcomm Inc" 24800
56 "1996" "P2" "KYOCERA Corp" 6512
57 "1996" "P1" "Microsoft Corp" 12141
57 "1996" "P2" "Logic Works Inc" 61424
58 "1996" "P1" "Atmel Corp" 23767
58 "1996" "P2" "DSP Group Inc" 29722
59 "1996" "P1" "Microsoft Corp" 12141
59 "1996" "P2" "ForeFront Group Inc" 61756
60 "1996" "P1" "Molex Inc" 7506
60 "1996" "P2" "Teradyne Inc" 10453
61 "1996" "P1" "Lucent Technologies Inc" 62599
61 "1996" "P2" "Bay Networks Inc" 24357
62 "1996" "P1" "Intel Corp" 6008
62 "1996" "P2" "Amdahl Corp" 1392
63 "1996" "P1" "Qualcomm Inc" 24800
63 "1996" "P2" "VLSI Technology Inc" 11062
64 "1996" "P1" "DSP Group Inc" 29722
64 "1996" "P2" "Texas Instruments Inc" 10499
65 "1996" "P1" "Progress Software Corp" 24352
65 "1996" "P2" "Intersolv Inc" 12883
66 "1996" "P1" "Citrix Systems Inc" 61676
66 "1996" "P2" "Microsoft Corp" 12141
67 "1996" "P1" "Hewlett-Packard Co" 5606
67 "1996" "P2" "Seer Technologies Inc" 60962
68 "1996" "P1" "Texas Instruments Inc" 10499
68 "1996" "P2" "Sensormatic Electronics Corp" 9602
69 "1996" "P1" "Phoenix Technologies Ltd" 14626
69 "1996" "P2" "Inference Corp" 60955
70 "1996" "P1" "Phoenix Technologies Ltd" 14626
70 "1996" "P2" "AST Research Inc" 1055
71 "1996" "P1" "Microsoft Corp" 12141
71 "1996" "P2" "StarSight Telecast Inc" 28672
72 "1996" "P1" "Cadence Design Systems Inc" 13421
72 "1996" "P2" "Viewlogic Systems Inc" 24793
73 "1996" "P1" "BMC Software Inc" 14650
73 "1996" "P2" "Unison Software Inc" 61052
74 "1996" "P1" "Sun Microsystems Inc" 12136
74 "1996" "P2" "Novell Inc" 11910
75 "1996" "P1" "Energy Conversion Devices Inc" 4352
75 "1996" "P2" "Canon Inc" 2721

I need to combine the two datasets. As you can see, GVKEY is an identifier with which I can link the two datasets. However, it is not unique in either dataset. For example, Microsoft has both more than one patent and more than one alliance.

What I need is a dataset that shows all the patents granted to the firms that engaged in the above-mentioned strategic alliances. However, I only need the patents granted up until one year prior to the commencement of the strategic alliance. As an illustration, for the first strategic alliance (ID 51, in 1996) between DSP Group and NEC Corp, I would need all their patents up until 1995.

Another problem is that a given patent is granted to one firm (say, Microsoft) that engaged in several strategic alliances. Linking one patent to one strategic alliance partner would thus not be sufficient. For example, the patent has to be linked to Microsoft in both alliance ID 66 and alliance ID 71.
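To make the target concrete, here is a minimal sketch of the kind of result I am after. It assumes that joinby (which forms all pairwise combinations of observations within gvkey) is a suitable tool, which I am not sure of. The variable name year for the alliance year is my own placeholder, and the few rows are copied from the samples above.

```stata
* Sketch only: a handful of rows copied from the listings above.
* The variable name "year" for the alliance year is an assumed placeholder.
clear
input int gyear byte cat byte subcat float subclass long gvkey int patentsperyearpercompany
1988 4 46 70 6008 30
1991 4 42 481 4961 210
1998 1 19 232 9236 112
end
tempfile patents
save `patents'

clear
input int ID str4 year str2 ParticipantNumber str30 Name long GVKEY
52 "1996" "P2" "GTE Corp" 4961
54 "1996" "P1" "Intel Corp" 6008
62 "1996" "P1" "Intel Corp" 6008
end

* The key must have the same name in both datasets
rename GVKEY gvkey
* The alliance year is stored as a string in the sample, so convert it
destring year, replace

* Pair every alliance record with every patent of the same company
joinby gvkey using `patents'

* Keep only patents granted up to one year before the alliance started
keep if gyear <= year - 1

sort ID ParticipantNumber gyear
list ID year ParticipantNumber Name gvkey gyear subclass, sepby(ID)
```

With these rows, the 1988 Intel patent should be listed once under alliance 54 and once under alliance 62, which is the one-patent-to-several-alliances link described above. Companies without any alliance (gvkey 9236 here) drop out, because joinby keeps only matched observations by default.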

I hope I have made myself clear, but if you need further elaboration, I can provide it.

Any help is much appreciated!

Regards,

Guy Swillens

