An overview of social network analysis
Márcia Oliveira
Faculty of Economics, University of Porto, Porto, Portugal; The Laboratory of Artificial Intelligence and Decision Support, Institute for Systems and Computer Engineering of Porto, University of Porto, Porto, Portugal
Search for more papers by this authorCorresponding Author
João Gama
Faculty of Economics, University of Porto, Porto, Portugal; The Laboratory of Artificial Intelligence and Decision Support, Institute for Systems and Computer Engineering of Porto, University of Porto, Porto, Portugal
Faculty of Economics, University of Porto, Porto, Portugal; The Laboratory of Artificial Intelligence and Decision Support, Institute for Systems and Computer Engineering of Porto, University of Porto, Porto, PortugalSearch for more papers by this authorMárcia Oliveira
Faculty of Economics, University of Porto, Porto, Portugal; The Laboratory of Artificial Intelligence and Decision Support, Institute for Systems and Computer Engineering of Porto, University of Porto, Porto, Portugal
Search for more papers by this authorCorresponding Author
João Gama
Faculty of Economics, University of Porto, Porto, Portugal; The Laboratory of Artificial Intelligence and Decision Support, Institute for Systems and Computer Engineering of Porto, University of Porto, Porto, Portugal
Faculty of Economics, University of Porto, Porto, Portugal; The Laboratory of Artificial Intelligence and Decision Support, Institute for Systems and Computer Engineering of Porto, University of Porto, Porto, PortugalSearch for more papers by this authorAbstract
Data mining is being increasingly applied to social networks. Two relevant reasons are the growing availability of large volumes of relational data, boosted by the proliferation of social media web sites, and the intuition that an individual's connections can yield richer information than his/her isolate attributes. This synergistic combination can show to be germane to a variety of applications such as churn prediction, fraud detection and marketing campaigns. This paper attempts to provide a general and succinct overview of the essentials of social network analysis for those interested in taking a first look at this area and oriented to use data mining in social networks. © 2012 Wiley Periodicals, Inc.
This article is categorized under:
- Application Areas > Science and Technology
- Commercial, Legal, and Ethical Issues > Social Considerations
FURTHER READING
- Doreian P, Stockman FN, eds. Evolution of Social Networks. London: Routledge; 1997.
- Degenne A,
Forsé M.
Introducing Social Networks.
London/Thousand Oaks, CA/New Delhi: Sage Publications;
1999.
10.4135/9781849209373 Google Scholar
- Freeman LC. The Development of Social Network Analysis: A Study in the Sociology of Science. Vancouver, Canada: Empirical Press; 2004.
- Carrington PJ,
Scott J,
Wasserman S, eds. Models and Methods in Social Network Analysis.
New York: Cambridge University Press;
2005.
10.1017/CBO9780511811395 Google Scholar
- Knoke D,
Yang S. Social Network Analysis.
2nd ed.
London/Thousand Oaks, CA/New Delhi: Sage Publications;
2008.
10.4135/9781412985864 Google Scholar
REFERENCES
- 1 Moreno JL. Who Shall Survive? New York: Beacon House; 1953.
- 2
Domingos P,
Richardson M.
Mining the network value of customers. In:
Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
New York, NY: ACM
2001,
57–66.
10.1145/502512.502525 Google Scholar
- 3
Richardson M,
Domingos P.
Mining knowledge-sharing sites for viral marketing. In:
Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
New York, NY: ACM
2002,
61–70.
10.1145/775047.775057 Google Scholar
- 4 Leskovec J, Adamic LA, Huberman BA. The dynamics of viral marketing. ACM Trans Web 2007, 1: 228–237.
- 5
Dasgupta K,
Singh R,
Viswanathan B,
Chakraborty D,
Mukherjea S,
Nanavati AA,
Joshi A.
Social ties and their relevance to churn in mobile telecom networks. In:
Eleventh International Conference on Extending Database Technology: Advances in Database Technology.
New York, NY: ACM
2008,
668–677.
10.1145/1353343.1353424 Google Scholar
- 6 Wei C-P, Chiu I-T. Turning telecommunications call details to churn prediction: a data mining approach. Expert Syst Appl 2002, 23: 103–112.
- 7 Xu J, Che H. Criminal network analysis and visualization. Commun ACM 2005, 48: 101–107.
- 8 Shetty J, Adibi J. The Enron Email Dataset Database Schema and Brief Statistical Report. Technical Report, University of Southern California, 2004.
- 9 Newman MEJ. The structure and function of complex networks. SIAM Rev 2003, 45: 167–228.
- 10
Van De Bunt GG,
Van Duijn MAJ,
Snijders TAB.
Friendship Networks Through Time: An Actor-Oriented Dynamic Statistical Network Model.
Comput Math Org Theory
1999,
5: 167–192.
10.1023/A:1009683123448 Google Scholar
- 11 Ritter T. The networking company: antecedents for coping with relationships and networks effectively. Ind Mark Manage 1999, 28: 467–479.
- 12 Newman MEJ. The structure of scientific collaboration networks. Proc Natl Acad Sci USA 2001, 98: 404–409.
- 13 Truyen TT, Phung DQ, Venkatesh S. Preference networks: probabilistic models for recommendation systems. In: Sixth Australasian Conference on Data Mining and Analytics. Darlinghurst, Australia: Australian Computer Society, Inc. 2007. 70: 195–202.
- 14 Broder A, Kumar R, Maghoul F, Raghavan P, Rajagopalan S, Stata R, Tomkins A, Wiener J. Graph structure in the Web. Comput Netw 2000, 33: 309–320.
- 15 Alon U. Biological networks: the tinkerer as an engineer. Science 2003, 301: 1866–1867
- 16
Wasserman S,
Faust K.
Social Network Analysis: Methods and Applications.
Cambridge, UK: Cambridge University Press;
1994.
10.1017/CBO9780511815478 Google Scholar
- 17 Diestel R. Graph Theory. 3rd ed. Heidelberg: Spring-Verlag; 2005.
- 18 Granovetter M. The strength of weak ties. Am J Sociol 1973, 78: 1360–1380.
- 19 Granovetter M. Getting a Job: A Study of Contacts and Careers. Cambridge, MA: Harvard University Press; 1974.
- 20 Freeman LC. Centrality in social networks: conceptual clarification. Soc Netw 1979, 1: 215–239.
- 21 Brin S, Page L. Node centrality in weighted networks: generalizing degree and shortest paths. Soc Netw 2010, 32: 245–251.
- 22 Bonacich P. Power and centrality: a family of measures. Am J Sociol 1987, 92: 1170–1182.
- 23 Barabási AL, Bonabeau E. Scale-Free Networks. Sci Am 2003, 288: 60–69.
- 24 Barabási A-L, Albert R. Emergence of scaling in random networks. Science 1999, 286: 509–512.
- 25 Kossinets G, Watts DJ. Empirical analysis of an evolving social network. Science 2006, 311: 88–90.
- 26 Watts DJ, Strogatz SH. Collective dynamics of small-world networks. Nature 1998, 393: 440–442.
- 27
Leskovec J,
Kleinberg J,
Faloutsos C.
Graphs over time: densification laws, shrinking diameters and possible explanations. In:
Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining.
New York, NY: ACM
2005,
177–187.
10.1145/1081870.1081893 Google Scholar
- 28 Costa L, Oliveira O, Travieso G, Rodrigues F, Villas Boas P, Antiqueira L, Viana M, Rocha L. Analyzing and modeling real-world phenomena with complex networks: a survey of applications. Adv Phys 2011, 60: 329–412.
- 29 Kleinberg J. Authorative sources in a hyperlinked environment. J ACM 1999, 46: 604–632.
- 30 Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. Comput Netw ISDN Syst 1998, 30: 107–117.
- 31
Easley D,
Kleinberg J.
Networks, Crowds and Markets: Reasoning about a Highly Connected World.
Cambridge, UK: Cambridge University Press;
2010.
10.1017/CBO9780511761942 Google Scholar
- 32 Thelwall M. Interpreting social science link analysis research: a theoretical framework. J Am Soc Inf Sci Technol 2006, 57: 60–68.
- 33 Rapoport A. Spread of information through a population with socio-structural bias: I. Assumption of transitivity. Bull Math Biophys 1953, 15: 523–533.
- 34 Erdos P, Renyi A. On the evolution of random graphs. Publ Math Inst Hungarian Acad Sci 1960, 5: 17–61.
- 35 Fortunato S. Community detection in graphs. Phys Rep 2010, 486: 75–174.
- 36 Milgram S. The small world problem. Psychol Today 1967, 1: 61–67.
- 37 Price DDS. Networks of scientific papers. Science 1965, 149: 510–515.
- 38 Price DDS. A general theory of bibliometric and other cumulative advantage processes. J Am Soc Inf Sci 1976, 27: 292–306.
- 39 Newman MEJ. Mixing patterns in networks. Phys Rev E 2003, 67: 026126.
- 40 Gupta S, Anderson RM, May RM. Networks of sexual contacts: implications for the pattern of spread of HIV. AIDS 1989, 3: 807–817.
- 41 Newman MEJ, Girvan M. Finding and evaluating community structure in networks. Phys Rev E 2004, 69: 026113.
- 42 Girvan M, Newman MEJ. Community structure in social and biological networks. Proc Natl Acad Sci USA 2002, 99: 7821–7826.
- 43 Oliveira M, Gama J. MEC—monitoring clusters' transitions. In: Proceedings of the 5th Starting AI Researchers' Symposium. Lisbon, Portugal: IOS Press; 2010.
- 44 Palla G, Derényi I, Farkas I, Vicsek T. Uncovering the overlapping community structure of complex networks in nature and society. Nature 2005, 435: 814–818.
- 45
Pons P,
Latapy M.
Computing communities in large networks using random walks.
J Graph Algorithms Appl
2006,
10: 191–218.
10.7155/jgaa.00124 Google Scholar
- 46 R Development Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing; 2011. Available at: http://www.R-project.org. (Accessed January 14, 2012)
- 47 Newman MEJ. Modularity and community structure in networks. Proc Natl Acad Sci USA 2006, 103: 8577–8582.
- 48 Clauset A, Newman MEJ, Moore C. Finding community structure in very large networks. Phys Rev E 2004, 70: 066111.
- 49 Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech: Theory Exp 2008, 2008: P10008.
- 50 Guimera R, Amaral LAN. Functional cartography of complex metabolic networks. Nature 2005, 433: 895–900.
- 51 Borgatti SP, Everett MG, Freeman LC. Ucinet for Windows: software for social network analysis. Harv Anal Technol 2002, 2006.
- 52 Hagberg AA, Schult DA, Swart PJ. Exploring network structure, dynamics, and function using NetworkX. In: Seventh Python in Science Conference; 2008, 11–15.
- 53
Bastian M,
Heymann S,
Jacomy M.
Gephi: an open source software for exploring and manipulating networks. In: Third International AAAI Conference on Weblogs and Social Media.
Palo Alto, CA: Association for the Advancement of Artiïcial intelligence.
2009,
361–362.
10.1609/icwsm.v3i1.13937 Google Scholar
- 54
Smith M,
Shneiderman B,
Milic-Frayling N,
Rodrigues EM,
Barash V,
Dunne C,
Capone T,
Perer A,
Gleave E.
Analyzing (social media) networks with NodeXL. In: Fourth International Conference on Communities and Technologies. New York, NY: ACM
2009,
255–264.
10.1145/1556460.1556497 Google Scholar
- 55 Combe D, Largeron C, Egyed-Zsigmond E, Géry M. A comparative study of social network analysis tools. Soc Netw 2010, 2: 1–12.
- 56 Batagelj V, Mrvar A. Pajek—program for large network analysis. Connections 1998, 21: 47–57.