Metin Özetlemesi için Düğüm Merkezliklerine Dayalı Denetimsiz Bir Yaklaşım

Cengiz Hark; Taner Uçkan; Ebubekir Seyyarer; Ali Karcı

doi:10.17798/bitlisfen.568883

Research Article

Metin Özetlemesi için Düğüm Merkezliklerine Dayalı Denetimsiz Bir Yaklaşım

Year 2019, Volume: 8 Issue: 3, 1109 - 1118, 30.09.2019

Cengiz Hark , Taner Uçkan , Ebubekir Seyyarer Ali Karcı

https://doi.org/10.17798/bitlisfen.568883

Cited By: 4

Abstract

Cümle seçerek özetleme çalışmaları kapsamında birçok
farklı yaklaşım mevcuttur. Bu çalışmada tek dokümanlı çıkarıcı metin özetleme
için yeni ve denetimsiz bir süreç önerilmektedir. Çalışma kapsamında metin dokümanları
çizgelerle temsil edilmektedir. Yöntemimiz temel olarak metinleri temsil eden çizgeleri
kullanmakta ve cümlelere yönelik bir ağırlıklandırma önermektedir. Önerilen
sürecin farklı düğüm ağırlıklandırma yöntemlerini kullanarak önemli düğümleri
ve dolayısı ile önemli cümleleri belirlenmesi önerilen özetleme sisteminin
cümle puanlandırma aşamasını oluşturmaktadır. Son olarak bu çalışma kapsamında
metin özetleme amaçlı önerilen yaklaşımın, açık erişimli metinler ve bu
metinlere ait özetleri içeren Document Understanding Conference (DUC-2002) veri
seti üzerindeki performansı ROUGE değerlendirme metrikleri kullanılarak
hesaplanmıştır. Yapılan deneysel çalışmalar sonucunda önerilen özetleme
sisteminin geleneksel çizge tabanlı yaklaşımlar ile rekabet edebilir ölçüde
performans değerleri ortaya koyduğunu göstermektedir. Ayrıca sunulan bu basit
ve etkili yöntemin dilbilimsel bir süreç izlememesi oldukça önemlidir.

Keywords

Çıkarıcı Metin Özetleme, Düğüm Merkezliği, Cümle Sıralama, ROUGE Metrikleri

References

[1] Osman Durmaz, “Metin Sınıflandırmada Boyut Azaltmanın Etkisi ve Özellik Seçimi,” 2011.
[2] C. Hark, A. Seyyarer, T. Uçkan, and A. Karci, “Doǧal dil işleme yaklaşimlari ile yapisal olmayan dökümanlarin benzerliǧi,” in IDAP 2017 - International Artificial Intelligence and Data Processing Symposium, 2017.
[3] Ş. Canberk, G. , Sağıroğlu, Bilgi ve Bilgisayar Güvenliği : Casus Yazılımlar ve Korunma Yöntemleri. Ankara: Grafiker Yayıncılık, 2006.
[4] D. R. Radev, E. Hovy, and K. McKeown, “Introduction to the special issue on summarization,” Comput. Linguist., vol. 28, no. 4, pp. 399–408, 2002.
[5] G. Erkan and D. R. Radev, “Lexrank: Graph-based lexical centrality as salience in text summarization,” J. Artif. Intell. Res., vol. 22, pp. 457–479, 2004.
[6] A. Joshi, E. Fidalgo, E. Alegre, and L. Fernández-Robles, “SummCoder: An Unsupervised Framework for Extractive Text Summarization Based on Deep Auto-encoders,” Expert Syst. Appl., 2019.
[7] O. Kaynar, Y. Görmez, Y. E. Işık, and F. Demirkoparan, “Comparison of graph based document summarization method,” in 2017 International Conference on Computer Science and Engineering (UBMK), 2017, pp. 598–603.
[8] M. Kutlu, C. Cigir, and I. Cicekli, “Generic text summarization for Turkish,” Comput. J., vol. 53, no. 8, pp. 1315–1323, 2010.
[9] C. Hark, T. Uçkan, and A. Seyyarer, Abubekir Karci, “Metin Özetleme İçin Çizge Tabanlı Bir Öneri,” in IDAP 2018 - International Artificial Intelligence and Data Processing Symposium, 2018.
[10] R. Mihalcea, “Language independent extractive summarization,” Proc. ACL 2005 Interact. poster Demonstr. Sess. - ACL ’05, no. June, pp. 49–52, 2005.
[11] K. Sarkar, K. Saraf, and A. Ghosh, “Improving graph based multidocument text summarization using an enhanced sentence similarity measure,” 2015 IEEE 2nd Int. Conf. Recent Trends Inf. Syst. ReTIS 2015 - Proc., pp. 359–365, 2015.
[12] R. Mihalcea and P. Tarau, “TextRank: Bringing Order into Texts,” in Proceedings of the ACL 2004 on Interactive poster and demonstration sessions -, 2004, vol. 85, p. 20–es.
[13] D. Parveen, H.-M. Ramsl, and M. Strube, “Topical coherence for graph-based extractive summarization,” in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015, pp. 1949–1954.
[14] C. Hark, A. Seyyarer, T. Uçkan, and A. Karci, “Doǧal dil işleme yaklaşimlari ile yapisal olmayan dökümanlarin benzerliǧi,” in IDAP 2017 - International Artificial Intelligence and Data Processing Symposium, 2017, pp. 1–6.
[15] R. Mihalcea and P. Tarau, “A Language Independent Algorithm for Single and Multiple Document Summarization,” Proc. IJCNLP 2005, 2nd Int. Join Conf. Nat. Lang. Process., pp. 19–24, 2005.
[16] A. Karci, “Çizge Algoritmaları ve Çizge Bölmeleme.pdf,” 1998.
[17] U. Von Luxburg, “A Tutorial on Spectral Clustering,” 2007.
[18] A. Bavelas, “A Mathematical Model for Group Structures,” Human Organization, vol. 7, no. 3. pp. 16–30, 1948.
[19] D. R. Radev, “LexRank : Graph-based Lexical Centrality as Salience in,” Artif. Intell., vol. 22, pp. 457–479, 2004.
[20] M. Kutlu, C. Ciǧir, and I. Cicekli, “Generic text summarization for Turkish,” in Computer Journal, 2010, vol. 53, no. 8, pp. 1315–1323.
[21] C. Hark, T. Uçkan, E. Seyyarer, and A. Karci, “Graph-Based Suggestion For Text Summarization,” 2018 Int. Conf. Artif. Intell. Data Process. IDAP 2018, 2019.
[22] F. Boudin et al., “A Comparison of Centrality Measures for Graph-Based Keyphrase Extraction To cite this version : HAL Id : hal-00850187 A Comparison of Centrality Measures for Graph-Based Keyphrase Extraction,” 2013.
[23] F. Boudin, “A Comparison of Centrality Measures for Graph-Based Keyphrase Extraction,” in International Joint Conference on Natural Language Processing (IJCNLP), 2013, pp. 834–838.
[24] M. McPherson, L. Smith-Lovin, and J. M. Cook, “Birds of a Feather: Homophily in Social Networks,” Annu. Rev. Sociol., vol. 27, no. 1, pp. 415–444, Aug. 2001.
[25] B. N. Analysis, “Centrality and Hubs,” no. 1979, 2016.
[26] Alex Kosorukoff, Social Network Analysis Theory and Applications. Passmore, D. L, 2011, 2011.
[27] C.-Y. Lin, “Rouge: A package for automatic evaluation of summaries,” Text Summ. Branches Out, 2004.

Year 2019, Volume: 8 Issue: 3, 1109 - 1118, 30.09.2019

Cengiz Hark , Taner Uçkan , Ebubekir Seyyarer Ali Karcı

https://doi.org/10.17798/bitlisfen.568883

Cited By: 4

Abstract

References

[1] Osman Durmaz, “Metin Sınıflandırmada Boyut Azaltmanın Etkisi ve Özellik Seçimi,” 2011.
[2] C. Hark, A. Seyyarer, T. Uçkan, and A. Karci, “Doǧal dil işleme yaklaşimlari ile yapisal olmayan dökümanlarin benzerliǧi,” in IDAP 2017 - International Artificial Intelligence and Data Processing Symposium, 2017.
[3] Ş. Canberk, G. , Sağıroğlu, Bilgi ve Bilgisayar Güvenliği : Casus Yazılımlar ve Korunma Yöntemleri. Ankara: Grafiker Yayıncılık, 2006.
[4] D. R. Radev, E. Hovy, and K. McKeown, “Introduction to the special issue on summarization,” Comput. Linguist., vol. 28, no. 4, pp. 399–408, 2002.
[5] G. Erkan and D. R. Radev, “Lexrank: Graph-based lexical centrality as salience in text summarization,” J. Artif. Intell. Res., vol. 22, pp. 457–479, 2004.
[6] A. Joshi, E. Fidalgo, E. Alegre, and L. Fernández-Robles, “SummCoder: An Unsupervised Framework for Extractive Text Summarization Based on Deep Auto-encoders,” Expert Syst. Appl., 2019.
[7] O. Kaynar, Y. Görmez, Y. E. Işık, and F. Demirkoparan, “Comparison of graph based document summarization method,” in 2017 International Conference on Computer Science and Engineering (UBMK), 2017, pp. 598–603.
[8] M. Kutlu, C. Cigir, and I. Cicekli, “Generic text summarization for Turkish,” Comput. J., vol. 53, no. 8, pp. 1315–1323, 2010.
[9] C. Hark, T. Uçkan, and A. Seyyarer, Abubekir Karci, “Metin Özetleme İçin Çizge Tabanlı Bir Öneri,” in IDAP 2018 - International Artificial Intelligence and Data Processing Symposium, 2018.
[10] R. Mihalcea, “Language independent extractive summarization,” Proc. ACL 2005 Interact. poster Demonstr. Sess. - ACL ’05, no. June, pp. 49–52, 2005.
[11] K. Sarkar, K. Saraf, and A. Ghosh, “Improving graph based multidocument text summarization using an enhanced sentence similarity measure,” 2015 IEEE 2nd Int. Conf. Recent Trends Inf. Syst. ReTIS 2015 - Proc., pp. 359–365, 2015.
[12] R. Mihalcea and P. Tarau, “TextRank: Bringing Order into Texts,” in Proceedings of the ACL 2004 on Interactive poster and demonstration sessions -, 2004, vol. 85, p. 20–es.
[13] D. Parveen, H.-M. Ramsl, and M. Strube, “Topical coherence for graph-based extractive summarization,” in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015, pp. 1949–1954.
[14] C. Hark, A. Seyyarer, T. Uçkan, and A. Karci, “Doǧal dil işleme yaklaşimlari ile yapisal olmayan dökümanlarin benzerliǧi,” in IDAP 2017 - International Artificial Intelligence and Data Processing Symposium, 2017, pp. 1–6.
[15] R. Mihalcea and P. Tarau, “A Language Independent Algorithm for Single and Multiple Document Summarization,” Proc. IJCNLP 2005, 2nd Int. Join Conf. Nat. Lang. Process., pp. 19–24, 2005.
[16] A. Karci, “Çizge Algoritmaları ve Çizge Bölmeleme.pdf,” 1998.
[17] U. Von Luxburg, “A Tutorial on Spectral Clustering,” 2007.
[18] A. Bavelas, “A Mathematical Model for Group Structures,” Human Organization, vol. 7, no. 3. pp. 16–30, 1948.
[19] D. R. Radev, “LexRank : Graph-based Lexical Centrality as Salience in,” Artif. Intell., vol. 22, pp. 457–479, 2004.
[20] M. Kutlu, C. Ciǧir, and I. Cicekli, “Generic text summarization for Turkish,” in Computer Journal, 2010, vol. 53, no. 8, pp. 1315–1323.
[21] C. Hark, T. Uçkan, E. Seyyarer, and A. Karci, “Graph-Based Suggestion For Text Summarization,” 2018 Int. Conf. Artif. Intell. Data Process. IDAP 2018, 2019.
[22] F. Boudin et al., “A Comparison of Centrality Measures for Graph-Based Keyphrase Extraction To cite this version : HAL Id : hal-00850187 A Comparison of Centrality Measures for Graph-Based Keyphrase Extraction,” 2013.
[23] F. Boudin, “A Comparison of Centrality Measures for Graph-Based Keyphrase Extraction,” in International Joint Conference on Natural Language Processing (IJCNLP), 2013, pp. 834–838.
[24] M. McPherson, L. Smith-Lovin, and J. M. Cook, “Birds of a Feather: Homophily in Social Networks,” Annu. Rev. Sociol., vol. 27, no. 1, pp. 415–444, Aug. 2001.
[25] B. N. Analysis, “Centrality and Hubs,” no. 1979, 2016.
[26] Alex Kosorukoff, Social Network Analysis Theory and Applications. Passmore, D. L, 2011, 2011.
[27] C.-Y. Lin, “Rouge: A package for automatic evaluation of summaries,” Text Summ. Branches Out, 2004.

There are 27 citations in total.

Details

Primary Language	Turkish
Journal Section	Araştırma Makalesi
Authors	Cengiz Hark 0000-0002-5190-3504 Taner Uçkan 0000-0001-5385-6775 Ebubekir Seyyarer This is me 0000-0002-8981-0266 Ali Karcı 0000-0002-8489-8617
Publication Date	September 30, 2019
Submission Date	May 22, 2019
Acceptance Date	August 8, 2019
Published in Issue	Year 2019 Volume: 8 Issue: 3

Cite

IEEE	C. Hark, T. Uçkan, E. Seyyarer, and A. Karcı, “Metin Özetlemesi için Düğüm Merkezliklerine Dayalı Denetimsiz Bir Yaklaşım”, Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, vol. 8, no. 3, pp. 1109–1118, 2019, doi: 10.17798/bitlisfen.568883.