Portal de Programas de Pós-Graduação (UFBA)

	Thèses
1	ISRAEL NASCIMENTO MATOS Temporal Data Reconstruction in the Internet of Things using Neural Networks: A Comparison with Classical Interpolation Methods Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : CASSIO VINICIUS SERAFIM PRAZERES CLEBER JORGE LIRA DE SANTANA RICARDO ARAUJO RIOS Data: 31 janv. 2024 Afficher le Résumé Currently, we live in the era of Big Data, in which cutting-edge scientific instruments, networks, social media, as well as Internet of Things (IoT) devices generate and transmit enormous amounts of data daily through the internet. In the context of IoT, a new paradigm emerges to mitigate the overload in sending such data to the cloud. Thus, in contrast to cloud computing, fog computing is increasingly being adopted, with most of the data processing and storage being performed at the network edge by gateways. In this scenario, in environments where sensors collect information every second, for example, instead of sending each measurement to the cloud, the gateway adopts a data aggregation strategy before transmitting them. However, since various applications require data as originally measured by the devices, and considering that the cloud has greater capacity to serve a larger number of clients compared to gateways, the ideal scenario implies the cloud's ability to reconstruct the data history and provide it to these applications. This work proposes to investigate the capacity of the \textit{perceptron} neural network to reconstruct sensor data history through interpolation from the aggregated values sent to the cloud. Since \ac{IoT} devices are resource-constrained, the study also compares the perceptron neural network developed with classical interpolation algorithms to evaluate the efficiency and effectiveness between the methods. To achieve this goal, two neural networks were developed: the first at the network edge, which learns the behavior of sensor data over time, while the second neural network located in the cloud performs interpolation based on the aggregated data sent and the model generated by the edge neural network. The results obtained indicate that even a relatively simple neural network architecture like the perceptron can perform \ac{IoT} data interpolation with a considerable margin of accuracy. When compared to classical interpolation algorithms, considering criteria such as time and Mean Squared Error (MSE), the perceptron network proves to be statistically as effective as classical methods but less efficient in interpolation time. In summary, this study contributes to understanding the possibilities and limitations of applying neural networks in the temporal reconstruction of \ac{IoT} data and highlights the need to evaluate the same comparison with the use of other neural network architectures, such as Long Short-Term Memory (LSTM) neural networks.
2	EDER PEREIRA DOS SANTOS Technical debt in agile projects: Investigating the point of view of project management professionals on Stack Exchange Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : EDUARDO MANUEL DE FREITAS JORGE JOSÉ AMÂNCIO MACEDO SANTOS MANOEL GOMES DE MENDONCA NETO Data: 16 févr. 2024 Afficher le Résumé Technical Debt (TD) can provide short-term benefits for software projects, but its presence can lead to decreased product quality. Although recent literature has proposed several approaches to identify and manage Agile Software Development Technical Debt (TD-ASD), most of them look at the point of view of software developers. However, there is a gap in knowledge about how project management professionals perceive and deal with TD-ASD items. The objective of this study is to investigate how project management practitioners discuss and face TD items in their professional activities. To achieve this objective, an analysis of 108 discussions related to TD-ASD was carried out on the Stack Exchange Project Management (SEPM) question and answer website. These discussions totaled 547 posts and 882 text comments, providing a good overview of the topic. The analysis was conducted using quantitative and qualitative methods, employing open coding to identify types of TD and indicators. The study identified 74 indicators used to recognize TD-ASD items, 126 TD management practices and 9 types of TD. The most frequent discussions in SEPM about technical debt focus on two types of process and people, differing from previous studies with developers, which tended to focus on code and design debt items. Furthermore, the Product Owner and Development Team are the roles most involved with TD-ASD. Sprint Backlog and Sprint Planning are the agile elements most affected by TD-ASD. Project management professionals who use SEMP adopt a different perspective than developers when analyzing TD. TD management indicators and practices were structured in a Sankey diagram, providing a valuable tool to guide ID management and guide future research on the subject.
	Thèses
1	ALEX SILVA SANTOS Resource Allocation Policies in Disaster Situations Leader : GUSTAVO BITTENCOURT FIGUEIREDO MEMBRES DE LA BANQUE : BRUNO PEREIRA DOS SANTOS GUSTAVO BITTENCOURT FIGUEIREDO HELDER MAY NUNES DA SILVA OLIVEIRA MAYCON LEONE MACIEL PEIXOTO RODRIGO IZIDORO TININI Data: 29 janv. 2024 Afficher le Résumé Elastic optical networks (EON) allow the transmission of large volumes of data through multiple channels with different spectrum granularities. However, the resource availability of EON networks can be severely impacted by congestion, natural disasters or human- made attacks. To deal with these resource constraints, the network operator needs to make choices about which lightpath will be served by the network. The Decision-making process must take into account requirements of the different Classes of Service (CoS), as well as the possibility of service degradation, providing lower bandwidth than reques- ted or adjusting the time instant of establishing the ligthpath, to adapt the network provisioning when optical resources are insufficient. In this doctoral thesis, problems related to the scarcity of resources and survival in Elastic Optical Networks are addres- sed. Strategies that were at least as efficient as existing strategies in the literature were developed and validated: an algorithm that considers a proportional Quality of Service (QoS) model and information from higher layers to decide which lightpath to be degra- ded in provisioning process, aiming to reduce the impact the unavailability of resources in applications sensitive to delays and bandwidth; an algorithm that uses a hybrid multi- criteria decision-making technique to select requests to be provisioned; an algorithm for selecting lightpaths to be restored after disasters; an algorithm that aims to reduce spec- trum fragmentation in Elastic Optical Networks by allocating new requests on paths that produce less spectrum fragmentation after allocation. Some of the proposed algorithms have a multi-criteria decision approach that considers CoS, bandwidth, number of hops and completion time. Furthermore, service degradation is also considered for lightpaths that cannot be restored to full bandwidth.
2	GLAUCYA CARREIRO BOECHAT An Investigation into Sentiment Analysis and Categorization of Reopened GitHub Issues Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : EMMANUEL SÁVIO SILVA FREIRE GLAUCO DE FIGUEIREDO CARNEIRO MANOEL GOMES DE MENDONCA NETO MÁRIO ANDRÉ DE FREITAS FARIAS RODRIGO ROCHA GOMES E SOUZA Data: 8 mars 2024 Afficher le Résumé The reopening of issues represents a significant challenge in software development and maintenance, increasing the costs and complexity of the efforts involved. This occurrence often indicates unresolved or misunderstood issues in communication between project collaborators and users on platforms like GitHub. This thesis aims to deepen the understanding of issue reopenings in open-source GitHub software repositories, considering historical data, issue categorization, and sentiment analysis of developers involved in the associated discussions. Our methodology involved using the SentiStrength-SE tool, adapted for lexicons in the field of Software Engineering, to calculate polarity and sentiment in the texts of discussions related to issues. Subsequently, we developed an automated issue categorization model, classifying them into specific categories such as configuration, database-related, program anomaly, performance, functional, GUI-related, info, permission/deprecation, network, security, and testing. This approach enables more effective prioritization in resolving reopened issues, directing resources more accurately. Finally, we characterized issue reopenings based on the sentiments of developers expressed in discussions within each issue category. The results revealed that sentiment analysis, when applied in isolation, did not prove to be an effective metric for identifying issue reopenings. However, we identified that certain types of issue categories are more prone to problems related to reopening. This underscores the importance of combining issue categorization with sentiment analysis for a more efficient approach to preventing and addressing issue reopenings in open-source software repositories.
3	ELIDIANE PEREIRA DO NASCIMENTO FRB-BlinGui: A Fuzzy Rule-Based Model for Predicting Collision Risks with Obstacles in Assistance to Visually Impaired People. Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : KAMILA RIOS DA HORA RODRIGUES MANOEL CARVALHO MARQUES NETO RICARDO ARAUJO RIOS TATIANE NOGUEIRA RIOS VANINHA VIEIRA DOS SANTOS Data: 2 avr. 2024 Afficher le Résumé It is possible to observe in the literature that several approaches are developed for safe navigation and orientation of people with visual impairments, aiming to avoid collisions with obstacles. These approaches often use ultrasonic or infrared sensors, mobile applications with cameras for computer vision, or wearable devices. However, uncertainties and inaccuracies resulting from the dynamics of the environment are often overlooked, making navigation and orientation for people with visual impairments insecure and subject to various types of collisions with different types of obstacles. Faced with this situation, there is a need to develop approaches that consider the degree of collision risk presented by each obstacle, the dynamics in the displacement between the individual and the obstacle, and its location at high or low points of complex perception. In this context, the fuzzy set theory (FST) presents itself as an essential tool for dealing with the imprecision and uncertainties existing in the environment since the use of systems based on FST has the advantage of intuitively interpreting user behavior. Thus, this thesis presents an innovative obstacle detection and collision risk prediction model called FRB-BlinGui in dynamic scenarios. The proposed model was tested on a wearable device to detect obstacles and prevent collisions in real time, offering gradual alerts about risks. The results show its effectiveness in providing alerts, promising to improve the safety and mobility of people with visual impairments.

Thèses

ISRAEL NASCIMENTO MATOS
Temporal Data Reconstruction in the Internet of Things using Neural Networks: A Comparison with Classical Interpolation Methods
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
CASSIO VINICIUS SERAFIM PRAZERES
CLEBER JORGE LIRA DE SANTANA
RICARDO ARAUJO RIOS
Data: 31 janv. 2024

Afficher le Résumé
Currently, we live in the era of Big Data, in which cutting-edge scientific instruments, networks, social media, as well as Internet of Things (IoT) devices generate and transmit enormous amounts of data daily through the internet. In the context of IoT, a new paradigm emerges to mitigate the overload in sending such data to the cloud. Thus, in contrast to cloud computing, fog computing is increasingly being adopted, with most of the data processing and storage being performed at the network edge by gateways. In this scenario, in environments where sensors collect information every second, for example, instead of sending each measurement to the cloud, the gateway adopts a data aggregation strategy before transmitting them. However, since various applications require data as originally measured by the devices, and considering that the cloud has greater capacity to serve a larger number of clients compared to gateways, the ideal scenario implies the cloud's ability to reconstruct the data history and provide it to these applications. This work proposes to investigate the capacity of the \textit{perceptron} neural network to reconstruct sensor data history through interpolation from the aggregated values sent to the cloud. Since \ac{IoT} devices are resource-constrained, the study also compares the perceptron neural network developed with classical interpolation algorithms to evaluate the efficiency and effectiveness between the methods. To achieve this goal, two neural networks were developed: the first at the network edge, which learns the behavior of sensor data over time, while the second neural network located in the cloud performs interpolation based on the aggregated data sent and the model generated by the edge neural network. The results obtained indicate that even a relatively simple neural network architecture like the perceptron can perform \ac{IoT} data interpolation with a considerable margin of accuracy. When compared to classical interpolation algorithms, considering criteria such as time and Mean Squared Error (MSE), the perceptron network proves to be statistically as effective as classical methods but less efficient in interpolation time. In summary, this study contributes to understanding the possibilities and limitations of applying neural networks in the temporal reconstruction of \ac{IoT} data and highlights the need to evaluate the same comparison with the use of other neural network architectures, such as Long Short-Term Memory (LSTM) neural networks.

EDER PEREIRA DOS SANTOS
Technical debt in agile projects: Investigating the point of view of project management professionals on Stack Exchange
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
EDUARDO MANUEL DE FREITAS JORGE
JOSÉ AMÂNCIO MACEDO SANTOS
MANOEL GOMES DE MENDONCA NETO
Data: 16 févr. 2024

Afficher le Résumé
Technical Debt (TD) can provide short-term benefits for software projects, but its presence can lead to decreased product quality. Although recent literature has proposed several approaches to identify and manage Agile Software Development Technical Debt (TD-ASD), most of them look at the point of view of software developers. However, there is a gap in knowledge about how project management professionals perceive and deal with TD-ASD items. The objective of this study is to investigate how project management practitioners discuss and face TD items in their professional activities. To achieve this objective, an analysis of 108 discussions related to TD-ASD was carried out on the Stack Exchange Project Management (SEPM) question and answer website. These discussions totaled 547 posts and 882 text comments, providing a good overview of the topic. The analysis was conducted using quantitative and qualitative methods, employing open coding to identify types of TD and indicators. The study identified 74 indicators used to recognize TD-ASD items, 126 TD management practices and 9 types of TD. The most frequent discussions in SEPM about technical debt focus on two types of process and people, differing from previous studies with developers, which tended to focus on code and design debt items. Furthermore, the Product Owner and Development Team are the roles most involved with TD-ASD. Sprint Backlog and Sprint Planning are the agile elements most affected by TD-ASD. Project management professionals who use SEMP adopt a different perspective than developers when analyzing TD. TD management indicators and practices were structured in a Sankey diagram, providing a valuable tool to guide ID management and guide future research on the subject.

Thèses

ALEX SILVA SANTOS
Resource Allocation Policies in Disaster Situations
Leader : GUSTAVO BITTENCOURT FIGUEIREDO
MEMBRES DE LA BANQUE :
BRUNO PEREIRA DOS SANTOS
GUSTAVO BITTENCOURT FIGUEIREDO
HELDER MAY NUNES DA SILVA OLIVEIRA
MAYCON LEONE MACIEL PEIXOTO
RODRIGO IZIDORO TININI
Data: 29 janv. 2024

Afficher le Résumé
Elastic optical networks (EON) allow the transmission of large volumes of data through multiple channels with different spectrum granularities. However, the resource availability of EON networks can be severely impacted by congestion, natural disasters or human- made attacks. To deal with these resource constraints, the network operator needs to make choices about which lightpath will be served by the network. The Decision-making process must take into account requirements of the different Classes of Service (CoS), as well as the possibility of service degradation, providing lower bandwidth than reques- ted or adjusting the time instant of establishing the ligthpath, to adapt the network provisioning when optical resources are insufficient. In this doctoral thesis, problems related to the scarcity of resources and survival in Elastic Optical Networks are addres- sed. Strategies that were at least as efficient as existing strategies in the literature were developed and validated: an algorithm that considers a proportional Quality of Service (QoS) model and information from higher layers to decide which lightpath to be degra- ded in provisioning process, aiming to reduce the impact the unavailability of resources in applications sensitive to delays and bandwidth; an algorithm that uses a hybrid multi- criteria decision-making technique to select requests to be provisioned; an algorithm for selecting lightpaths to be restored after disasters; an algorithm that aims to reduce spec- trum fragmentation in Elastic Optical Networks by allocating new requests on paths that produce less spectrum fragmentation after allocation. Some of the proposed algorithms have a multi-criteria decision approach that considers CoS, bandwidth, number of hops and completion time. Furthermore, service degradation is also considered for lightpaths that cannot be restored to full bandwidth.

GLAUCYA CARREIRO BOECHAT
An Investigation into Sentiment Analysis and Categorization of Reopened GitHub Issues
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
EMMANUEL SÁVIO SILVA FREIRE
GLAUCO DE FIGUEIREDO CARNEIRO
MANOEL GOMES DE MENDONCA NETO
MÁRIO ANDRÉ DE FREITAS FARIAS
RODRIGO ROCHA GOMES E SOUZA
Data: 8 mars 2024

Afficher le Résumé
The reopening of issues represents a significant challenge in software development and maintenance, increasing the costs and complexity of the efforts involved. This occurrence often indicates unresolved or misunderstood issues in communication between project collaborators and users on platforms like GitHub.
This thesis aims to deepen the understanding of issue reopenings in open-source GitHub software repositories, considering historical data, issue categorization, and sentiment analysis of developers involved in the associated discussions.
Our methodology involved using the SentiStrength-SE tool, adapted for lexicons in the field of Software Engineering, to calculate polarity and sentiment in the texts of discussions related to issues. Subsequently, we developed an automated issue categorization model, classifying them into specific categories such as configuration, database-related, program anomaly, performance, functional, GUI-related, info, permission/deprecation, network, security, and testing. This approach enables more effective prioritization in resolving reopened issues, directing resources more accurately. Finally, we characterized issue reopenings based on the sentiments of developers expressed in discussions within each issue category.
The results revealed that sentiment analysis, when applied in isolation, did not prove to be an effective metric for identifying issue reopenings. However, we identified that certain types of issue categories are more prone to problems related to reopening. This underscores the importance of combining issue categorization with sentiment analysis for a more efficient approach to preventing and addressing issue reopenings in open-source software repositories.

ELIDIANE PEREIRA DO NASCIMENTO
FRB-BlinGui: A Fuzzy Rule-Based Model for Predicting Collision Risks with Obstacles in Assistance to Visually Impaired People.
Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
KAMILA RIOS DA HORA RODRIGUES
MANOEL CARVALHO MARQUES NETO
RICARDO ARAUJO RIOS
TATIANE NOGUEIRA RIOS
VANINHA VIEIRA DOS SANTOS
Data: 2 avr. 2024

Afficher le Résumé
It is possible to observe in the literature that several approaches are developed for safe navigation and orientation of people with visual impairments, aiming to avoid collisions with obstacles. These approaches often use ultrasonic or infrared sensors, mobile applications with cameras for computer vision, or wearable devices. However, uncertainties and inaccuracies resulting from the dynamics of the environment are often overlooked, making navigation and orientation for people with visual impairments insecure and subject to various types of collisions with different types of obstacles. Faced with this situation, there is a need to develop approaches that consider the degree of collision risk presented by each obstacle, the dynamics in the displacement between the individual and the obstacle, and its location at high or low points of complex perception. In this context, the fuzzy set theory (FST) presents itself as an essential tool for dealing with the imprecision and uncertainties existing in the environment since the use of systems based on FST has the advantage of intuitively interpreting user behavior. Thus, this thesis presents an innovative obstacle detection and collision risk prediction model called FRB-BlinGui in dynamic scenarios. The proposed model was tested on a wearable device to detect obstacles and prevent collisions in real time, offering gradual alerts about risks. The results show its effectiveness in providing alerts, promising to improve the safety and mobility of people with visual impairments.

	Thèses
1	PEDRO HENRIQUE BATISTA DIAMANTINO Visual Analytics to Support the Visual Exploration of Technical Debts in Software Repositories Leader : DANILO BARBOSA COIMBRA MEMBRES DE LA BANQUE : DANILO BARBOSA COIMBRA MANOEL GOMES DE MENDONCA NETO DANIEL GARCIA FEITOSA Data: 26 janv. 2023 Afficher le Résumé The increasing access to digital technologies leads to a large-scale data production and consumption worldwide. Consequently, there is a high demand for computing analysis techniques that help users to obtain a better understanding of data. An inherent challenge in this context is to analyze large volume of complex and heterogeneous datasets, such as conceived by software repositories. Areas such as Software Visualization and Visual Software Analytics have become increasingly used to support developers in software comprehension by providing a visual panorama of the entire software development process. Those graphical representations enable useful information extraction, especially for system maintenance when analyzing technical debts. Although repositories are mostly composed of multidimensional datasets, there is a lack of works that apply multidimensional visualizations to identify and track distinct groups of technical debts. In this sense, the proposal presents the Visual Debts Analytics tool, an approach based on Visual Analytics consisting of multiple coordinated multidimensional visualizations for the analysis of different groups of technical debts in software repositories. In particular, the proposed technique aims to identify and track the correlation, structure, evolution and similarities of technical debts in open-source software repositories. To evaluate our approach we presented case studies in two different repositories that demonstrate the a better comprehension in the project life cycle, enabling insights into the project quality.
2	ERINALDO SANTOS OLIVEIRA An Assessment Model to evaluate the efficacy of using FLOSS projects in Software Engineering Education: A Proposal Based on Content Analysis Leader : CHRISTINA VON FLACH GARCIA CHAVEZ MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ LAIS DO NASCIMENTO SALVADOR ROBERTO ALMEIDA BITTENCOURT Data: 31 mars 2023 Afficher le Résumé Context: The software industry's growing demand for qualified professionals highlights the importance and need for valuing Education in Software Engineering in the training of graduates from higher education courses in the area of Computing. Educators are continually challenged to keep up with emerging trends in the area, to identify appropriate techniques and incorporate them into their software engineering disciplines, in order to promote the development of skills and competences that guarantee graduates of courses in the area of Computing an education that reconciles the learning of concepts, processes, techniques and tools provided in the academic environment with the needs and best practices of the software industry. Free, Libre and Open Source software projects provide an opportunity for students and professors of undergraduate courses to work on the content of the software engineering discipline in realistic projects, which bring a real-world experience close to the demands of the world of work. Problem: One of the challenges associated with the use of FLOSS in the EES is how the educator can evaluate the chosen project as a teaching and learning resource, as well as its effectiveness for the pedagogical objective proposed by the activity or discipline. It is relevant to use evaluation methods or models that allow the professor to measure and evaluate the results from his choice, according to the learning requirements necessary after the conclusion of the activity or discipline. Objective: This work proposes the creation of a model to evaluate the effectiveness of using FLOSS projects in the teaching of Software Engineering. The proposed model will be based on the mapping of evaluation criteria that will be structured in order to facilitate the analysis of the scope of the pedagogical objectives planned for the curricular component. Research Methods: Through Content Analysis, the aim is to identify the most relevant technical and sociotechnical skills and abilities for the Software Engineering discipline, according to the main curricular guidelines in Brazil and abroad, and to investigate aspects of interference, both technical and pedagogical, of the use of FLOSS projects in teaching the discipline of software engineering. Based on this information, it is intended to define the evaluation criteria and, through a theoretical dialogue, identify their relationships to propose a Model for evaluating the effectiveness of the use of FLOSS projects in the teaching of Software Engineering. Expected Contributions: The use of methods to evaluate FLOSS projects in the EES can stimulate the insertion of educators in the FLOSS ecosystem, in addition to offering new forms of evaluation to those who recognize and adopt such projects in the classroom, a solution to the real development of skills and abilities associated with theoretical and practical aspects of software engineering, important in the training of professionals from the various areas of the Computing area.
3	Levy Marlon Souza Santiago Gifflar: a framework for generating smart contracts code on the fly Leader : FABIOLA GONCALVES PEREIRA GREVE MEMBRES DE LA BANQUE : FABIOLA GONCALVES PEREIRA GREVE RODRIGO ROCHA GOMES E SOUZA ALEX BORGES VIEIRA Data: 18 avr. 2023 Afficher le Résumé Blockchain is a disruptive technology that offers a secure decentralized network, and allows for direct transactions between distributed entities, without the need for a trusted third party. Smart contracts (SmC) are executable codes hosted on the blockchain and enable the implementation of several decentralized applications, in different domains. However, due to the complexity of SmC construction, it is necessary to seek ways to facilitate and make this innovative development process robust. Many projects in the literature proposed solutions based on automatic code generation from high-level modeling, such as diagrams. However, this approach brings with it a constant need for a person to model the contracts. This paper presents Gifflar, a framework for generating SmC code on the fly, in such a way that it allows a system to write, compile and implement blockchain SmCs with an application still running, which reduces the frequent precision of the SmC developer and allows to further automate the SmC development process. One of the main tools is a component-structured library that implements design patterns to better define the responsibility of each of these parts. The Gifflar library offers an API that allows the developer to model, generate code and manage SmCs through methods that abstract to a certain level the SmC code using the JSON as a model to the code generation. In addition, project evaluations were carried out to validate the project: (i) a usability evaluation of the Gifflar library and (ii) a conceptual evaluation of Gifflar's application in other projects. As far as we know, this framework is one of the first to allow dynamically generating SmCs on the fly, thus contributing to the state of the art by approaching a new paradigm, where systems can act as smart contract developers.
4	Ibirisol Fontes Ferreira FAIRNESS-ORIENTED MULTICAST ROUTING FOR DISTRIBUTED INTERACTIVE APPLICATIONS WITH COMPUTING ON THE NETWORK EDGE Leader : GUSTAVO BITTENCOURT FIGUEIREDO MEMBRES DE LA BANQUE : GUSTAVO BITTENCOURT FIGUEIREDO TIAGO DE OLIVEIRA JANUARIO MAYCON LEONE MACIEL PEIXOTO JOSE AUGUSTO SURUAGY MONTEIRO Data: 27 avr. 2023 Afficher le Résumé Facing the challenges of network services on the edge, such as routing considering quality of service, is a crucial issue for the current research efforts in the area of network. Multicast routing is an essential technique for delivering routing services with a high level of optimization from the perspective of operators and application providers when there are user groups. Furthermore, the routing that considers latency constraints has obstacles to merging different conditions in the solution, mainly when there is a fairness perspective to be accomplished in the users' communication. This work aims to deal with this fairness requirement in multicasting, showing efficient solutions to increase the equilibrium in the routing by choosing better path options for fair group interaction.
5	Caiza Almeida Fortunato ATAM-4SAS: A Method for Assessing Quality Attributes in Self-Adaptive Systems Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : LUCIANA LOURDES SILVA CARLA ILANE MOREIRA BEZERRA IVAN DO CARMO MACHADO Data: 4 mai 2023 Afficher le Résumé Self-adaptive Systems (SAS) can monitor themselves and their context. They can detect changes and react to unexpected conditions with minimal human supervision during their execution. One of the challenges behind developing SAS is dealing with the decision-making process while analyzing the tradeoff points among the multiple quality attributes (QA). In Software Engineering, a widely accepted method of evaluating QA goals in software projects is the Architecture Tradeoff Analysis Method (ATAM). However, despite its importance and wide acceptance, there are few reports of empirical studies on analyzing QA tradeoffs in SAS. In this sense, the present investigation proposes an adapted version of ATAM called ATAM-4SAS to deal with the particularities of SAS. To achieve the research goal, we employed the UPPAAL SMC (statistical verification model) to analyze a set of QA. To evaluate the feasibility of the proposed method, we performed an empirical study on the execution of the ATAM-4SAS in a SAS developed according to the MAPE-K model. This model encompasses the Monitoring, Analysis, Planning, and Execution phases. Such steps share a knowledge base (K), which is fundamental in supporting decision-making. We complemented the empirical evaluation by conducting a focus group, which sought to assess the perceived ease of use and the perceived usefulness of the ATAM-4SAS to support the strategic choice of QA in a SAS. As a result, we observed that most participants agreed that ATAM-4SAS provides adequate support for the strategic choice of QA in SAS.
6	RENATA DE SOUSA SANTOS Process Smell 2.0: A catalogue of Bad Smells for software process using BPMN. Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : ANA PATRICIA FONTES MAGALHÃES MASCARENHAS CLAUDIO NOGUEIRA SANT ANNA RITA SUZANA PITANGUEIRA MACIEL Data: 4 mai 2023 Afficher le Résumé An explicitly specified software process improves the quality of the generated product. The specification directs the path and steps to be followed so that the developed software reaches the quality desired by the project. The software process evolves along with the needs of the organization and the professionals who use it. It needs to be continuously monitored and evaluated to maintain its quality. To evaluate a software process, simula- tions or data obtained after execution are commonly used. However, in both cases, it is difficult to predict problems in the software process in a given development project before one or more executions. Therefore, Process Smells were proposed to enable problems in the design of software processes to be identified before they were executed. The presence of a Process Smell can negatively impact the quality of the Process, thus affecting the quality of the software product. Initially, Process Smells were specified using the Soft- ware Process Specification Metamodel (SPEM) notation. Although SPEM is a language for the domain of process modeling, Business Process Model and Notation (BPMN) has been widely used for processes in general, with high popularity both in the market and academia. In this sense, the specification of a new catalog can allow Process smells to be understood independently of the language domain. Still, in the context of process im- provement, just like SPEM, although BPMN is used to improve the understanding of a process, the specification of a process can be done inappropriately, injuring the necessary factors for the quality of a process of software. In this context, this research aimed to specify a Process Smells catalog to support the identification of anomalies in software processes specified with BPMN based on the (SANTOS; MACIEL; SANT’ANNA, 2018) proposal. Initially, Process Smells were specified in BPMN, thus originating a new ca- talog with 8 Process Smells, Process Smells 2.0. Thirty-two professionals evaluated the specification through an interview study, which indicated that these professionals accep- ted the proposed new Process Smells catalog. The results obtained in this study made it possible to verify that the proposal to identify smells regardless of the language domain makes sense. The second stage of the study proved to be more concurrent with SPEM. The catalog specification is expected to support the identification of Process Smells in software processes modeled using BPMN to indicate the points where the Process can be improved, even before its first execution, avoiding problems that negatively affect process quality attributes. Additionally, the new catalog is expected to expand the possibility of using Process Smells.
7	Jamile de Barros Vasconcelos EXPERIMENTAL EVALUATION OF EXTREME VALUE STATISTICS USAGE ON DETERMINING EXECUTION TIME BOUNDS FOR REAL-TIME SYSTEMS DESIGN Leader : GEORGE MARCONI DE ARAUJO LIMA MEMBRES DE LA BANQUE : GEORGE MARCONI DE ARAUJO LIMA TATIANE NOGUEIRA RIOS VERONICA MARIA CADENA LIMA Data: 30 mai 2023 Afficher le Résumé Real-time systems (RTS) are those whose actions are subject to time constraints, defined in terms of execution time deadlines. A hard RTS is usually designed to never miss a deadline, as in the case of avionics and space systems. To guarantee this it is essential to know the maximum time that each task takes to execute, a parameter known as Worst Case Execution Time (WCET). Estimating the WCET of a task is not simple and, in the case of modern architectures, it is subject to unpredictable influences caused by different hardware and software elements, which prevent the derivation of an absolute WCET value. One of the most used techniques in these cases is the Measurement-Based Probabilistic Timing Analysis (MBPTA) method based on the Extreme Values Theory (EVT) statistical area, for estimating worst-case probability distributions, called Probabilistic WCET (pWCET) . The present work aims to carry out an experimental analysis of the use of EVT via MBPTA for pWCET inference, presenting the process of applying the technique in a real RPi environment and exposing the challenges and flaws encountered during this process; a point rarely demonstrated in the literature. The partial results indicate that EVT is robust, but that has weaknesses because it does not always produce adequate models and coherent pWCET results. This work is part of the Kepler project, a cooperation between UFBA and INRIA-Paris.
8	TIAGO DA CONCEICAO OLIVEIRA A simulation-based iterated local search metaheuristic for pump scheduling in water distribution networks Leader : RAFAEL AUGUSTO DE MELO MEMBRES DE LA BANQUE : CELSO DA CRUZ CARNEIRO RIBEIRO ISLAME FELIPE DA COSTA FERNANDES RAFAEL AUGUSTO DE MELO Data: 17 juil. 2023 Afficher le Résumé In a water distribution network, the electricity amount to operate the pumps can achieve 90% of the total electricity consumed. The amount charged for electricity consumption can differ at each time of the day. Therefore, scheduling the pump’s operation at opportune times can reduce energy costs. Optimal or near-optimal pump scheduling is not trivial given the nonlinear constraints of the WDN, which include the pump scheduling problem in the NP-hard class. The pump scheduling problem consists of obtaining the lowest operating monetary cost, guaranteeing that water is delivered to all demand points, and without violating the physical constraints of the water distribution network. This work proposes heuristic methods based on simulations, combining them in a metaheuristic Iterated Local Search (ILS). Computational experiments show that the proposed approach is promissory, obtaining the best results using the binary representation with the restrictions presented in this work, and when compared to other pump scheduling representations, the values obtained for the Vanzyl instance reached solutions that deviate by only 0.73% of the best-known value and at only 2.02% value for Richmond.
9	Edmilson dos Santos de Jesus MACHINE LEARNING MODELS FOR WATER DEMAND FORECAST IN THE METROPOLITAN REGION OF SALVADOR, BAHIA Leader : GECYNALDA SOARES DA SILVA GOMES MEMBRES DE LA BANQUE : ANDERSON LUIZ ARA SOUZA GECYNALDA SOARES DA SILVA GOMES PAULO HENRIQUE FERREIRA DA SILVA Data: 31 juil. 2023 Afficher le Résumé The objective of this work is to proposes a new hybrid SVR-ANN model for water demand forecasting. Where an adaptation of the methodology proposed by Zhang (2003) is used to decompose the time series of 10 reservoirs that supply the Metropolitan Region of Salvador (RMS). The data used are from the historical consumption from January/2017 to February/2022, obtained from the local supply company, Empresa Baiana de Águas e Saneamento, and meteorological data obtained from the National Institute of Meteorology of Brazil. The results demonstrated the feasibility of using the proposed model, compared to other traditional models such as the Multilayer Perceptron (MLP), Support Vector Regression (SVR), Short Long Term Memory (LSTM) and Autoregressive and Integrated Moving Average (ARIMA).
10	Dárcio Santos Rocha IDENTIFICATION OF EVENT-TIME TEMPORAL RELATIONS IN PORTUGUESE: A RULES-BASED APPROACH WITH ASSOCIATIVE CLASSIFICATION Leader : MARLO VIEIRA DOS SANTOS E SOUZA MEMBRES DE LA BANQUE : MARLO VIEIRA DOS SANTOS E SOUZA RERISSON CAVALCANTE DE ARAUJO ROBESPIERRE DANTAS DA ROCHA PITA Data: 14 août 2023 Afficher le Résumé This work aims to develop a computational method to identify types of temporal relations between events and temporal expressions in texts written in Portuguese. In order to achieve this goal, rule learning techniques will be employed to discover the best combinations of available linguistic information, formulating decision rules that can efficiently identify the types of temporal relations between events and temporal expressions. Most related works adopted a machine learning-based approach, while only one used a hybrid approach, combining manual rules. The methodology proposed in this work consists of a rule-based approach, which incorporates lexical, morphosyntactic and contextual information, Reichenbach tenses, temporal signals and knowledge about the world, in addition to TimeML annotations in the corpus. Unlike a purely machine learning approach, the rule sets generated by our method allow the combination of rules generated by different algorithms, or the combination of complete sets, which can result in better performance. In short, the method takes event/time expression pairs as input and uses a filtering strategy to select the pairs most likely to have been annotated in the corpus. It then applies sets of rules to each pair to identify the type of existing temporal relationship and a data augmentation strategy to calculate the temporal closure of all pairs and their respective identified relationships. In preliminary experiments, we proposed an initial set of manual rules for the Portuguese language. However, the results obtained showed that this set was limited, resulting in low coverage and consequently low accuracy. The maximum value achieved was 45.1% accuracy and 34.1% coverage in the test data. To improve these results, we propose to incorporate rule learning techniques to the method, aiming to increase the set of rules. These techniques are able to handle noisy data well, work well on unseen data and generate more efficient rules, in addition to offering competitive performance and working efficiently. With this incorporation, we hope that the proposed experiments will produce a set of rules capable of identifying types of event-time temporal relations efficiently and achieving superior results. This will contribute to the advancement of the state of the art in the area, in addition to disseminating the research carried out and contributing to the scientific community.
11	Davi Lima Alves A SIDECHAIN PLATFORM FOR RESTAURANT E-COMMERCE TRANSACTIONS Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : ALLAN EDGARD SILVA FREITAS CASSIO VINICIUS SERAFIM PRAZERES LEOBINO NASCIMENTO SAMPAIO MANOEL GOMES DE MENDONCA NETO ROBESPIERRE DANTAS DA ROCHA PITA Data: 22 août 2023 Afficher le Résumé Ordering food online is present in people's lives every day, however, this procedure requires sending customer data to different restaurant systems, and often these restaurants share sensitive information with their partners without authorization, creating a security risk when sharing such information. data. Also, in many cases, when a person travels to a country other than their origin and wants to order food, an exchange transaction for local currency is required, which is expensive. Blockchains have revolutionized the way of making payments online and intrinsically secure, eliminating the centralizing and authorizing entity, such as Bitcoin, the first blockchain used on a large scale. However, the scientific literature records the difficulty in view of the high frequency of transactions in Bitcoin, the high transaction time, and also its high cost per transaction, in addition to the limitation of the Script language for application development. In this step, the Ethereum blockchain emerges as an alternative to Bitcoin, in an environment of more complex applications with a higher frequency of applications and number of transactions per second, but still far from the performance of solutions such as credit cards, which disadvantages this “chain of blocks”. ” compared to traditional payment solutions such as credit cards, which also had high transaction cost rates for a long time. From the study of this context, we propose a payment model for retail transactions through the blockchain based on a Proof of Concept (POC) aimed at restaurants using the Software Development Kit (SDK) of the Lisk blockchain. POC Lisk Restaurante represents an integration of Lisk blockchain in the food industry allowing customers to buy food quickly and securely through sidechains - exclusive blockchains for special restaurant-type transactions, without intermediation and with lower fees when using a single crypto asset in the world : the LSK. We performed an empirical study of the proposed solution in which the performance of block transactions, transaction fees, auditability, solution scale, privacy and security in sending messages in transactions is evaluated. The results were used in comparison with data from the literature of Bitcoin, Ethereum, and Multichain blockchains. The results indicate that the special transactions used at Lisk Restaurante allow guaranteeing data privacy for its customers, since only the recipient of a transaction can access sensitive sender data. Thus, through research, literature review and performance evaluation experiments, it was observed that the sidechain solution for restaurants demonstrated the efficiency of blockchain technology in providing global services at a lower cost than common means of payment; also, when compared to their transaction fees with other blockchains like Bitcoin or Ethereum, their values are lower and the privacy of their customers is maintained. Also, our solution has shown an increase in the capacity of Food transactions in a single block and better scalability through sidechains, which accept transactions specific to the business involved, which differs from single blockchains such as Bitcoin or Ethereum that execute transactions for different purposes, also surpassing the first version of the Proof of Concept (PoC) Lisk Restaurante, exceeding twice the capacity of number of transactions or frequency, in a block. Finally, experiments were carried out with the Lightning network, an off-chain technology that works on top of Bitcoin, being a scalability solution for Bitcoin. The evaluation of experimental results of the Lisk sidechain solution and the Lightning network demonstrated the differences of each technology and the advantages and disadvantages of each one were praised. Thus, it was observed that the Lightning network technology can also be used in the retail industry, but without the possibility of customizing transaction costs.
12	João Medrado Gondim INCREASING THE IMAGE CAPTIONING MODELS IN PORTUGUESE THROUGH LINGUISTIC INFORMATION Leader : DANIELA BARREIRO CLARO MEMBRES DE LA BANQUE : SANDRA ELIZA FONTES DE AVILA DANIELA BARREIRO CLARO TATIANE NOGUEIRA RIOS Data: 23 août 2023 Afficher le Résumé The increase in the number of applications that require accessibility, information retrieval and human-computer interaction has culminated in a growing need for automated generation of the description of an image. This automated description requires an identification of the scenario, characters and objects present and how these elements relate to each other. From these elements it becomes possible to generate a sentence in natural language describing the content of the image. The development of methods capable of automatically generating the sentences describing the image permeates a research area called textit{Image Captioning}. Most researches and datasets in the area of Image Captioning focus on the English language, developing models and building efficient state-of-the-art resources. Languages with few resources for development, such as Portuguese, require more research to achieve a descriptive and understandable sentence. However, only the agglomeration of several objects in the image does not generate a sentence in the Portuguese language. In this context, this work proposes the analysis and incorporation of linguistic resources that can guide the language model in generating a description that is more representative of the image and the sentence in Portuguese. Experiments were performed with the translation of datasets to generate the description in Portuguese. The results indicate that the morphological analysis of the outputs of an Image Captioning model, as well as the incorporation of grammatical classes during the training, will contribute to a better description of the image in Portuguese.
13	CLEBER BRITO SANTOS Determinação de vizinhança em rede veiculares ad hoc: uma análise dos tempos limite para entrega de mensagens Leader : ALINE MARIA SANTOS ANDRADE MEMBRES DE LA BANQUE : ALINE MARIA SANTOS ANDRADE ALIRIO SANTOS DE SA ALLAN EDGARD SILVA FREITAS LEOBINO NASCIMENTO SAMPAIO Data: 5 sept. 2023 Afficher le Résumé A detecção de vizinhança em redes veiculares ad hoc envolve a determinação de veículos vizinhos para uma organização colaborativa. Esse problema apresenta desafios em relação à comunicação entre veículos em redes veiculares devido à natureza dinâmica da rede, que provoca variações imprevisíveis na qualidade do canal de comunicação devido a fatores como condições ambientais, interferências e congestionamentos. Nesta dissertação, abordamos a detecção de vizinhança entre pelotões em redes dinâmicas, enfrentando desafios como a movimentação variada dos veículos e as interferências de edificações. A organização em pelotões representa um cenário relevante para o estudo em redes veiculares, uma vez que traz benefícios na qualidade do trânsito, como a redução do consumo de combustível, a melhoria do fluxo de tráfego e o aumento da segurança. Desenvolvemos um protocolo básico de comunicação, projetado para a comunicação entre líderes de pelotões, que garante o conhecimento mútuo entre eles, possibilitando a troca de informações essenciais para a detecção de vizinhança e o funcionamento colaborativo dos pelotões. Verificamos o protocolo utilizando o verificador de modelos PRISM. Utilizando este protocolo, realizamos experimentos para determinar os tempos necessários que ocorrem entre o envio e o recebimento de mensagens trocadas entre líderes de pelotões para a determinação de vizinhança. Realizamos experimentos em canais confiáveis e não confiáveis e determinamos tempos limite de progresso por meio de simulações utilizando o framework VEINS, que integra SUMO e OMNET++. Os resultados dos experimentos forneceram informações sobre os tempos limite de progresso em vários cenários que podem servir de referência para o desenvolvimento de aplicações relacionadas à comunicação em VANETs.
14	Iury Gomes de Oliveira CycleVis - A sport visualization approach for cyclist performance analysis Leader : DANILO BARBOSA COIMBRA MEMBRES DE LA BANQUE : DANILO BARBOSA COIMBRA FREDERICO ARAUJO DURAO RENATO LIMA NOVAIS Data: 14 sept. 2023 Afficher le Résumé The technological evolution together with the growing mass production of data demand the creation of new mechanisms for processing and analysis in the most different areas of information. The same goes for the field of sports, which has a great popular appeal around the world and has been gaining more and more interest from the scientific community. Among the different sports genres, cycling has been gaining special attention in the community of information visualization and Visual Analytics over the years. In general, the focus of these works is on finding patterns to draw strategies, display statistical information, or analyze performance of groups of cyclists. However, there is a gap of studies that focus on paths not related to professional competitions or that consider attributes of climate and intensity in their analysis. In this context, this work presents CycleVis, a data visualization approach for cycling that allows a better understanding of the sports performance of cyclists. For this, intensity characteristics such as speed, heart rate and elevation were used, in addition to the geolocation of the runs/pedals. To validate the approach, a use case was developed and it was evaluated through a study with users, which resulted in a very positive evaluation for effectiveness and usability.
15	JOSÉ AUGUSTO DUARTE GOMES MultiVisD3: A Visual Analytics Approach for Quality Analysis of Multidimensional Projections Using Multiple Coordinated Views Leader : DANILO BARBOSA COIMBRA MEMBRES DE LA BANQUE : RAFAEL MESSIAS MARTINS BRUNO PEREIRA DOS SANTOS DANILO BARBOSA COIMBRA Data: 29 sept. 2023 Afficher le Résumé Multidimensional visualizations are graphical representations that assist in presenting multidimensional datasets. However, many of these techniques exhibit limited visual scalability as the number of dimensions increases, requiring ever-expanding visual space to accommodate the entire dataset. One way to address this problem is by utilizing Multidimensional Projection techniques, which perform dimensionality reduction while aiming to preserve the patterns of the original data in the projected space. However, it is highly likely that errors and distortions may be present in the projection layouts. A challenge in researching this topic is to measure and visualize the errors and distortions embedded in the projection mappings. In this regard, this article introduces MultiVisD3, a Visual Analytics approach to analyzing the quality of Multidimensional Projections. This approach comprises an interactive visual dashboard containing multiple coordinated views to facilitate understanding and comparison of error metrics between two projections. Other visualization techniques such as Treemap and tabular views support the extraction of information regarding projection quality. Finally, the proposed approach underwent user evaluation using well-known datasets to analyze functionality and usability aspects. The results demonstrated that users positively evaluated the proposed approach. More than 90% of the participants correctly completed eight out of nine tasks assigned to them, and over 90% of them approved of usability aspects related to interactions and the arrangement of visualizations.
16	LEONARDO RODRIGUES RIBEIRO MetaLProjection: An Approach for Recommending Dimensionality Reduction Algorithms Using Meta-Learning Leader : DANILO BARBOSA COIMBRA MEMBRES DE LA BANQUE : THIAGO FERREIRA COVÕES DANILO BARBOSA COIMBRA RICARDO ARAUJO RIOS Data: 6 oct. 2023 Afficher le Résumé Data visualization techniques have significant potential for analysis, summarization, and comprehension to facilitate information extraction. Recent areas such as Visual Analytics and Data Science underscore their importance, especially in analyzing complex datasets. In this sense, Multidimensional Projection techniques are particularly used for visual analysis of high-dimensional datasets because they perform dimensionality reduction and, consequently, exhibit better scalability in terms of the number of attributes/dimensions. However, there is a wide variety of these projection techniques, and determining the most suitable one for discovering visual patterns of information in one or multiple datasets is not a trivial task. While works in the literature test and compare different techniques on datasets with distinct characteristics, they do not do so systematically to assist user decision-making. In this context, this research leverages meta-learning to classify and recommend multidimensional projections, considering specific evaluation metrics from a knowledge base comprising over 500 distinct datasets. To evaluate the approach, we observed i) the relationship between the meta-attributes of all datasets, ii) the generation of a ranking containing the performance of the selected projection techniques, and iii) the accuracy of the recommendation of these techniques. Finally, the results obtained demonstrate that the developed approach effectively contributes to the selection and recommendation of multidimensional projection techniques.
17	FELIPE REBOUCAS FERREIRA ABREU An Intelligent Self-Configuring Recommender System as a Service Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO CLAUDIO NOGUEIRA SANT ANNA ROSALVO FERREIRA DE OLIVEIRA NETO Data: 28 nov. 2023 Afficher le Résumé In today's dynamic digital realm, the plethora of listing services, spanning from music platforms to product recommenders and social media content suggestions, often leaves users searching for items that truly align with their tastes. Addressing this intricacy, the rise of Recommender Systems has proven invaluable. These systems efficiently filter vast data to align items with individual preferences, enhancing user choices. This work centers on the creation of an advanced Recommender Systems API. Distinctively crafted, this API boasts universal accessibility and an uncomplicated deployment procedure. As the foundation for various Web Services, the API draws strength from the stalwart REST architecture. It is designed with a commitment to modularity, championing adaptability and flexibility. The API processes user data and queries to provide tailor-made recommendations quickly. Performance evaluations illuminated the API's commendable accuracy. It particularly shone with smaller datasets, displaying impressive data processing and algorithm execution times. The API manifested exceptional efficiency and resilience under specific test conditions, including cloud environments, especially notable in extensive 16,000-item dataset scenarios. The API is more than a tool; it paves the way for personalized digital experiences, showcasing its prowess in CRUD operations and tailored recommendations. The user evaluation phase encompassed a varied demographic: novice to experienced developers. Over half had considerable software development experience, and a significant percentage had prior engagements with coding recommender systems. With diverse knowledge of recommender libraries, most feedback praised the API's effectiveness. 81% valued the recommendations, and many felt confident in its filtering techniques. The highlight of this work is the Recommender System API's versatility. Despite positive feedback, users suggested improvements in documentation, data security, and features. These insights will shape future API refinements and user experience. Participants' enthusiastic engagement and feedback underscore the API's potential to enhance applications requiring a recommendation system, especially for developers who are perhaps less versed in the theoretical nuances. The solid research foundation and participant dedication highlight the API's potential for broader adoption by developers.
18	GENICLEITO CARVALHO BELTRAO GONCALVES Granularity to Ensure Interpretability of the Fuzzy Rules Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : MARCOS EVANDRO CINTRA HELOISA DE ARRUDA CAMARGO TATIANE NOGUEIRA RIOS Data: 30 nov. 2023 Afficher le Résumé The computational representation of human knowledge, when composed of imprecise data, is a task facilitated by the use of systems that use resources from set theory and fuzzy logic. Such systems use rules that allow mapping the knowledge obtained from data into an easy-to-understand linguistic representation. The fuzzy rule base that constitutes this type of system can be generated by a specialist or through methods that consider the characteristics of the data itself, reducing the need for a specialist in this creation process. With the appropriate adjustments and use of automated methods, it is possible to increase the interpretability of these rules without reducing the system's accuracy. In this sense, the rule base of a fuzzy system can be improved through the principle of justifiable granularity to adjust the fuzzy sets representative of the data. Therefore, in this master's thesis, the method called GEnI-FR (Granularity to Ensure Interpretability of the Fuzzy Rules) is presented, which makes adjustments and refinements in the process of generating fuzzy rules, achieving a balance between interpretability and precision, adjusting sets fuzzy based on the characteristics of the data itself. GEnI-FR presents itself as a promising method as it provides a reduction in the number of fuzzy rules while maintaining the same levels of accuracy when compared to other state-of-the-art methods.
19	JALISSON DOS SANTOS HENRIQUE MOTIVATIONS FOR APPLYING THE EXTRACT METHOD REFACTORING: A STUDY BASED ON COMMIT MESSAGES Leader : CLAUDIO NOGUEIRA SANT ANNA MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ CLAUDIO NOGUEIRA SANT ANNA RAPHAEL PEREIRA DE OLIVEIRA Data: 11 déc. 2023 Afficher le Résumé Refactorings are operations performed on source code aimed at improving the maintainability of a software system. Although literature contains a high number of studies on refactorings, there are only few works that investigate the motivations reported by developers to perform refactorings. Therefore, this research aims to investigate the motivations behind extract method refactorings in real systems. To achieve this, we conducted two empirical studies: a preliminary study involving a single software system, and another study considering multiple real systems. The studies were based on mining software repository commits in which extract method refactorings occurred. Essentially, the goal was to analyze commit messages in order to capture developers' motivations for performing such refactorings. The results identified 16 different motivations for applying extract method. Additionally, it was observed that only 16% of the refactorings had an explicit intention of improving code quality. It was also identified that refactorings with the intention of improving code quality occurred more frequently in methods with size higher than 61 lines of code and less frequently in methods smaller than 20 lines of code.
20	MATEUS CARVALHO DA SILVA Application of biased random-key genetic algorithm and formulations for the Grundy coloring problem and the connected Grundy coloring problem Leader : RAFAEL AUGUSTO DE MELO MEMBRES DE LA BANQUE : PEDRO HENRIQUE GONZÁLEZ SILVA MAURICIO GUILHERME DE CARVALHO RESENDE FREDERICO ARAUJO DURAO RAFAEL AUGUSTO DE MELO Data: 13 déc. 2023 Afficher le Résumé Given a graph G, its Grundy number $\Gamma(G)$ defines the worst-case behavior for the well-known and widely used first-fit greedy coloring heuristic. Specifically, $\Gamma(G)$ is the largest $k$ for which a $k$-coloring can be obtained with the first-fit heuristic. The connected Grundy number $\Gamma_c(G)$ gives the worst-case behavior for the connected first-fit coloring heuristic, that is, one in which each vertex to be colored, except the first, is added adjacent to an already colored vertex. Both problems are NP-hard. In this master's thesis, we present heuristic and exact approaches to the Grundy coloring problem and the connected Grundy coloring problem, which are optimization problems consisting of obtaining the Grundy number and the connected Grundy number, respectively. This study proposes the use of a algorithm Biased Random-Key Genetic Algorithm (BRKGA) and the use of integer programming formulations using a more traditional (standard) approach and a representative one. A new combinatorial upper bound is also proposed that is valid for both problems and an algorithm using dynamic programming for its calculation. The computational experiments show that the new upper bound can improve over a well-established combinatorial bound available in the literature for several instances. The results also evidence that the formulation by representatives has an overall superior performance than the standard formulation, achieving better results for the denser instances, while the latter performs better for the sparser ones to the Grundy coloring problem. However, we show that these types of integer programming formulations are computationally impractical for the connected version. Furthermore, the BRKGA can find high-quality solutions for both problems and can be used with confidence in large instances where the formulations fail for the Grundy coloring problem.
21	JUVENAL CONSTANTINO DE MACÊDO JÚNIOR Investigating the association between change bursts and build status Leader : RODRIGO ROCHA GOMES E SOUZA MEMBRES DE LA BANQUE : RODRIGO ROCHA GOMES E SOUZA IVAN DO CARMO MACHADO JOSÉ AMÂNCIO MACEDO SANTOS TIAGO OLIVEIRA MOTTA Data: 18 déc. 2023 Afficher le Résumé Commit bursts are sequences of changes made by developers that occur in code within a short period of time. In projects that adopt the practice of Continuous Integration (CI), every time a modification is completed, a new version of the code is created, generating a new build. In this new version, the changes made are verified automatically, running unit tests and reporting the result of failure or success of build to the developers. In this sense, the objective of this research is to carry out an empirical study to verify the association between bursts of commits and failures in build. The first step was to carry out an empirical study based on the mining of deposits, through which the relationship between bursts of commits and construction failure was identified. After this study, a survey was followed to evaluate the results obtained, considering the opinions of developers who work with CI. Among the discoveries made, the results of the first study show that in some projects the success rate of builds after bursts of changes tends to decrease. However, we cannot generalize the results to all projects since, in most of the studied projects, the difference was not statistically significant. In the second study, most survey participants agree that the proximity of a project's delivery data is a factor responsible for bursts of commits and build failure. However, we cannot generalize the results, since some developers had small disagreements about who was responsible for the construction failure. Thus, the results of this work intend to contribute to the community of developers who use CI, helping them to reduce failures in build, facilitating future work on good development practices.
22	ANGELA PEIXOTO SANTANA A Characterization Study of Technical Debt in Software in Industry Leader : CHRISTINA VON FLACH GARCIA CHAVEZ MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ IVAN DO CARMO MACHADO CRESCENCIO RODRIGUES LIMA NETO TIAGO OLIVEIRA MOTTA Data: 20 déc. 2023 Afficher le Résumé Context. The term Technical Debt (TD) denotes the results and consequences of the decision to prioritize the rapid delivery of software, focusing on short-term effectiveness at the expense of its quality. Technical debts incurred during the development and maintenance of software must be identified, characterized, and repaid in the future to avoid compromising its quality and the ease of its evolution. In software projects developed in the industry, there is a recognized need to identify and characterize TD throughout the software lifecycle, aiming to manage it and mitigate threats to the long-term success of software. Industry studies allow exploring different aspects of TD in practice and from various perspectives, such as those of developers and managers, enriching the characterization of TD. Objectives. The main objective of this work is to investigate, in the context of an IT company, the perceptions of development team professionals regarding the concept of Technical Debt (TD) and how TD manifests in one of the developed software systems. Methods. We conducted a study in the context of a software development company branch, comprising: (i) a survey with employees engaged in software development at the branch; (ii) a preliminary characterization of the Technical Debt (TD) in a software system based on the identification of TD using the static analysis tool adopted by the company, SonarQube; (iii) semi-structured interviews with the development team of the analyzed software regarding the TD identified by SonarQube. Results. The survey study received 32 valid responses. It was observed that the concept of Technical Debt (TD) was known to the majority of professionals who participated in the study; however, they exhibited little familiarity with the long-term consequences of TD. The characterization of the software system based on the automatic identification of TD was conducted and used as support for the interviews. The deadlines set for the implementation of requirements are notably short. The documentation, when not updated, becomes insufficient in the long term, while testing is relegated to the background in favor of faster deliveries. Conclusions. A characterization of Technical Debt in the industry revealed that established deadlines are the predominant factor for development teams to decide to incur technical debt. The team often chooses this approach to meet delivery deadlines. Despite being a known topic, there is still a need to disseminate the long-term consequences of Technical Debt. However, despite the increasing research on technical debt, the field lacks consensus on its definitions and long-term impacts on the industry.
23	ADRIANO HUMBERTO DE OLIVEIRA MAIA A Fog Computing-based Framework for Data Reduction in a Traffic Detection System for VANETs Leader : MAYCON LEONE MACIEL PEIXOTO MEMBRES DE LA BANQUE : RODRIGO AUGUSTO CARDOSO DA SILVA ROBERTO RODRIGUES FILHO MAYCON LEONE MACIEL PEIXOTO Data: 20 déc. 2023 Afficher le Résumé With the growth in the number of vehicles in the world in recent years, it has become necessary to adopt technologies to deal with the consequences that this vehicle volume can generate for large cities, such as increased congestion on highways. Vehicular Ad-Hoc Networks (VANETs) present themselves as a promising technology in this scenario, helping to form vehicular networks capable of interconnecting vehicles and infrastructure to understand and deal with vehicle congestion. Considering this, the amount of data generated by this environment increases as the number of vehicles on the roads increases. Consequently, sending data from the vehicular environment to the structure that identifies congestion can be increasingly costly from the point of view of network use, potentially generating overloads and increased latency, making quick decision-making difficult. Therefore, in this work, we propose the construction of a Framework that aims to identify vehicle congestion, with an approach to reduce the data generated by a VANET in the fog layer and then send only the most relevant data to the cloud. for decision making. In addition to congestion detection, with historical data in time series format we perform congestion prediction using ARIMA. To work with data reduction, \textit{Framework} uses simple sampling algorithms and clustering techniques (DBSCAN and XMEANS). The results showed that the use of clustering algorithms in this Framework can achieve a significant level of accuracy in detecting traffic congestion together with a marked reduction in network usage.
24	IURY MAIA DE ALMEIDA Predicting Player Churn in a Free-To-Play Game Using Game Analytics Leader : RODRIGO ROCHA GOMES E SOUZA MEMBRES DE LA BANQUE : LYNN ROSALINA GAMA ALVES RODRIGO ROCHA GOMES E SOUZA TIAGO OLIVEIRA MOTTA Data: 21 déc. 2023 Afficher le Résumé The issue of player churn in free-to-play games poses a significant challenge in the electronic gaming industry. The growing popularity of these business models, where players can access the game for free, places a crucial emphasis on retaining these users to ensure the financial success and sustainability of the game. In this scenario, predictive analysis emerges as an essential tool to anticipate and understand the patterns of player churn. This study began with a systematic literature review in the field of predictive models in game analytics, aiming to answer the main research question: How are predictive models applied in game analytics? The research was conducted based on a protocol that defined the objectives, research questions, and inclusion and exclusion criteria. The main findings indicate that research on predictive models in game analytics has grown significantly since 2010, with a variety of machine learning techniques being applied. Furthermore, the most investigated prediction targets include the probability of winning, churn prediction, and player expertise. Regarding preprocessing techniques, several approaches were identified, such as Principal Component Analysis (PCA) and web scraping techniques. We focused our research on churn prediction, initially by defining churn and establishing cutoff dates, considering multiple time windows for classifying players as churned or recurrent. The analysis addressed the validity threats of the work, including churn definition issues, class imbalance, and the use of techniques like SMOTE to balance the data. Six machine learning models were evaluated, with an emphasis on metrics like accuracy, precision, recall, and AUC (Area Under the Curve). The 10-fold cross-validation technique was applied to validate the models, providing a more comprehensive view of their performance. The analysis of feature importance revealed which player characteristics were most relevant for churn prediction, although the interpretation of these features was highlighted as context-dependent. Ultimately, the work offered promising insights into the prediction of player churn in free-to-play games but emphasized the need for careful approaches and contextual considerations to mitigate validity threats and ensure the generalization of models to different datasets and time periods.
	Thèses
1	EMMANUEL SÁVIO SILVA FREIRE Organizing the State of Practice on Technical Debt Prevention, Monitoring, and Payment in Software Projects Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : DANIELA SOARES CRUZES MANOEL GOMES DE MENDONCA NETO RENATO LIMA NOVAIS THIAGO SOUTO MENDES UIRÁ KULESZA Data: 30 janv. 2023 Afficher le Résumé Context: Technical debt (TD) describes the effects of immature artifacts on software development that can bring benefits in the short term but may have to be paid with interest in the long term. TD management balances short-term and long-term goals, supporting development teams to decide on the need and the best time to eliminate the debt. TD management activities include prevention, monitoring, and payment. Through prevention, it is possible to prevent teams from incurring TD, while monitoring helps them follow the evolution of TD items concerning the cost-benefit of eliminating them or not, that is, paying the debt items. Knowing the practices used to prevent, monitor, and pay TD items can help development teams to choose the best practice to be used in their projects. Identifying the practice avoidance reasons (PARs) that lead to nonprevention, non-monitoring, and non-payment of TD can help teams understand which aspects need to be improved to enable TD management. Although the technical literature has investigated the prevention, monitoring, and payment of TD, current results only reflect the viewpoint of a small number of professionals and organizations. To achieve the benefits of TD management, it is necessary to investigate more deeply the practices and PARs associated with these TD activities. Aims: This Ph.D. dissertation aims to investigate, through the continuous and independent replication of a family of surveys conducted globally, the state of practice on the prevention, monitoring, and payment of TD items in software projects. Method: Initially, we conducted a literature review on the current state of research on TD and its prevention, monitoring, and payment. Then, we analyzed data collected by six replication teams from the InsighTD project, which is a family of globally distributed surveys on the causes, effects, and management of TD. From the body of knowledge resulting from the analysis of InsighTD data, we defined three artifacts: an updated version of the conceptual model for TD, a set of conceptual maps, and IDEA (Impediments, Decision factors, Enabling practices, and Actions) diagrams. Finally, we assessed these artifacts through case studies in academic and industrial settings. Results: This Ph.D. dissertation presents the leading practices used to prevent, monitor, and pay off TD items and the PARs that justify the non-application of these practices. Regarding the prevention of TD, well-defined requirements, adopting good programming practices, and better project management are among the five most cited practices related to prevention, while short deadlines, ineffective management, and lack of predictability in the software development are among the five most cited PARs to justify the non-prevention of debt. About TD monitoring, TD item backlog, use of tools, and team meetings are among the five most cited practices related to monitoring, while lack of interest, focus on short-term goals, and lack of time are among the five PARs used to explain the non-monitoring of TD items. Regarding TD payment, code refactoring, investing effort in TD payment activities, and design refactoring are among the top five payment-related practices, while focusing on short-term goals, lack of organizational interest, and lack of time are among the five most cited PARs to explain the non-payment of TD. We update the conceptual model for TD by including the knowledge we learn from the state of practice and organize all practices and PARs along with their types, natures, and categories into maps and IDEA diagrams. From the conceptual model and TD payment map assessment, we found that they are well organized and provide valuable information to define strategies for TD management. The IDEA diagrams assessment provided positive evidence that the diagrams are easy to read and follow and can influence decisions on how to manage TD items. Conclusion: Using the InsighTD data, this Ph.D. dissertation explores the state of practice on TD prevention, monitoring, and payment, revealing the primary practices used to perform these activities and the PARs that avoid their execution. All body of knowledge was organized into three artifacts that can drive new investigations on TD and support software practitioners in increasing their capabilities and reducing their issues in managing debt items.
2	Edson Mota da Cruz DaRkaM: A Fog-Based Data Reduction Framework Applied to the Context of Urban Computing Leader : MAYCON LEONE MACIEL PEIXOTO MEMBRES DE LA BANQUE : ADEMAR TAKEO AKABANE DIONISIO MACHADO LEITE FILHO GERALDO PEREIRA ROCHA FILHO LOURENÇO ALVES PEREIRA JUNIOR MAYCON LEONE MACIEL PEIXOTO Data: 1 févr. 2023 Afficher le Résumé The Intelligent Transport Systems (ITS) has a function to analyze the flow of vehicles on highways in order to identify any traffic anomalies, ensuring greater efficiency during the decision-making process. These systems can be based on an Ad-Hoc vehicular network (VANETs) able to integrate the elements of urban space through a distributed communication system. Similarly, ITS applications require constant monitoring of roads, and such monitoring aims to analyze, among other aspects, the variation of the vehicle density over time. In general, this process occurs by means of the periodic sending of situational data from the mobility environment to the cloud. Consequently, the data sets sent at high frequency to the cloud form a continuous data flow that should be processed in a real-time context. However, this dynamic implies in a progressive increase of the communication cost, in function of the volume of data transferred in the link between the fog and the cloud, increasing the risks of overload beyond increasing the latency during requests for services made available in the cloud. Therefore, this work proposes the development of a multilayer architecture for data reduction based on Fog Computing called DaRkaM, acronym in English for (Data Reduction Framework for Traffic Management). The strategy consists of using a monitoring model able to perform data reduction processes directly at the edge of the vehicular network. At the cloud layer, DaRkaM acts as a central controller, analyzing the geographic positions of vehicles that are received from a continuous data flow. These data are used to monitor and perform the traffic management processes addressed in this proposal. At the edge network, a data reduction module was designed to host different traffic monitoring strategies. This architecture favors the comparative analysis among different approaches, ranging from the use of algorithms based on simple sampling until clustering algorithms, in which the data reduction processes are structured based on the number of clusters. The results showed that the use of cluster-based algorithms, hosted in the data reduction core of the DaRkaM framework, are able to achieve high accuracy in monitoring and detecting traffic congestion, in addition, they are able to reach a significant reduction in communication cost, especially in overloaded scenarios.
3	DIEGO BRAGA MONTEIRO DE MOURA Heterogeneous Memory Management for Graph Applications Leader : VINICIUS TAVARES PETRUCCI MEMBRES DE LA BANQUE : PAUL M. CARPENTER LUIS FILIPE NUNES QUARESMA DE OLIVEIRA DANIEL MOSSÉ ESBEL TOMÁS VALERO ORELLANA GEORGE MARCONI DE ARAUJO LIMA VINICIUS TAVARES PETRUCCI Data: 9 mars 2023 Afficher le Résumé The demand for memory has increased due to various applications such as big data, machine learning, social networks, and streaming analytics. DRAM technology is facing issues with scalability, energy, and costs, despite its low latency advantage. Non-Volatile Memory (NVM) is an alternative to DRAM scalability issues. Although NVM has high density, low cost, and low energy, its high latency prevents it from replacing DRAM. Heterogeneous memories like DRAM+NVM are likely to become common, and efficient data placement is an important research question. This PhD dissertation focuses on managing data placement for graph analytics applications. It studies machine learning models to predict application performance in a context with multiple applications and explores data mapping choices at the object or chunk level for a single application. The dissertation contributes to offline identification of performance-influencing object attributes, implementation of an offline scheduler at the chunk level, and implementation of an online scheduler at the chunk level.
4	Valéria Argôlo Rosa de Queiroz Semio-participatory interaction design by elderly people: empathy and engagement in technology production Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : SIMONE BACELLAR LEAL FERREIRA KAMILA RIOS DA HORA RODRIGUES FAUSTO ORSI MEDOLA ECIVALDO DE SOUZA MATOS MARLO VIEIRA DOS SANTOS E SOUZA Data: 27 mars 2023 Afficher le Résumé The number of elderly people has increased considerably around the world. Thus, science has developed strategies to help people age healthily and maintain an active life. Digital technologies can contribute to this. However, these technologies are not always designed according to the real needs and expectations of the elderly. Encouraging the elderly to design computational technologies through open and participatory perspectives of interaction design has been shown to be a possible solution. However, engaging older people in this process and making them a design partner is still challenging. Most studies based on participatory practices found do not establish the engagement of potential users, especially in relation to elderly people. Therefore, this research aimed to investigate and develop techniques to engage the elderly in designing digital technologies through a semi-participatory interaction (co)design process, establishing empathy as a fundamental engagement element. To this end, an engagement model was initially developed to integrate the elements and variables of this model in a semi-participatory process. From this perspective, two case studies were carried out, the first in a face-to-face format and the second in a remote format, in a qualitative approach with elderly women. The results analyzed using the content analysis method confirmed the potentiality of the elements and variables to improve the engagement of the elderly person and, thus, provided the conception of an informative educational document in Guide format, highlighting the main techniques, activities and strategies with the purpose of making the conduction of a semi-participatory Interaction co-design process more practical and accessible. It is expected that these results may contribute to the expansion of semi-participatory interaction co-design processes from the perspective of the elderly person, besides offering epistemological elements for developing inclusive interaction design methods and processes.
5	Djan Almeida Santos Comprehensibility of Source Code with Feature Dependency in Configurable Systems Leader : CLAUDIO NOGUEIRA SANT ANNA MEMBRES DE LA BANQUE : ROHIT GHEYI FLÁVIO MOTA MEDEIROS CLAUDIO NOGUEIRA SANT ANNA EDUARDO SANTANA DE ALMEIDA IVAN DO CARMO MACHADO Data: 28 mars 2023 Afficher le Résumé Conditional compilation is often used to implement variability in configurable systems. This technique relies on #ifdefs to delimit feature code. Previous studies have shown that #ifdefs may hinder code comprehensibility. However, those studies did not explicitly take feature dependencies into account. Feature dependency occurs when different features refer to the same program element, such as a variable. Comprehensibility may be even more affected in the presence of feature dependency, as the developer must reason about different scenarios affecting the same variable. Our goal is to understand how feature dependency affects the comprehensibility of configurable system source code. We conducted four complementary empirical studies. In Study 1, forty-six developers responded an online survey. They executed tasks in which they had to analyze programs containing #ifdefs with and without feature dependency. However, feature dependencies may be of different types depending on the scope of the shared variable. In Study 1, we were not concerned with different types of dependency. Thus, in Study 2, we carried out an experiment in which 30 developers debugged programs with different types of feature dependency. Each program included a different type of feature dependency: global, intraprocedural or interprocedural. We used an eye-tracking device to record developers' gaze movements while they debugged programs. However, feature dependencies do not differ from each other only in terms of types. They can also present differences in terms of number of dependent variables and degree of variability, among others. To address those characteristics, we complemented Study 1 and 2 by means of Studies 3 and 4. In Study 3, we executed a controlled experiment with 12 participants who analyzed programs with different numbers of dependent variables and number of uses of dependent variables. In Study 4, we carried out an experiment in which 12 developers analyzed programs with different degrees of variability. Our results show that: (i) analyzing programs containing #ifdefs and feature dependency required more time than programs containing #ifdefs but without feature dependency, (ii) debugging programs with #ifdefs and global or interprocedural dependency required more time and higher visual effort than programs with intraprocedural dependency, (iii) the higher the number of dependent variables the more difficult it was to understand programs with feature dependency, and (iv) the degree of variability did not affect the comprehensibility of programs with feature dependency. In summary, our studies showed that #ifdefs affected comprehensibility of configurable systems in different degrees depending on the presence or not of feature dependency, on the type of feature dependency, and on the number of dependent variables.
6	LEANDRO OLIVEIRA DE SOUZA AN AUTOMATED SOFTWARE TRANSPLANTATION APPROACH FOR REENGINEERING OF SYSTEMS INTO PRODUCT LINES Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : CLAUDIO NOGUEIRA SANT ANNA EDUARDO SANTANA DE ALMEIDA GUSTAVO HENRIQUE LIMA PINTO LEOPOLDO MOTTA TEIXEIRA RODRIGO ROCHA GOMES E SOUZA Data: 30 juin 2023 Afficher le Résumé Although Software Product Lines (SPL) offer the potential for order-of-magnitude improvements in software engineering performance, the up-front cost, level of effort, assumed risk, and latency required to make the transition to SPL are prohibitive adoption barriers for many organizations that could otherwise benefit from reusing of their existing systems. The SPL adoption from an extractive model based on a reengineering process of existing systems into SPL is an active research topic with tangible benefits in practice. It allows software development companies to preserve their investment and aggregate knowledge obtained while developing their individually developed systems portfolio. Despite these benefits, adopting an extractive product line approach still requires a considerable upfront investment and is more complex to evolve than one-off systems. Because of these drawbacks, software companies refrain from adopting SPL, resorting to an ad-hoc practice of clone-and-own. To speed the conversion to and maintenance of SPL, we present FOUNDRY, a Software Transplantation (ST) approach that guides transplanting and merging features in a product line from existing systems. It is the first approach for SPL that automates all stages of product line construction using the ST technique. We realized Foundry in a software transplantation tool for SPL that automates identifying, adapting, and transferring features from existing systems to a standard product base. Its code transfer mechanism between different systems allows it to be used not only for generating product lines but also as an alternative to the clone-and-own technique for system specialization. We compared our proposal with the existing reengineering solutions to demonstrate evidence that the ST is an alternative with the potential for application in reengineering existing systems to SPL. In the search for more concrete evidence, we evaluated two case studies where two products were generated by transplanting features from three real-world systems. Moreover, we experimented with comparing Foundry’s feature migration with manual effort. We show that Foundry automatically migrated features across codebases 4.8 times faster, on average, than the average time a group of SPL experts took to accomplish the task. Although preliminary, our evaluation provides evidence to support the claim that ST for Software Product Line Engineering (SPLE) is a feasible and promising new research direction.
7	SILVIO LUIZ BRAGATTO BOSS Avaliação automática de mapas conceituais para identificar indícios de aprendizagem significativa Leader : ALINE MARIA SANTOS ANDRADE MEMBRES DE LA BANQUE : DAVIDSON CURY ALINE MARIA SANTOS ANDRADE CREDINE SILVA DE MENEZES ECIVALDO DE SOUZA MATOS LYNN ROSALINA GAMA ALVES MARLO VIEIRA DOS SANTOS E SOUZA Data: 1 sept. 2023 Afficher le Résumé Mapas Conceituais são ferramentas pedagógicas amplamente utilizadas para facilitar o processo de aprendizagem. De forma geral, podem ser definidos como diagramas com organização hierárquica que apresentam relações entre conceitos. São um importante instrumento para o ensino e avaliação do conhecimento por representarem, entre outras características, a estrutura cognitiva do aprendiz. Os mapas conceituais estão fundamentados na Teoria da Aprendizagem Significativa de Ausubel, assim, é possível avaliar os mapas com base em aspectos dessa Teoria, como as categorias-chave da diferenciação progressiva e reconciliação integrativa. Estas representam informações sobre o processo de construção do conhecimento sendo, portanto, fundamentais para uma avaliação de aprendizagem significativa. As categorias-chave podem ser identificadas no processo de elaboração dos mapas, no entanto, este processo não vem sendo muito explorado na literatura. A maioria dos trabalhos focam na análise comparativa entre o mapa do aprendiz e um mapa de referência para verificar se os conceitos e relações estão representados corretamente. A análise comparativa entre mapas é importante, mas não suficiente para uma avaliação da aprendizagem significativa, visto que as categorias-chave bem como outras informações da estrutura cognitiva do aprendiz, ou seja, como ele constrói e reconstrói seu conhecimento no decurso do seu processo de aprendizagem, são essenciais para identificar indícios de aprendizagem. Na literatura, há trabalhos que analisam a estrutura do mapa conceitual para classificar a qualidade da aprendizagem em aprendizagem mecânica, significativa ou não- aprendizagem, mas não analisa o processo de construção do mapa e nenhuma discussão é feita sobre a automatização da avaliação. Pesquisas no campo da Informática na Educação têm investigado o problema de se avaliar a aprendizagem, comparando mapas conceituais de aprendizes com um mapa de referência, por meio de ferramentas computacionais. Há diversas soluções que envolvem análise estrutural utilizando padrões de comparação e análise semântica por meio de ontologias de domínio. Esta tese de doutorado apresenta um modelo e um método para avaliação de mapas conceituais que considera avaliação sintática e semântica bem como o processo de construção de mapas, propondo critérios para identificar indícios de aprendizagem mecânica, significativa e não-aprendizagem. São consideradas avaliação qualitativa e quantitativa. Um framework conceitual é especificado fornecendo um arcabouço sobre a organização e estruturação de avaliação de mapas conceituais para apoiar a construção de sistemas computacionais para a sua automatização. Foram desenvolvidos estudos de caso, que serviram como uma primeira avaliação do método proposto.
8	GABRIELA OLIVEIRA MOTA DA SILVA EXPLOITING LOD-BASED SIMILARITY PERSONALIZATION STRATEGIES FOR RECOMMENDER SYSTEMS Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO FREDERICO ARAUJO DURAO LAIS DO NASCIMENTO SALVADOR NATASHA CORREIA QUEIROZ LINO ROSALVO FERREIRA DE OLIVEIRA NETO Data: 28 sept. 2023 Afficher le Résumé Linked Open Data (LOD) is a cloud of freely accessible and interconnected datasets that encompass machine-readable data. These data are available under open Semantic Web standards, such as Resource Description Framework (RDF), SPARQL Protocol, and RDF Query Language (SPARQL). One notable example of a LOD set is DBpedia, a crowd-sourced community effort to extract structured information from Wikipedia and make this information openly available on the Web. The semantic content of LOD and the advanced features of SPARQL has opened unprecedented opportunities for enabling semantic-aware applications. LOD-based Recommender Systems Recommender Systems usually leverage the data available within LOD datasets such as DBpedia to recommend items such as movies, places, books, and music to end-users. These systems use a semantic similarity algorithm that calculates the degree of matching between pairs of resources by counting the number of direct and indirect links between them, the length of the path between them, or the hierarchy of classes. Conversely, calculating similarity in RDF graphs could be difficult because each resource can have hundreds of links to other nodes. Not all of them are semantically relevant or can be applied to all resources in the graph. This can lead to the well-known matrix sparsity problem. Nevertheless, some effort has been made to select subsets of features, i.e., links, which are more helpful to computing similarity between items of a graph dataset, reducing the matrix dimension. Despite several studies in this field, there is still a lack of solutions applied to the personalization of feature selection tasks. In this context, we propose personalized strategies to improve semantic similarity precision in LOD-based Recommender Systems, including i) applying a feature selection approach to filter the best features for a particular user, ii) personalizing the RDF graph by adding weights to the edges, according to the user’s previous preferences; and iii) exploiting the similarity of literal properties as well as the links from the user model. The evaluation experiments used combined data from DBpedia and MovieLens and DBpedia and LastFM datasets. Results indicate significant increases in top-n recommendation tasks in Precision@K (K=5, 10), Map, and NDCG over non-personalized baseline similarities methods such as Linked Data Semantic Distance (LDSD) and Resource Similarity (ReSim). The results show that the LOD-based strategies of user model personalization.
9	SAULO ANTONIO DE LIMA MATOS Invariants and Neighborhood Structures for 1-factorizations of complete graphs Leader : RAFAEL AUGUSTO DE MELO MEMBRES DE LA BANQUE : RAFAEL AUGUSTO DE MELO TIAGO DE OLIVEIRA JANUARIO CELSO DA CRUZ CARNEIRO RIBEIRO MARCIO COSTA SANTOS VINICIUS FERNANDES DOS SANTOS SEBASTIÁN ALBERTO URRUTIA Data: 3 oct. 2023 Afficher le Résumé A 1-factorization is a partition of the edge set of a graph into perfect matchings. The concept of 1-factorization is of great interest due to its applications in modeling sports tournaments. Two 1-factorizations are said to be isomorphic (belong to the same isomorphism class) if there exists a bijection between their sets of vertices that transforms one into the other. The non-isomorphic 1-factorization search space is a graph in which each isomorphism class is represented by a vertex and each edge that connects the vertices $\mathcal{F}_a$ and $\mathcal{F}_b$ corresponds to a move in a neighborhood structure, which from a 1-factorization isomorphic to $\mathcal{F}_a$ generates a 1-factorization isomorphic to $\mathcal{F}_b$. An invariant of a 1-factorization is a property that depends only on its structure such that isomorphic 1-factorizations are guaranteed to have equal invariant values. An invariant is complete when any two non-isomorphic 1-factorizations have distinct invariant values. This thesis reviews seven invariants used to distinguish non-isomorphic 1-factorizations of $K_{2n}$ (complete graph with an even number of vertices). Additionally, considering that the invariants available in the literature are not complete, we propose two new ones, denoted lantern profiles and even-size bichromatic chains. The invariants are compared regarding their sizes and calculation time complexity. Furthermore, we conduct computational experiments to assess their ability to distinguish non-isomorphic 1-factorizations. To accomplish that we use the sets of non-isomorphic 1-factorizations of $K_{10}$ and $K_{12}$, as well as the sets of non-isomorphic perfect 1-factorizations of $K_{14}$ and $K_{16}$. We also consider algorithmic and computational aspects for exploring the generalized partial team swap (GPTS) neighborhood, a neighborhood structure for round-robin sports scheduling problems recently proposed in the literature. In this regard, we present a framework to explore the GPTS neighborhood. Furthermore, a discussion is presented on how this neighborhood structure increases the connectivity of the search space defined by non-isomorphic 1-factorizations of $K_{2n}$ (for $8 \le 2n \le 12$) when compared to other neighborhood structures.
10	RENATA MARIA DE SOUZA SANTOS A Multi-Case Study of Software Engineering Practices in Early-Stage Startups Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : GLEISON DOS SANTOS SOUZA KIEV SANTOS DA GAMA CHRISTINA VON FLACH GARCIA CHAVEZ IVAN DO CARMO MACHADO RITA SUZANA PITANGUEIRA MACIEL Data: 14 nov. 2023 Afficher le Résumé Early-stage startups, often founded by small teams with innovative yet unproven ideas, face the challenge of validating their concepts in the market. These ideas take shape through a fluid business model, adapting until a repeatable and scalable approach is established. Startups must grapple with the pressure to deliver a minimum viable product or service swiftly. For software startups, this often centers around software or technology-mediated offerings, operating within a competitive landscape alongside companies of various sizes. Recent research has explored software startups' context, objectives, challenges, and practices. While extensive research has delved into how software startups engage in software development, there remains a gap in understanding how they select and implement specific development practices. Uncovering their software development processes and methodologies is equally essential, as it directly impacts their ability to overcome software development challenges. Key questions include: What characterizes software development in startup environments? How do startups prioritize product quality attributes in their development processes? What software engineering practices support their development endeavors, and which tools aid their progress? This thesis seeks to gain insights into how early-stage software startups navigate the software development journey, shedding light on their priorities, processes, and the tools they leverage. Our research adopts a qualitative approach in the form of a multiple case study involving 14 organizations. We analyze the data employing Grounded Theory techniques, including open, axial, and selective coding. The findings underscore the critical relevance of human factors, software development processes, software engineering practices, and external influences in the software development journey. Understanding these factors has led to the formulation of practical recommendations to bolster early-stage software development within startups. This work identifies pivotal factors influencing software development within startups and offers practical recommendations for their early-stage software development endeavors.
11	RENATA MARIA DE SOUZA SANTOS A Multi-Case Study of Software Engineering Practices in Early-Stage Startups Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ GLEISON DOS SANTOS SOUZA IVAN DO CARMO MACHADO KIEV SANTOS DA GAMA RITA SUZANA PITANGUEIRA MACIEL Data: 14 nov. 2023 Afficher le Résumé Early-stage startups, often founded by small teams with innovative yet unproven ideas, face the challenge of validating their concepts in the market. These ideas take shape through a fluid business model, adapting until a repeatable and scalable approach is established. Startups must grapple with the pressure to deliver a minimum viable product or service swiftly. For software startups, this often centers around software or technology-mediated offerings, operating within a competitive landscape alongside companies of various sizes. Recent research has explored software startups' context, objectives, challenges, and practices. While extensive research has delved into how software startups engage in software development, there remains a gap in understanding how they select and implement specific development practices. Uncovering their software development processes and methodologies is equally essential, as it directly impacts their ability to overcome software development challenges. Key questions include: What characterizes software development in startup environments? How do startups prioritize product quality attributes in their development processes? What software engineering practices support their development endeavors, and which tools aid their progress? This thesis seeks to gain insights into how early-stage software startups navigate the software development journey, shedding light on their priorities, processes, and the tools they leverage. Our research adopts a qualitative approach in the form of a multiple case study involving 14 organizations. We analyze the data employing Grounded Theory techniques, including open, axial, and selective coding. The findings underscore the critical relevance of human factors, software development processes, software engineering practices, and external influences in the software development journey. Understanding these factors has led to the formulation of practical recommendations to bolster early-stage software development within startups. This work identifies pivotal factors influencing software development within startups and offers practical recommendations for their early-stage software development endeavors.

Thèses

PEDRO HENRIQUE BATISTA DIAMANTINO
Visual Analytics to Support the Visual Exploration of Technical Debts in Software Repositories
Leader : DANILO BARBOSA COIMBRA
MEMBRES DE LA BANQUE :
DANILO BARBOSA COIMBRA
MANOEL GOMES DE MENDONCA NETO
DANIEL GARCIA FEITOSA
Data: 26 janv. 2023

Afficher le Résumé
The increasing access to digital technologies leads to a large-scale data production and consumption worldwide. Consequently, there is a high demand for computing analysis techniques that help users to obtain a better understanding of data. An inherent challenge in this context is to analyze large volume of complex and heterogeneous datasets, such as conceived by software repositories. Areas such as Software Visualization and Visual Software Analytics have become increasingly used to support developers in software comprehension by providing a visual panorama of the entire software development process. Those graphical representations enable useful information extraction, especially for system maintenance when analyzing technical debts. Although repositories are mostly composed of multidimensional datasets, there is a lack of works that apply multidimensional visualizations to identify and track distinct groups of technical debts. In this sense, the proposal presents the Visual Debts Analytics tool, an approach based on Visual Analytics consisting of multiple coordinated multidimensional visualizations for the analysis of different groups of technical debts in software repositories. In particular, the proposed technique aims to identify and track the correlation, structure, evolution and similarities of technical debts in open-source software repositories. To evaluate our approach we presented case studies in two different repositories that demonstrate the a better comprehension in the project life cycle, enabling insights into the project quality.

ERINALDO SANTOS OLIVEIRA
An Assessment Model to evaluate the efficacy of using FLOSS projects in Software Engineering Education: A Proposal Based on Content Analysis
Leader : CHRISTINA VON FLACH GARCIA CHAVEZ
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
LAIS DO NASCIMENTO SALVADOR
ROBERTO ALMEIDA BITTENCOURT
Data: 31 mars 2023

Afficher le Résumé
Context: The software industry's growing demand for qualified professionals highlights the importance and need for valuing Education in Software Engineering in the training of graduates from higher education courses in the area of Computing. Educators are continually challenged to keep up with emerging trends in the area, to identify appropriate techniques and incorporate them into their software engineering disciplines, in order to promote the development of skills and competences that guarantee graduates of courses in the area of Computing an education that reconciles the learning of concepts, processes, techniques and tools provided in the academic environment with the needs and best practices of the software industry. Free, Libre and Open Source software projects provide an opportunity for students and professors of undergraduate courses to work on the content of the software engineering discipline in realistic projects, which bring a real-world experience close to the demands of the world of work. Problem: One of the challenges associated with the use of FLOSS in the EES is how the educator can evaluate the chosen project as a teaching and learning resource, as well as its effectiveness for the pedagogical objective proposed by the activity or discipline. It is relevant to use evaluation methods or models that allow the professor to measure and evaluate the results from his choice, according to the learning requirements necessary after the conclusion of the activity or discipline. Objective: This work proposes the creation of a model to evaluate the effectiveness of using FLOSS projects in the teaching of Software Engineering. The proposed model will be based on the mapping of evaluation criteria that will be structured in order to facilitate the analysis of the scope of the pedagogical objectives planned for the curricular component. Research Methods: Through Content Analysis, the aim is to identify the most relevant technical and sociotechnical skills and abilities for the Software Engineering discipline, according to the main curricular guidelines in Brazil and abroad, and to investigate aspects of interference, both technical and pedagogical, of the use of FLOSS projects in teaching the discipline of software engineering. Based on this information, it is intended to define the evaluation criteria and, through a theoretical dialogue, identify their relationships to propose a Model for evaluating the effectiveness of the use of FLOSS projects in the teaching of Software Engineering. Expected Contributions: The use of methods to evaluate FLOSS projects in the EES can stimulate the insertion of educators in the FLOSS ecosystem, in addition to offering new forms of evaluation to those who recognize and adopt such projects in the classroom, a solution to the real development of skills and abilities associated with theoretical and practical aspects of software engineering, important in the training of professionals from the various areas of the Computing area.

Levy Marlon Souza Santiago
Gifflar: a framework for generating smart contracts code on the fly
Leader : FABIOLA GONCALVES PEREIRA GREVE
MEMBRES DE LA BANQUE :
FABIOLA GONCALVES PEREIRA GREVE
RODRIGO ROCHA GOMES E SOUZA
ALEX BORGES VIEIRA
Data: 18 avr. 2023

Afficher le Résumé
Blockchain is a disruptive technology that offers a secure decentralized network, and allows for direct transactions between distributed entities, without the need for a trusted third party. Smart contracts (SmC) are executable codes hosted on the blockchain and enable the implementation of several decentralized applications, in different domains. However, due to the complexity of SmC construction, it is necessary to seek ways to facilitate and make this innovative development process robust. Many projects in the literature proposed solutions based on automatic code generation from high-level modeling, such as diagrams. However, this approach brings with it a constant need for a person to model the contracts. This paper presents Gifflar, a framework for generating SmC code on the fly, in such a way that it allows a system to write, compile and implement blockchain SmCs with an application still running, which reduces the frequent precision of the SmC developer and allows to further automate the SmC development process. One of the main tools is a component-structured library that implements design patterns to better define the responsibility of each of these parts. The Gifflar library offers an API that allows the developer to model, generate code and manage SmCs through methods that abstract to a certain level the SmC code using the JSON as a model to the code generation. In addition, project evaluations were carried out to validate the project: (i) a usability evaluation of the Gifflar library and (ii) a conceptual evaluation of Gifflar's application in other projects. As far as we know, this framework is one of the first to allow dynamically generating SmCs on the fly, thus contributing to the state of the art by approaching a new paradigm, where systems can act as smart contract developers.

Ibirisol Fontes Ferreira
FAIRNESS-ORIENTED MULTICAST ROUTING FOR DISTRIBUTED INTERACTIVE APPLICATIONS WITH COMPUTING ON THE NETWORK EDGE
Leader : GUSTAVO BITTENCOURT FIGUEIREDO
MEMBRES DE LA BANQUE :
GUSTAVO BITTENCOURT FIGUEIREDO
TIAGO DE OLIVEIRA JANUARIO
MAYCON LEONE MACIEL PEIXOTO
JOSE AUGUSTO SURUAGY MONTEIRO
Data: 27 avr. 2023

Afficher le Résumé
Facing the challenges of network services on the edge, such as routing considering quality of service, is a crucial issue for the current research efforts in the area of network. Multicast routing is an essential technique for delivering routing services with a high level of optimization from the perspective of operators and application providers when there are user groups.
Furthermore, the routing that considers latency constraints has obstacles to merging different conditions in the solution, mainly when there is a fairness perspective to be accomplished in the users' communication. This work aims to deal with this fairness requirement in multicasting, showing efficient solutions to increase the equilibrium in the routing by choosing better path options for fair group interaction.

Caiza Almeida Fortunato
ATAM-4SAS: A Method for Assessing Quality Attributes in Self-Adaptive Systems
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
LUCIANA LOURDES SILVA
CARLA ILANE MOREIRA BEZERRA
IVAN DO CARMO MACHADO
Data: 4 mai 2023

Afficher le Résumé
Self-adaptive Systems (SAS) can monitor themselves and their context. They can detect changes and react to unexpected conditions with minimal human supervision during their execution. One of the challenges behind developing SAS is dealing with the decision-making process while analyzing the tradeoff points among the multiple quality attributes (QA). In Software Engineering, a widely accepted method of evaluating QA goals in software projects is the Architecture Tradeoff Analysis Method (ATAM). However, despite its importance and wide acceptance, there are few reports of empirical studies on analyzing QA tradeoffs in SAS. In this sense, the present investigation proposes an adapted version of ATAM called ATAM-4SAS to deal with the particularities of SAS. To achieve the research goal, we employed the UPPAAL SMC (statistical verification model) to analyze a set of QA. To evaluate the feasibility of the proposed method, we performed an empirical study on the execution of the ATAM-4SAS in a SAS developed according to the MAPE-K model. This model encompasses the Monitoring, Analysis, Planning, and Execution phases. Such steps share a knowledge base (K), which is fundamental in supporting decision-making. We complemented the empirical evaluation by conducting a focus group, which sought to assess the perceived ease of use and the perceived usefulness of the ATAM-4SAS to support the strategic choice of QA in a SAS. As a result, we observed that most participants agreed that ATAM-4SAS provides adequate support for the strategic choice of QA in SAS.

RENATA DE SOUSA SANTOS
Process Smell 2.0: A catalogue of Bad Smells for software process using BPMN.
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
ANA PATRICIA FONTES MAGALHÃES MASCARENHAS
CLAUDIO NOGUEIRA SANT ANNA
RITA SUZANA PITANGUEIRA MACIEL
Data: 4 mai 2023

Afficher le Résumé
An explicitly specified software process improves the quality of the generated product.
The specification directs the path and steps to be followed so that the developed software
reaches the quality desired by the project. The software process evolves along with the
needs of the organization and the professionals who use it. It needs to be continuously

monitored and evaluated to maintain its quality. To evaluate a software process, simula-
tions or data obtained after execution are commonly used. However, in both cases, it is

difficult to predict problems in the software process in a given development project before
one or more executions. Therefore, Process Smells were proposed to enable problems in
the design of software processes to be identified before they were executed. The presence
of a Process Smell can negatively impact the quality of the Process, thus affecting the

quality of the software product. Initially, Process Smells were specified using the Soft-
ware Process Specification Metamodel (SPEM) notation. Although SPEM is a language

for the domain of process modeling, Business Process Model and Notation (BPMN) has
been widely used for processes in general, with high popularity both in the market and
academia. In this sense, the specification of a new catalog can allow Process smells to

be understood independently of the language domain. Still, in the context of process im-
provement, just like SPEM, although BPMN is used to improve the understanding of a

process, the specification of a process can be done inappropriately, injuring the necessary
factors for the quality of a process of software. In this context, this research aimed to
specify a Process Smells catalog to support the identification of anomalies in software
processes specified with BPMN based on the (SANTOS; MACIEL; SANT’ANNA, 2018)

proposal. Initially, Process Smells were specified in BPMN, thus originating a new ca-
talog with 8 Process Smells, Process Smells 2.0. Thirty-two professionals evaluated the

specification through an interview study, which indicated that these professionals accep-
ted the proposed new Process Smells catalog. The results obtained in this study made it

possible to verify that the proposal to identify smells regardless of the language domain
makes sense. The second stage of the study proved to be more concurrent with SPEM.
The catalog specification is expected to support the identification of Process Smells in
software processes modeled using BPMN to indicate the points where the Process can be
improved, even before its first execution, avoiding problems that negatively affect process
quality attributes. Additionally, the new catalog is expected to expand the possibility of
using Process Smells.

Jamile de Barros Vasconcelos
EXPERIMENTAL EVALUATION OF EXTREME VALUE STATISTICS USAGE ON DETERMINING EXECUTION TIME BOUNDS FOR REAL-TIME SYSTEMS DESIGN
Leader : GEORGE MARCONI DE ARAUJO LIMA
MEMBRES DE LA BANQUE :
GEORGE MARCONI DE ARAUJO LIMA
TATIANE NOGUEIRA RIOS
VERONICA MARIA CADENA LIMA
Data: 30 mai 2023

Afficher le Résumé
Real-time systems (RTS) are those whose actions are subject to time constraints, defined in terms of execution time deadlines. A hard RTS is usually designed to never miss a deadline, as in the case of avionics and space systems. To guarantee this it is essential to know the maximum time that each task takes to execute, a parameter known as Worst Case Execution Time (WCET). Estimating the WCET of a task is not simple and, in the case of modern architectures, it is subject to unpredictable influences caused by different hardware and software elements, which prevent the derivation of an absolute WCET value. One of the most used techniques in these cases is the Measurement-Based Probabilistic Timing Analysis (MBPTA) method based on the Extreme Values Theory (EVT) statistical area, for estimating worst-case probability distributions, called Probabilistic WCET (pWCET) . The present work aims to carry out an experimental analysis of the use of EVT via MBPTA for pWCET inference, presenting the process of applying the technique in a real RPi environment and exposing the challenges and flaws encountered during this process; a point rarely demonstrated in the literature. The partial results indicate that EVT is robust, but that has weaknesses because it does not always produce adequate models and
coherent pWCET results. This work is part of the Kepler project, a cooperation between UFBA and INRIA-Paris.

TIAGO DA CONCEICAO OLIVEIRA
A simulation-based iterated local search metaheuristic for pump scheduling in water distribution networks
Leader : RAFAEL AUGUSTO DE MELO
MEMBRES DE LA BANQUE :
CELSO DA CRUZ CARNEIRO RIBEIRO
ISLAME FELIPE DA COSTA FERNANDES
RAFAEL AUGUSTO DE MELO
Data: 17 juil. 2023

Afficher le Résumé
In a water distribution network, the electricity amount to operate the pumps can achieve 90% of the total electricity consumed. The amount charged for electricity consumption can differ at each time of the day. Therefore, scheduling the pump’s operation at opportune times can reduce energy costs. Optimal or near-optimal pump scheduling is not trivial given the nonlinear constraints of the WDN, which include the pump scheduling problem in the NP-hard class. The pump scheduling problem consists of obtaining the lowest operating monetary cost, guaranteeing that water is delivered to all demand points, and without violating the physical constraints of the water distribution network. This work proposes heuristic methods based on simulations, combining them in a metaheuristic Iterated Local Search (ILS). Computational experiments show that the proposed approach is promissory, obtaining the best results using the binary representation with the restrictions presented in this work, and when compared to other pump scheduling representations, the values obtained for the Vanzyl instance reached solutions that deviate by only 0.73% of the best-known value and at only 2.02% value for Richmond.

Edmilson dos Santos de Jesus
MACHINE LEARNING MODELS FOR WATER DEMAND FORECAST IN THE METROPOLITAN REGION OF SALVADOR, BAHIA
Leader : GECYNALDA SOARES DA SILVA GOMES
MEMBRES DE LA BANQUE :
ANDERSON LUIZ ARA SOUZA
GECYNALDA SOARES DA SILVA GOMES
PAULO HENRIQUE FERREIRA DA SILVA
Data: 31 juil. 2023

Afficher le Résumé
The objective of this work is to proposes a new hybrid SVR-ANN model for water demand forecasting. Where an adaptation of the methodology proposed by Zhang (2003) is used to decompose the time series of 10 reservoirs that supply the Metropolitan Region of Salvador (RMS). The data used are from the historical consumption from January/2017 to February/2022, obtained from the local supply company, Empresa Baiana de Águas e Saneamento, and meteorological data obtained from the National Institute of Meteorology of Brazil. The results demonstrated the feasibility of using the proposed model, compared to other traditional models such as the Multilayer Perceptron (MLP), Support Vector Regression (SVR), Short Long Term Memory (LSTM) and Autoregressive and Integrated Moving Average (ARIMA).

Dárcio Santos Rocha
IDENTIFICATION OF EVENT-TIME TEMPORAL RELATIONS IN PORTUGUESE: A RULES-BASED APPROACH WITH ASSOCIATIVE CLASSIFICATION
Leader : MARLO VIEIRA DOS SANTOS E SOUZA
MEMBRES DE LA BANQUE :
MARLO VIEIRA DOS SANTOS E SOUZA
RERISSON CAVALCANTE DE ARAUJO
ROBESPIERRE DANTAS DA ROCHA PITA
Data: 14 août 2023

Afficher le Résumé
This work aims to develop a computational method to identify types of temporal relations between events and temporal expressions in texts written in Portuguese. In order to achieve this goal, rule learning techniques will be employed to discover the best combinations of available linguistic information, formulating decision rules that can efficiently identify the types of temporal relations between events and temporal expressions. Most related works adopted a machine learning-based approach, while only one used a hybrid approach, combining manual rules. The methodology proposed in this work consists of a rule-based approach, which incorporates lexical, morphosyntactic and contextual information, Reichenbach tenses, temporal signals and knowledge about the world, in addition to TimeML annotations in the corpus. Unlike a purely machine learning approach, the rule sets generated by our method allow the combination of rules generated by different algorithms, or the combination of complete sets, which can result in better performance. In short, the method takes event/time expression pairs as input and uses a filtering strategy to select the pairs most likely to have been annotated in the corpus. It then applies sets of rules to each pair to identify the type of existing temporal relationship and a data augmentation strategy to calculate the temporal closure of all pairs and their respective identified relationships. In preliminary experiments, we proposed an initial set of manual rules for the Portuguese language. However, the results obtained showed that this set was limited, resulting in low coverage and consequently low accuracy. The maximum value achieved was 45.1% accuracy and 34.1% coverage in the test data. To improve these results, we propose to incorporate rule learning techniques to the method, aiming to increase the set of rules. These techniques are able to handle noisy data well, work well on unseen data and generate more efficient rules, in addition to offering competitive performance and working efficiently. With this incorporation, we hope that the proposed experiments will produce a set of rules capable of identifying types of event-time temporal relations efficiently and achieving superior results. This will contribute to the advancement of the state of the art in the area, in addition to disseminating the research carried out and contributing to the scientific community.

Davi Lima Alves
A SIDECHAIN PLATFORM FOR RESTAURANT E-COMMERCE TRANSACTIONS
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
ALLAN EDGARD SILVA FREITAS
CASSIO VINICIUS SERAFIM PRAZERES
LEOBINO NASCIMENTO SAMPAIO
MANOEL GOMES DE MENDONCA NETO
ROBESPIERRE DANTAS DA ROCHA PITA
Data: 22 août 2023

Afficher le Résumé
Ordering food online is present in people's lives every day, however, this procedure requires sending customer data to different restaurant systems, and often these restaurants share sensitive information with their partners without authorization, creating a security risk when sharing such information. data. Also, in many cases, when a person travels to a country other than their origin and wants to order food, an exchange transaction for local currency is required, which is expensive. Blockchains have revolutionized the way of making payments online and intrinsically secure, eliminating the centralizing and authorizing entity, such as Bitcoin, the first blockchain used on a large scale. However, the scientific literature records the difficulty in view of the high frequency of transactions in Bitcoin, the high transaction time, and also its high cost per transaction, in addition to the limitation of the Script language for application development. In this step, the Ethereum blockchain emerges as an alternative to Bitcoin, in an environment of more complex applications with a higher frequency of applications and number of transactions per second, but still far from the performance of solutions such as credit cards, which disadvantages this “chain of blocks”. ” compared to traditional payment solutions such as credit cards, which also had high transaction cost rates for a long time. From the study of this context, we propose a payment model for retail transactions through the blockchain based on a Proof of Concept (POC) aimed at restaurants using the Software Development Kit (SDK) of the Lisk blockchain. POC Lisk Restaurante represents an integration of Lisk blockchain in the food industry allowing customers to buy food quickly and securely through sidechains - exclusive blockchains for special restaurant-type transactions, without intermediation and with lower fees when using a single crypto asset in the world : the LSK. We performed an empirical study of the proposed solution in which the performance of block transactions, transaction fees, auditability, solution scale, privacy and security in sending messages in transactions is evaluated. The results were used in comparison with data from the literature of Bitcoin, Ethereum, and Multichain blockchains. The results indicate that the special transactions used at Lisk Restaurante allow guaranteeing data privacy for its customers, since only the recipient of a transaction can access sensitive sender data. Thus, through research, literature review and performance evaluation experiments, it was observed that the sidechain solution for restaurants demonstrated the efficiency of blockchain technology in providing global services at a lower cost than common means of payment; also, when compared to their transaction fees with other blockchains like Bitcoin or Ethereum, their values are lower and the privacy of their customers is maintained. Also, our solution has shown an increase in the capacity of Food transactions in a single block and better scalability through sidechains, which accept transactions specific to the business involved, which differs from single blockchains such as Bitcoin or Ethereum that execute transactions for different purposes, also surpassing the first version of the Proof of Concept (PoC) Lisk Restaurante, exceeding twice the capacity of number of transactions or frequency, in a block. Finally, experiments were carried out with the Lightning network, an off-chain technology that works on top of Bitcoin, being a scalability solution for Bitcoin. The evaluation of experimental results of the Lisk sidechain solution and the Lightning network demonstrated the differences of each technology and the advantages and disadvantages of each one were praised. Thus, it was observed that the Lightning network technology can also be used in the retail industry, but without the possibility of customizing transaction costs.

João Medrado Gondim
INCREASING THE IMAGE CAPTIONING MODELS IN PORTUGUESE THROUGH LINGUISTIC INFORMATION
Leader : DANIELA BARREIRO CLARO
MEMBRES DE LA BANQUE :
SANDRA ELIZA FONTES DE AVILA
DANIELA BARREIRO CLARO
TATIANE NOGUEIRA RIOS
Data: 23 août 2023

Afficher le Résumé
The increase in the number of applications that require accessibility, information retrieval and human-computer interaction has culminated in a growing need for automated generation of the description of an image. This automated description requires an identification of the scenario, characters and objects present and how these elements relate to each other. From these elements it becomes possible to generate a sentence in natural language describing the content of the image. The development of methods capable of automatically generating the sentences describing the image permeates a research area called textit{Image Captioning}. Most researches and datasets in the area of Image Captioning focus on the English language, developing models and building efficient state-of-the-art resources. Languages with few resources for development, such as Portuguese, require more research to achieve a descriptive and understandable sentence. However, only the agglomeration of several objects in the image does not generate a sentence in the Portuguese language. In this context, this work proposes the analysis and incorporation of linguistic resources that can guide the language model in generating a description that is more representative of the image and the sentence in Portuguese. Experiments were performed with the translation of datasets to generate the description in Portuguese. The results indicate that the morphological analysis of the outputs of an Image Captioning model, as well as the incorporation of grammatical classes during the training, will contribute to a better description of the image in Portuguese.

CLEBER BRITO SANTOS
Determinação de vizinhança em rede veiculares ad hoc: uma análise dos tempos limite para entrega de mensagens
Leader : ALINE MARIA SANTOS ANDRADE
MEMBRES DE LA BANQUE :
ALINE MARIA SANTOS ANDRADE
ALIRIO SANTOS DE SA
ALLAN EDGARD SILVA FREITAS
LEOBINO NASCIMENTO SAMPAIO
Data: 5 sept. 2023

Afficher le Résumé
A detecção de vizinhança em redes veiculares ad hoc envolve a determinação de veículos
vizinhos para uma organização colaborativa. Esse problema apresenta desafios em relação à
comunicação entre veículos em redes veiculares devido à natureza dinâmica da rede, que
provoca variações imprevisíveis na qualidade do canal de comunicação devido a fatores como
condições ambientais, interferências e congestionamentos. Nesta dissertação, abordamos a
detecção de vizinhança entre pelotões em redes dinâmicas, enfrentando desafios como a
movimentação variada dos veículos e as interferências de edificações. A organização em
pelotões representa um cenário relevante para o estudo em redes veiculares, uma vez que traz
benefícios na qualidade do trânsito, como a redução do consumo de combustível, a melhoria
do fluxo de tráfego e o aumento da segurança. Desenvolvemos um protocolo básico de
comunicação, projetado para a comunicação entre líderes de pelotões, que garante o
conhecimento mútuo entre eles, possibilitando a troca de informações essenciais para a
detecção de vizinhança e o funcionamento colaborativo dos pelotões. Verificamos o protocolo
utilizando o verificador de modelos PRISM. Utilizando este protocolo, realizamos experimentos
para determinar os tempos necessários que ocorrem entre o envio e o recebimento de
mensagens trocadas entre líderes de pelotões para a determinação de vizinhança. Realizamos
experimentos em canais confiáveis e não confiáveis e determinamos tempos limite de
progresso por meio de simulações utilizando o framework VEINS, que integra SUMO e
OMNET++. Os resultados dos experimentos forneceram informações sobre os tempos limite
de progresso em vários cenários que podem servir de referência para o desenvolvimento de
aplicações relacionadas à comunicação em VANETs.

Iury Gomes de Oliveira
CycleVis - A sport visualization approach for cyclist performance analysis
Leader : DANILO BARBOSA COIMBRA
MEMBRES DE LA BANQUE :
DANILO BARBOSA COIMBRA
FREDERICO ARAUJO DURAO
RENATO LIMA NOVAIS
Data: 14 sept. 2023

Afficher le Résumé
The technological evolution together with the growing mass production of data demand the creation of new mechanisms for processing and analysis in the most different areas of information. The same goes for the field of sports, which has a great popular appeal around the world and has been gaining more and more interest from the scientific community. Among the different sports genres, cycling has been gaining special attention in the community of information visualization and Visual Analytics over the years. In general, the focus of these works is on finding patterns to draw strategies, display statistical information, or analyze performance of groups of cyclists. However, there is a gap of studies that focus on paths not related to professional competitions or that consider attributes of climate and intensity in their analysis. In this context, this work presents CycleVis, a data visualization approach for cycling that allows a better understanding of the sports performance of cyclists. For this, intensity characteristics such as speed, heart rate and elevation were used, in addition to the geolocation of the runs/pedals. To validate the approach, a use case was developed and it was evaluated through a study with users, which resulted in a very positive evaluation for effectiveness and usability.

JOSÉ AUGUSTO DUARTE GOMES
MultiVisD3: A Visual Analytics Approach for Quality Analysis of Multidimensional Projections Using Multiple Coordinated Views
Leader : DANILO BARBOSA COIMBRA
MEMBRES DE LA BANQUE :
RAFAEL MESSIAS MARTINS
BRUNO PEREIRA DOS SANTOS
DANILO BARBOSA COIMBRA
Data: 29 sept. 2023

Afficher le Résumé
Multidimensional visualizations are graphical representations that assist in presenting multidimensional datasets. However, many of these techniques exhibit limited visual scalability as the number of dimensions increases, requiring ever-expanding visual space to accommodate the entire dataset. One way to address this problem is by utilizing Multidimensional Projection techniques, which perform dimensionality reduction while aiming to preserve the patterns of the original data in the projected space. However, it is highly likely that errors and distortions may be present in the projection layouts. A challenge in researching this topic is to measure and visualize the errors and distortions embedded in the projection mappings. In this regard, this article introduces MultiVisD3, a Visual Analytics approach to analyzing the quality of Multidimensional Projections. This approach comprises an interactive visual dashboard containing multiple coordinated views to facilitate understanding and comparison of error metrics between two projections. Other visualization techniques such as Treemap and tabular views support the extraction of information regarding projection quality. Finally, the proposed approach underwent user evaluation using well-known datasets to analyze functionality and usability aspects. The results demonstrated that users positively evaluated the proposed approach. More than 90% of the participants correctly completed eight out of nine tasks assigned to them, and over 90% of them approved of usability aspects related to interactions and the arrangement of visualizations.

LEONARDO RODRIGUES RIBEIRO
MetaLProjection: An Approach for Recommending Dimensionality Reduction Algorithms Using Meta-Learning
Leader : DANILO BARBOSA COIMBRA
MEMBRES DE LA BANQUE :
THIAGO FERREIRA COVÕES
DANILO BARBOSA COIMBRA
RICARDO ARAUJO RIOS
Data: 6 oct. 2023

Afficher le Résumé
Data visualization techniques have significant potential for analysis, summarization, and comprehension to facilitate information extraction. Recent areas such as Visual Analytics and Data Science underscore their importance, especially in analyzing complex datasets. In this sense, Multidimensional Projection techniques are particularly used for visual analysis of high-dimensional datasets because they perform dimensionality reduction and, consequently, exhibit better scalability in terms of the number of attributes/dimensions. However, there is a wide variety of these projection techniques, and determining the most suitable one for discovering visual patterns of information in one or multiple datasets is not a trivial task. While works in the literature test and compare different techniques on datasets with distinct characteristics, they do not do so systematically to assist user decision-making. In this context, this research leverages meta-learning to classify and recommend multidimensional projections, considering specific evaluation metrics from a knowledge base comprising over 500 distinct datasets. To evaluate the approach, we observed i) the relationship between the meta-attributes of all datasets, ii) the generation of a ranking containing the performance of the selected projection techniques, and iii) the accuracy of the recommendation of these techniques. Finally, the results obtained demonstrate that the developed approach effectively contributes to the selection and recommendation of multidimensional projection techniques.

FELIPE REBOUCAS FERREIRA ABREU
An Intelligent Self-Configuring Recommender System as a Service
Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
CLAUDIO NOGUEIRA SANT ANNA
ROSALVO FERREIRA DE OLIVEIRA NETO
Data: 28 nov. 2023

Afficher le Résumé
In today's dynamic digital realm, the plethora of listing services, spanning from music platforms to product recommenders and social media content suggestions, often leaves users searching for items that truly align with their tastes. Addressing this intricacy, the rise of Recommender Systems has proven invaluable. These systems efficiently filter vast data to align items with individual preferences, enhancing user choices. This work centers on the creation of an advanced Recommender Systems API. Distinctively crafted, this API boasts universal accessibility and an uncomplicated deployment procedure. As the foundation for various Web Services, the API draws strength from the stalwart REST architecture. It is designed with a commitment to modularity, championing adaptability and flexibility. The API processes user data and queries to provide tailor-made recommendations quickly. Performance evaluations illuminated the API's commendable accuracy. It particularly shone with smaller datasets, displaying impressive data processing and algorithm execution times. The API manifested exceptional efficiency and resilience under specific test conditions, including cloud environments, especially notable in extensive 16,000-item dataset scenarios. The API is more than a tool; it paves the way for personalized digital experiences, showcasing its prowess in CRUD operations and tailored recommendations. The user evaluation phase encompassed a varied demographic: novice to experienced developers. Over half had considerable software development experience, and a significant percentage had prior engagements with coding recommender systems. With diverse knowledge of recommender libraries, most feedback praised the API's effectiveness. 81% valued the recommendations, and many felt confident in its filtering techniques. The highlight of this work is the Recommender System API's versatility. Despite positive feedback, users suggested improvements in documentation, data security, and features. These insights will shape future API refinements and user experience. Participants' enthusiastic engagement and feedback underscore the API's potential to enhance applications requiring a recommendation system, especially for developers who are perhaps less versed in the theoretical nuances. The solid research foundation and participant dedication highlight the API's potential for broader adoption by developers.

GENICLEITO CARVALHO BELTRAO GONCALVES
Granularity to Ensure Interpretability of the Fuzzy Rules
Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
MARCOS EVANDRO CINTRA
HELOISA DE ARRUDA CAMARGO
TATIANE NOGUEIRA RIOS
Data: 30 nov. 2023

Afficher le Résumé
The computational representation of human knowledge, when composed of imprecise data, is a task facilitated by the use of systems that use resources from set theory and fuzzy logic. Such systems use rules that allow mapping the knowledge obtained from data into an easy-to-understand linguistic representation. The fuzzy rule base that constitutes this type of system can be generated by a specialist or through methods that consider the characteristics of the data itself, reducing the need for a specialist in this creation process. With the appropriate adjustments and use of automated methods, it is possible to increase the interpretability of these rules without reducing the system's accuracy. In this sense, the rule base of a fuzzy system can be improved through the principle of justifiable granularity to adjust the fuzzy sets representative of the data. Therefore, in this master's thesis, the method called GEnI-FR (Granularity to Ensure Interpretability of the Fuzzy Rules) is presented, which makes adjustments and refinements in the process of generating fuzzy rules, achieving a balance between interpretability and precision, adjusting sets fuzzy based on the characteristics of the data itself. GEnI-FR presents itself as a promising method as it provides a reduction in the number of fuzzy rules while maintaining the same levels of accuracy when compared to other state-of-the-art methods.

JALISSON DOS SANTOS HENRIQUE
MOTIVATIONS FOR APPLYING THE EXTRACT METHOD REFACTORING: A STUDY BASED ON COMMIT MESSAGES
Leader : CLAUDIO NOGUEIRA SANT ANNA
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
CLAUDIO NOGUEIRA SANT ANNA
RAPHAEL PEREIRA DE OLIVEIRA
Data: 11 déc. 2023

Afficher le Résumé
Refactorings are operations performed on source code aimed at improving the maintainability of a software system. Although literature contains a high number of studies on refactorings, there are only few works that investigate the motivations reported by developers to perform refactorings. Therefore, this research aims to investigate the motivations behind extract method refactorings in real systems. To achieve this, we conducted two empirical studies: a preliminary study involving a single software system, and another study considering multiple real systems. The studies were based on mining software repository commits in which extract method refactorings occurred. Essentially, the goal was to analyze commit messages in order to capture developers' motivations for performing such refactorings. The results identified 16 different motivations for applying extract method. Additionally, it was observed that only 16% of the refactorings had an explicit intention of improving code quality. It was also identified that refactorings with the intention of improving code quality occurred more frequently in methods with size higher than 61 lines of code and less frequently in methods smaller than 20 lines of code.

MATEUS CARVALHO DA SILVA
Application of biased random-key genetic algorithm and formulations for the Grundy coloring problem and the connected Grundy coloring problem
Leader : RAFAEL AUGUSTO DE MELO
MEMBRES DE LA BANQUE :
PEDRO HENRIQUE GONZÁLEZ SILVA
MAURICIO GUILHERME DE CARVALHO RESENDE
FREDERICO ARAUJO DURAO
RAFAEL AUGUSTO DE MELO
Data: 13 déc. 2023

Afficher le Résumé
Given a graph G, its Grundy number $\Gamma(G)$ defines the worst-case behavior for the well-known and widely used first-fit greedy coloring heuristic. Specifically, $\Gamma(G)$ is the largest $k$ for which a $k$-coloring can be obtained with the first-fit heuristic. The connected Grundy number $\Gamma_c(G)$ gives the worst-case behavior for the connected first-fit coloring heuristic, that is, one in which each vertex to be colored, except the first, is added adjacent to an already colored vertex. Both problems are NP-hard. In this master's thesis, we present heuristic and exact approaches to the Grundy coloring problem and the connected Grundy coloring problem, which are optimization problems consisting of obtaining the Grundy number and the connected Grundy number, respectively. This study proposes the use of a algorithm Biased Random-Key Genetic Algorithm (BRKGA) and the use of integer programming formulations using a more traditional (standard) approach and a representative one. A new combinatorial upper bound is also proposed that is valid for both problems and an algorithm using dynamic programming for its calculation. The computational experiments show that the new upper bound can improve over a well-established combinatorial bound available in the literature for several instances. The results also evidence that the formulation by representatives has an overall superior performance than the standard formulation, achieving better results for the denser instances, while the latter performs better for the sparser ones to the Grundy coloring problem. However, we show that these types of integer programming formulations are computationally impractical for the connected version. Furthermore, the BRKGA can find high-quality solutions for both problems and can be used with confidence in large instances where the formulations fail for the Grundy coloring problem.

JUVENAL CONSTANTINO DE MACÊDO JÚNIOR
Investigating the association between change bursts and build status
Leader : RODRIGO ROCHA GOMES E SOUZA
MEMBRES DE LA BANQUE :
RODRIGO ROCHA GOMES E SOUZA
IVAN DO CARMO MACHADO
JOSÉ AMÂNCIO MACEDO SANTOS
TIAGO OLIVEIRA MOTTA
Data: 18 déc. 2023

Afficher le Résumé
Commit bursts are sequences of changes made by developers that occur in code within a short period of time. In projects that adopt the practice of Continuous Integration (CI), every time a modification is completed, a new version of the code is created, generating a new build. In this new version, the changes made are verified automatically, running unit tests and reporting the result of failure or success of build to the developers. In this sense, the objective of this research is to carry out an empirical study to verify the association between bursts of commits and failures in build. The first step was to carry out an empirical study based on the mining of deposits, through which the relationship between bursts of commits and construction failure was identified. After this study, a survey was followed to evaluate the results obtained, considering the opinions of developers who work with CI. Among the discoveries made, the results of the first study show that in some projects the success rate of builds after bursts of changes tends to decrease. However, we cannot generalize the results to all projects since, in most of the studied projects, the difference was not statistically significant. In the second study, most survey participants agree that the proximity of a project's delivery data is a factor responsible for bursts of commits and build failure. However, we cannot generalize the results, since some developers had small disagreements about who was responsible for the construction failure. Thus, the results of this work intend to contribute to the community of developers who use CI, helping them to reduce failures in build, facilitating future work on good development practices.

ANGELA PEIXOTO SANTANA
A Characterization Study of Technical Debt in Software in Industry
Leader : CHRISTINA VON FLACH GARCIA CHAVEZ
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
IVAN DO CARMO MACHADO
CRESCENCIO RODRIGUES LIMA NETO
TIAGO OLIVEIRA MOTTA
Data: 20 déc. 2023

Afficher le Résumé
Context. The term Technical Debt (TD) denotes the results and consequences of the decision to prioritize the rapid delivery of software, focusing on short-term effectiveness at the expense of its quality. Technical debts incurred during the development and maintenance of software must be identified, characterized, and repaid in the future to avoid compromising its quality and the ease of its evolution.
In software projects developed in the industry, there is a recognized need to identify and characterize TD throughout the software lifecycle, aiming to manage it and mitigate threats to the long-term success of software. Industry studies allow exploring different aspects of TD in practice and from various perspectives, such as those of developers and managers, enriching the characterization of TD.

Objectives. The main objective of this work is to investigate, in the context of an IT company, the perceptions of development team professionals regarding the concept of Technical Debt (TD) and how TD manifests in one of the developed software systems.

Methods. We conducted a study in the context of a software development company branch, comprising:

(i) a survey with employees engaged in software development at the branch;
(ii) a preliminary characterization of the Technical Debt (TD) in a software system based on the identification of TD using the static analysis tool adopted by the company, SonarQube;
(iii) semi-structured interviews with the development team of the analyzed software regarding the TD identified by SonarQube.

Results. The survey study received 32 valid responses. It was observed that the concept of Technical Debt (TD) was known to the majority of professionals who participated in the study; however, they exhibited little familiarity with the long-term consequences of TD. The characterization of the software system based on the automatic identification of TD was conducted and used as support for the interviews. The deadlines set for the implementation of requirements are notably short. The documentation, when not updated, becomes insufficient in the long term, while testing is relegated to the background in favor of faster deliveries.

Conclusions. A characterization of Technical Debt in the industry revealed that established deadlines are the predominant factor for development teams to decide to incur technical debt. The team often chooses this approach to meet delivery deadlines. Despite being a known topic, there is still a need to disseminate the long-term consequences of Technical Debt. However, despite the increasing research on technical debt, the field lacks consensus on its definitions and long-term impacts on the industry.

ADRIANO HUMBERTO DE OLIVEIRA MAIA
A Fog Computing-based Framework for Data Reduction in a Traffic Detection System for VANETs
Leader : MAYCON LEONE MACIEL PEIXOTO
MEMBRES DE LA BANQUE :
RODRIGO AUGUSTO CARDOSO DA SILVA
ROBERTO RODRIGUES FILHO
MAYCON LEONE MACIEL PEIXOTO
Data: 20 déc. 2023

Afficher le Résumé
With the growth in the number of vehicles in the world in recent years, it has become necessary to adopt technologies to deal with the consequences that this vehicle volume can generate for large cities, such as increased congestion on highways. Vehicular Ad-Hoc Networks (VANETs) present themselves as a promising technology in this scenario, helping to form vehicular networks capable of interconnecting vehicles and infrastructure to understand and deal with vehicle congestion. Considering this, the amount of data generated by this environment increases as the number of vehicles on the roads increases. Consequently, sending data from the vehicular environment to the structure that identifies congestion can be increasingly costly from the point of view of network use, potentially generating overloads and increased latency, making quick decision-making difficult. Therefore, in this work, we propose the construction of a Framework that aims to identify vehicle congestion, with an approach to reduce the data generated by a VANET in the fog layer and then send only the most relevant data to the cloud. for decision making. In addition to congestion detection, with historical data in time series format we perform congestion prediction using ARIMA. To work with data reduction, \textit{Framework} uses simple sampling algorithms and clustering techniques (DBSCAN and XMEANS). The results showed that the use of clustering algorithms in this Framework can achieve a significant level of accuracy in detecting traffic congestion together with a marked reduction in network usage.

IURY MAIA DE ALMEIDA
Predicting Player Churn in a Free-To-Play Game Using Game Analytics
Leader : RODRIGO ROCHA GOMES E SOUZA
MEMBRES DE LA BANQUE :
LYNN ROSALINA GAMA ALVES
RODRIGO ROCHA GOMES E SOUZA
TIAGO OLIVEIRA MOTTA
Data: 21 déc. 2023

Afficher le Résumé
The issue of player churn in free-to-play games poses a significant challenge in the electronic gaming industry. The growing popularity of these business models, where players can access the game for free, places a crucial emphasis on retaining these users to ensure the financial success and sustainability of the game. In this scenario, predictive analysis emerges as an essential tool to anticipate and understand the patterns of player churn.

This study began with a systematic literature review in the field of predictive models in game analytics, aiming to answer the main research question: How are predictive models applied in game analytics? The research was conducted based on a protocol that defined the objectives, research questions, and inclusion and exclusion criteria.

The main findings indicate that research on predictive models in game analytics has grown significantly since 2010, with a variety of machine learning techniques being applied. Furthermore, the most investigated prediction targets include the probability of winning, churn prediction, and player expertise. Regarding preprocessing techniques, several approaches were identified, such as Principal Component Analysis (PCA) and web scraping techniques.

We focused our research on churn prediction, initially by defining churn and establishing cutoff dates, considering multiple time windows for classifying players as churned or recurrent. The analysis addressed the validity threats of the work, including churn definition issues, class imbalance, and the use of techniques like SMOTE to balance the data.

Six machine learning models were evaluated, with an emphasis on metrics like accuracy, precision, recall, and AUC (Area Under the Curve). The 10-fold cross-validation technique was applied to validate the models, providing a more comprehensive view of their performance. The analysis of feature importance revealed which player characteristics were most relevant for churn prediction, although the interpretation of these features was highlighted as context-dependent.

Ultimately, the work offered promising insights into the prediction of player churn in free-to-play games but emphasized the need for careful approaches and contextual considerations to mitigate validity threats and ensure the generalization of models to different datasets and time periods.

Thèses

EMMANUEL SÁVIO SILVA FREIRE
Organizing the State of Practice on Technical Debt Prevention, Monitoring, and Payment in Software Projects
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
DANIELA SOARES CRUZES
MANOEL GOMES DE MENDONCA NETO
RENATO LIMA NOVAIS
THIAGO SOUTO MENDES
UIRÁ KULESZA
Data: 30 janv. 2023

Afficher le Résumé
Context: Technical debt (TD) describes the effects of immature artifacts on software development that can bring benefits in the short term but may have to be paid with interest in the long term. TD management balances short-term and long-term goals, supporting development teams to decide on the need and the best time to eliminate the debt. TD management activities include prevention, monitoring, and payment. Through prevention, it is possible to prevent teams from incurring TD, while monitoring helps them follow the evolution of TD items concerning the cost-benefit of eliminating them or not, that is, paying the debt items. Knowing the practices used to prevent, monitor, and pay TD items can help development teams to choose the best practice to be used in their projects. Identifying the practice avoidance reasons (PARs) that lead to nonprevention, non-monitoring, and non-payment of TD can help teams understand which aspects need to be improved to enable TD management. Although the technical literature has investigated the prevention, monitoring, and payment of TD, current results only reflect the viewpoint of a small number of professionals and organizations. To achieve the benefits of TD management, it is necessary to investigate more deeply the practices and PARs associated with these TD activities.

Aims: This Ph.D. dissertation aims to investigate, through the continuous and independent replication of a family of surveys conducted globally, the state of practice on the prevention, monitoring, and payment of TD items in software projects.

Method: Initially, we conducted a literature review on the current state of research on TD and its prevention, monitoring, and payment. Then, we analyzed data collected by six replication teams from the InsighTD project, which is a family of globally distributed surveys on the causes, effects, and management of TD. From the body of knowledge resulting from the analysis of InsighTD data, we defined three artifacts: an updated version of the conceptual model for TD, a set of conceptual maps, and IDEA (Impediments, Decision factors, Enabling practices, and Actions) diagrams. Finally, we assessed these artifacts through case studies in academic and industrial settings.

Results: This Ph.D. dissertation presents the leading practices used to prevent, monitor, and pay off TD items and the PARs that justify the non-application of these practices. Regarding the prevention of TD, well-defined requirements, adopting good programming practices, and better project management are among the five most cited practices related to prevention, while short deadlines, ineffective management, and lack of predictability in the software development are among the five most cited PARs to justify the non-prevention of debt. About TD monitoring, TD item backlog, use of tools, and team meetings are among the five most cited practices related to monitoring, while lack of interest, focus on short-term goals, and lack of time are among the five PARs used to explain the non-monitoring of TD items. Regarding TD payment, code refactoring, investing effort in TD payment activities, and design refactoring are among the top five payment-related practices, while focusing on short-term goals, lack of organizational interest, and lack of time are among the five most cited PARs to explain the non-payment of TD. We update the conceptual model for TD by including the knowledge we learn from the state of practice and organize all practices and PARs along with their types, natures, and categories into maps and IDEA diagrams. From the conceptual model and TD payment map assessment, we found that they are well organized and provide valuable information to define strategies for TD management. The IDEA diagrams assessment provided positive evidence that the diagrams are easy to read and follow and can influence decisions on how to manage TD items.

Conclusion: Using the InsighTD data, this Ph.D. dissertation explores the state of practice on TD prevention, monitoring, and payment, revealing the primary practices used to perform these activities and the PARs that avoid their execution. All body of knowledge was organized into three artifacts that can drive new investigations on TD and support software practitioners in increasing their capabilities and reducing their issues in managing debt items.

Edson Mota da Cruz
DaRkaM: A Fog-Based Data Reduction Framework Applied to the Context of Urban Computing
Leader : MAYCON LEONE MACIEL PEIXOTO
MEMBRES DE LA BANQUE :
ADEMAR TAKEO AKABANE
DIONISIO MACHADO LEITE FILHO
GERALDO PEREIRA ROCHA FILHO
LOURENÇO ALVES PEREIRA JUNIOR
MAYCON LEONE MACIEL PEIXOTO
Data: 1 févr. 2023

Afficher le Résumé
The Intelligent Transport Systems (ITS) has a function to analyze the flow of vehicles on highways in order to identify any traffic anomalies, ensuring greater efficiency during the decision-making process. These systems can be based on an Ad-Hoc vehicular network (VANETs) able to integrate the elements of urban space through a distributed communication system. Similarly, ITS applications require constant monitoring of roads, and such monitoring aims to analyze, among other aspects, the variation of the vehicle density over time. In general, this process occurs by means of the periodic sending of situational data from the mobility environment to the cloud. Consequently, the data sets sent at high frequency to the cloud form a continuous data flow that should be processed in a real-time context. However, this dynamic implies in a progressive increase of the communication cost, in function of the volume of data transferred in the link between the fog and the cloud, increasing the risks of overload beyond increasing the latency during requests for services made available in the cloud. Therefore, this work proposes the development of a multilayer architecture for data reduction based on Fog Computing called DaRkaM, acronym in English for (Data Reduction Framework for Traffic Management). The strategy consists of using a monitoring model able to perform data reduction processes directly at the edge of the vehicular network. At the cloud layer, DaRkaM acts as a central controller, analyzing the geographic positions of vehicles that are received from a continuous data flow. These data are used to monitor and perform the traffic management processes addressed in this proposal. At the edge network, a data reduction module was designed to host different traffic monitoring strategies. This architecture favors the comparative analysis among different approaches, ranging from the use of algorithms based on simple sampling until clustering algorithms, in which the data reduction processes are structured based on the number of clusters. The results showed that the use of cluster-based algorithms, hosted in the data reduction core of the DaRkaM framework, are able to achieve high accuracy in monitoring and detecting traffic congestion, in addition, they are able to reach a significant reduction in communication cost, especially in overloaded scenarios.

DIEGO BRAGA MONTEIRO DE MOURA
Heterogeneous Memory Management for Graph Applications
Leader : VINICIUS TAVARES PETRUCCI
MEMBRES DE LA BANQUE :
PAUL M. CARPENTER
LUIS FILIPE NUNES QUARESMA DE OLIVEIRA
DANIEL MOSSÉ
ESBEL TOMÁS VALERO ORELLANA
GEORGE MARCONI DE ARAUJO LIMA
VINICIUS TAVARES PETRUCCI
Data: 9 mars 2023

Afficher le Résumé
The demand for memory has increased due to various applications such as big data, machine learning, social networks, and streaming analytics. DRAM technology is facing issues with scalability, energy, and costs, despite its low latency advantage. Non-Volatile Memory (NVM) is an alternative to DRAM scalability issues. Although NVM has high density, low cost, and low energy, its high latency prevents it from replacing DRAM. Heterogeneous memories like DRAM+NVM are likely to become common, and efficient data placement is an important research question. This PhD dissertation focuses on managing data placement for graph analytics applications. It studies machine learning models to predict application performance in a context with multiple applications and explores data mapping choices at the object or chunk level for a single application. The dissertation contributes to offline identification of performance-influencing object attributes, implementation of an offline scheduler at the chunk level, and implementation of an online scheduler at the chunk level.

Valéria Argôlo Rosa de Queiroz
Semio-participatory interaction design by elderly people: empathy and engagement in technology production
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
SIMONE BACELLAR LEAL FERREIRA
KAMILA RIOS DA HORA RODRIGUES
FAUSTO ORSI MEDOLA
ECIVALDO DE SOUZA MATOS
MARLO VIEIRA DOS SANTOS E SOUZA
Data: 27 mars 2023

Afficher le Résumé
The number of elderly people has increased considerably around the world. Thus, science has developed strategies to help people age healthily and maintain an active life. Digital technologies can contribute to this. However, these technologies are not always designed according to the real needs and expectations of the elderly. Encouraging the elderly to design computational technologies through open and participatory perspectives of interaction design has been shown to be a possible solution. However, engaging older people in this process and making them a design partner is still challenging. Most studies based on participatory practices found do not establish the engagement of potential users, especially in relation to elderly people. Therefore, this research aimed to investigate and develop techniques to engage the elderly in designing digital technologies through a semi-participatory interaction (co)design process, establishing empathy as a fundamental engagement element. To this end, an engagement model was initially developed to integrate the elements and variables of this model in a semi-participatory process. From this perspective, two case studies were carried out, the first in a face-to-face format and the second in a remote format, in a qualitative approach with elderly women. The results analyzed using the content analysis method confirmed the potentiality of the elements and variables to improve the engagement of the elderly person and, thus, provided the conception of an informative educational document in Guide format, highlighting the main techniques, activities and strategies with the purpose of making the conduction of a semi-participatory Interaction co-design process more practical and accessible. It is expected that these results may contribute to the expansion of semi-participatory interaction co-design processes from the perspective of the elderly person, besides offering epistemological elements for developing inclusive interaction design methods and processes.

Djan Almeida Santos
Comprehensibility of Source Code with Feature Dependency in Configurable Systems
Leader : CLAUDIO NOGUEIRA SANT ANNA
MEMBRES DE LA BANQUE :
ROHIT GHEYI
FLÁVIO MOTA MEDEIROS
CLAUDIO NOGUEIRA SANT ANNA
EDUARDO SANTANA DE ALMEIDA
IVAN DO CARMO MACHADO
Data: 28 mars 2023

Afficher le Résumé
Conditional compilation is often used to implement variability in configurable systems. This technique relies on #ifdefs to delimit feature code. Previous studies have shown that #ifdefs may hinder code comprehensibility. However, those studies did not explicitly take feature dependencies into account. Feature dependency occurs when different features refer to the same program element, such as a variable. Comprehensibility may be even more affected in the presence of feature dependency, as the developer must reason about different scenarios affecting the same variable. Our goal is to understand how feature dependency affects the comprehensibility of configurable system source code. We conducted four complementary empirical studies. In Study 1, forty-six developers responded an online survey. They executed tasks in which they had to analyze programs containing #ifdefs with and without feature dependency. However, feature dependencies may be of different types depending on the scope of the shared variable. In Study 1, we were not concerned with different types of dependency. Thus, in Study 2, we carried out an experiment in which 30 developers debugged programs with different types of feature dependency. Each program included a different type of feature dependency: global, intraprocedural or interprocedural. We used an eye-tracking device to record developers' gaze movements while they debugged programs. However, feature dependencies do not differ from each other only in terms of types. They can also present differences in terms of number of dependent variables and degree of variability, among others. To address those characteristics, we complemented Study 1 and 2 by means of Studies 3 and 4. In Study 3, we executed a controlled experiment with 12 participants who analyzed programs with different numbers of dependent variables and number of uses of dependent variables. In Study 4, we carried out an experiment in which 12 developers analyzed programs with different degrees of variability. Our results show that: (i) analyzing programs containing #ifdefs and feature dependency required more time than programs containing #ifdefs but without feature dependency, (ii) debugging programs with #ifdefs and global or interprocedural dependency required more time and higher visual effort than programs with intraprocedural dependency, (iii) the higher the number of dependent variables the more difficult it was to understand programs with feature dependency, and (iv) the degree of variability did not affect the comprehensibility of programs with feature dependency. In summary, our studies showed that #ifdefs affected comprehensibility of configurable systems in different degrees depending on the presence or not of feature dependency, on the type of feature dependency, and on the number of dependent variables.

LEANDRO OLIVEIRA DE SOUZA
AN AUTOMATED SOFTWARE TRANSPLANTATION APPROACH FOR REENGINEERING OF SYSTEMS INTO PRODUCT LINES
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
CLAUDIO NOGUEIRA SANT ANNA
EDUARDO SANTANA DE ALMEIDA
GUSTAVO HENRIQUE LIMA PINTO
LEOPOLDO MOTTA TEIXEIRA
RODRIGO ROCHA GOMES E SOUZA
Data: 30 juin 2023

Afficher le Résumé
Although Software Product Lines (SPL) offer the potential for order-of-magnitude improvements in software engineering performance, the up-front cost, level of effort, assumed risk, and latency required to make the transition to SPL are prohibitive adoption barriers for many organizations that could otherwise benefit from reusing of their existing systems. The SPL adoption from an extractive model based on a reengineering process of existing systems into SPL is an active research topic with tangible benefits in practice. It allows software development companies to preserve their investment and aggregate knowledge obtained while developing their individually developed systems portfolio.

Despite these benefits, adopting an extractive product line approach still requires a considerable upfront investment and is more complex to evolve than one-off systems. Because of these drawbacks, software companies refrain from adopting SPL, resorting to an ad-hoc practice of clone-and-own. To speed the conversion to and maintenance of SPL, we present FOUNDRY, a Software Transplantation (ST) approach that guides transplanting and merging features in a product line from existing systems. It is the first approach for SPL that automates all stages of product line construction using the ST technique. We realized Foundry in a software transplantation tool for SPL that automates identifying, adapting, and transferring features from existing systems to a standard product base. Its code transfer mechanism between different systems allows it to be used not only for generating product lines but also as an alternative to the clone-and-own technique for system specialization. We compared our proposal with the existing reengineering solutions to demonstrate evidence that the ST is an alternative with the potential for application in reengineering existing systems to SPL. In the search for more concrete evidence, we evaluated two case studies where two products were generated by transplanting features from three real-world systems.

Moreover, we experimented with comparing Foundry’s feature migration with manual effort. We show that Foundry automatically migrated features across codebases 4.8 times faster, on average, than the average time a group of SPL experts took to accomplish the task. Although preliminary, our evaluation provides evidence to support the claim that ST for Software Product Line Engineering (SPLE) is a feasible and promising new research direction.

SILVIO LUIZ BRAGATTO BOSS
Avaliação automática de mapas conceituais para identificar indícios de aprendizagem significativa
Leader : ALINE MARIA SANTOS ANDRADE
MEMBRES DE LA BANQUE :
DAVIDSON CURY
ALINE MARIA SANTOS ANDRADE
CREDINE SILVA DE MENEZES
ECIVALDO DE SOUZA MATOS
LYNN ROSALINA GAMA ALVES
MARLO VIEIRA DOS SANTOS E SOUZA
Data: 1 sept. 2023

Afficher le Résumé
Mapas Conceituais são ferramentas pedagógicas amplamente utilizadas para facilitar o
processo de aprendizagem. De forma geral, podem ser definidos como diagramas com
organização hierárquica que apresentam relações entre conceitos. São um importante
instrumento para o ensino e avaliação do conhecimento por representarem, entre outras
características, a estrutura cognitiva do aprendiz. Os mapas conceituais estão fundamentados
na Teoria da Aprendizagem Significativa de Ausubel, assim, é possível avaliar os mapas com
base em aspectos dessa Teoria, como as categorias-chave da diferenciação progressiva e
reconciliação integrativa. Estas representam informações sobre o processo de construção do
conhecimento sendo, portanto, fundamentais para uma avaliação de aprendizagem
significativa. As categorias-chave podem ser identificadas no processo de elaboração dos
mapas, no entanto, este processo não vem sendo muito explorado na literatura. A maioria dos
trabalhos focam na análise comparativa entre o mapa do aprendiz e um mapa de referência
para verificar se os conceitos e relações estão representados corretamente. A análise
comparativa entre mapas é importante, mas não suficiente para uma avaliação da
aprendizagem significativa, visto que as categorias-chave bem como outras informações da
estrutura cognitiva do aprendiz, ou seja, como ele constrói e reconstrói seu conhecimento no
decurso do seu processo de aprendizagem, são essenciais para identificar indícios de
aprendizagem. Na literatura, há trabalhos que analisam a estrutura do mapa conceitual para
classificar a qualidade da aprendizagem em aprendizagem mecânica, significativa ou não-
aprendizagem, mas não analisa o processo de construção do mapa e nenhuma discussão é
feita sobre a automatização da avaliação. Pesquisas no campo da Informática na Educação têm
investigado o problema de se avaliar a aprendizagem, comparando mapas conceituais de
aprendizes com um mapa de referência, por meio de ferramentas computacionais. Há diversas
soluções que envolvem análise estrutural utilizando padrões de comparação e análise
semântica por meio de ontologias de domínio. Esta tese de doutorado apresenta um modelo
e um método para avaliação de mapas conceituais que considera avaliação sintática e
semântica bem como o processo de construção de mapas, propondo critérios para identificar
indícios de aprendizagem mecânica, significativa e não-aprendizagem. São consideradas
avaliação qualitativa e quantitativa. Um framework conceitual é especificado fornecendo um
arcabouço sobre a organização e estruturação de avaliação de mapas conceituais para apoiar a
construção de sistemas computacionais para a sua automatização. Foram desenvolvidos
estudos de caso, que serviram como uma primeira avaliação do método proposto.

GABRIELA OLIVEIRA MOTA DA SILVA
EXPLOITING LOD-BASED SIMILARITY PERSONALIZATION STRATEGIES FOR RECOMMENDER SYSTEMS
Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
FREDERICO ARAUJO DURAO
LAIS DO NASCIMENTO SALVADOR
NATASHA CORREIA QUEIROZ LINO
ROSALVO FERREIRA DE OLIVEIRA NETO
Data: 28 sept. 2023

Afficher le Résumé
Linked Open Data (LOD) is a cloud of freely accessible and interconnected datasets that encompass machine-readable data. These data are available under open Semantic Web standards, such as Resource Description Framework (RDF), SPARQL Protocol, and RDF Query Language (SPARQL). One notable example of a LOD set is DBpedia, a crowd-sourced community effort to extract structured information from Wikipedia and make this information openly available on the Web. The semantic content of LOD and the advanced features of SPARQL has opened unprecedented opportunities for enabling semantic-aware applications. LOD-based Recommender Systems Recommender Systems usually leverage the data available within LOD datasets such as DBpedia to recommend items such as movies, places, books, and music to end-users. These systems use a semantic similarity algorithm that calculates the degree of matching between pairs of resources by counting the number of direct and indirect links between them, the length of the path between them, or the hierarchy of classes. Conversely, calculating similarity in RDF graphs could be difficult because each resource can have hundreds of links to other nodes. Not all of them are semantically relevant or can be applied to all resources in the graph. This can lead to the well-known matrix sparsity problem. Nevertheless, some effort has been made to select subsets of features, i.e., links, which are more helpful to computing similarity between items of a graph dataset, reducing the matrix dimension. Despite several studies in this field, there is still a lack of solutions applied to the personalization of feature selection tasks. In this context, we propose personalized strategies to improve semantic similarity precision in LOD-based Recommender Systems, including i) applying a feature selection approach to filter the best features for a particular user, ii) personalizing the RDF graph by adding weights to the edges, according to the user’s previous preferences; and iii) exploiting the similarity of literal properties as well as the links from the user model. The evaluation experiments used combined data from DBpedia and MovieLens and DBpedia and LastFM datasets. Results indicate significant increases in top-n recommendation tasks in Precision@K (K=5, 10), Map, and NDCG over non-personalized baseline similarities methods such as Linked Data Semantic Distance (LDSD) and Resource Similarity (ReSim). The results show that the LOD-based strategies of user model personalization.

SAULO ANTONIO DE LIMA MATOS
Invariants and Neighborhood Structures for 1-factorizations of complete graphs
Leader : RAFAEL AUGUSTO DE MELO
MEMBRES DE LA BANQUE :
RAFAEL AUGUSTO DE MELO
TIAGO DE OLIVEIRA JANUARIO
CELSO DA CRUZ CARNEIRO RIBEIRO
MARCIO COSTA SANTOS
VINICIUS FERNANDES DOS SANTOS
SEBASTIÁN ALBERTO URRUTIA
Data: 3 oct. 2023

Afficher le Résumé
A 1-factorization is a partition of the edge set of a graph into perfect matchings. The concept of 1-factorization is of great interest due to its applications in modeling sports tournaments. Two 1-factorizations are said to be isomorphic (belong to the same isomorphism class) if there exists a bijection between their sets of vertices that transforms one into the other. The non-isomorphic 1-factorization search space is a graph in which each isomorphism class is represented by a vertex and each edge that connects the vertices $\mathcal{F}_a$ and $\mathcal{F}_b$ corresponds to a move in a neighborhood structure, which from a 1-factorization isomorphic to $\mathcal{F}_a$ generates a 1-factorization isomorphic to $\mathcal{F}_b$. An invariant of a 1-factorization is a property that depends only on its structure such that isomorphic 1-factorizations are guaranteed to have equal invariant values. An invariant is complete when any two non-isomorphic 1-factorizations have distinct invariant values. This thesis reviews seven invariants used to distinguish non-isomorphic 1-factorizations of $K_{2n}$ (complete graph with an even number of vertices). Additionally, considering that the invariants available in the literature are not complete, we propose two new ones, denoted lantern profiles and even-size bichromatic chains. The invariants are compared regarding their sizes and calculation time complexity. Furthermore, we conduct computational experiments to assess their ability to distinguish non-isomorphic 1-factorizations. To accomplish that we use the sets of non-isomorphic 1-factorizations of $K_{10}$ and $K_{12}$, as well as the sets of non-isomorphic perfect 1-factorizations of $K_{14}$ and $K_{16}$.
We also consider algorithmic and computational aspects for exploring the generalized partial team swap (GPTS) neighborhood, a neighborhood structure for round-robin sports scheduling problems recently proposed in the literature. In this regard, we present a framework to explore the GPTS neighborhood. Furthermore, a discussion is presented on how this neighborhood structure increases the connectivity of the search space defined by non-isomorphic 1-factorizations of $K_{2n}$ (for $8 \le 2n \le 12$) when compared to other neighborhood structures.

RENATA MARIA DE SOUZA SANTOS
A Multi-Case Study of Software Engineering Practices in Early-Stage Startups
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
GLEISON DOS SANTOS SOUZA
KIEV SANTOS DA GAMA
CHRISTINA VON FLACH GARCIA CHAVEZ
IVAN DO CARMO MACHADO
RITA SUZANA PITANGUEIRA MACIEL
Data: 14 nov. 2023

Afficher le Résumé
Early-stage startups, often founded by small teams with innovative yet unproven ideas, face the challenge of validating their concepts in the market. These ideas take shape through a fluid business model, adapting until a repeatable and scalable approach is established. Startups must grapple with the pressure to deliver a minimum viable product or service swiftly. For software startups, this often centers around software or technology-mediated offerings, operating within a competitive landscape alongside companies of various sizes. Recent research has explored software startups' context, objectives, challenges, and practices. While extensive research has delved into how software startups engage in software development, there remains a gap in understanding how they select and implement specific development practices. Uncovering their software development processes and methodologies is equally essential, as it directly impacts their ability to overcome software development challenges. Key questions include: What characterizes software development in startup environments? How do startups prioritize product quality attributes in their development processes? What software engineering practices support their development endeavors, and which tools aid their progress? This thesis seeks to gain insights into how early-stage software startups navigate the software development journey, shedding light on their priorities, processes, and the tools they leverage. Our research adopts a qualitative approach in the form of a multiple case study involving 14 organizations. We analyze the data employing Grounded Theory techniques, including open, axial, and selective coding. The findings underscore the critical relevance of human factors, software development processes, software engineering practices, and external influences in the software development journey. Understanding these factors has led to the formulation of practical recommendations to bolster early-stage software development within startups. This work identifies pivotal factors influencing software development within startups and offers practical recommendations for their early-stage software development endeavors.

RENATA MARIA DE SOUZA SANTOS
A Multi-Case Study of Software Engineering Practices in Early-Stage Startups
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
GLEISON DOS SANTOS SOUZA
IVAN DO CARMO MACHADO
KIEV SANTOS DA GAMA
RITA SUZANA PITANGUEIRA MACIEL
Data: 14 nov. 2023

Afficher le Résumé
Early-stage startups, often founded by small teams with innovative yet unproven ideas, face the challenge of validating their concepts in the market. These ideas take shape through a fluid business model, adapting until a repeatable and scalable approach is established. Startups must grapple with the pressure to deliver a minimum viable product or service swiftly. For software startups, this often centers around software or technology-mediated offerings, operating within a competitive landscape alongside companies of various sizes. Recent research has explored software startups' context, objectives, challenges, and practices. While extensive research has delved into how software startups engage in software development, there remains a gap in understanding how they select and implement specific development practices. Uncovering their software development processes and methodologies is equally essential, as it directly impacts their ability to overcome software development challenges. Key questions include: What characterizes software development in startup environments? How do startups prioritize product quality attributes in their development processes? What software engineering practices support their development endeavors, and which tools aid their progress? This thesis seeks to gain insights into how early-stage software startups navigate the software development journey, shedding light on their priorities, processes, and the tools they leverage. Our research adopts a qualitative approach in the form of a multiple case study involving 14 organizations. We analyze the data employing Grounded Theory techniques, including open, axial, and selective coding. The findings underscore the critical relevance of human factors, software development processes, software engineering practices, and external influences in the software development journey. Understanding these factors has led to the formulation of practical recommendations to bolster early-stage software development within startups. This work identifies pivotal factors influencing software development within startups and offers practical recommendations for their early-stage software development endeavors.

	Thèses
1	TATIANA DE OLIVEIRA SILVA Dedicated protection and restoration hybrid strategy driven by low-priority connections in EON networks. Leader : GUSTAVO BITTENCOURT FIGUEIREDO MEMBRES DE LA BANQUE : GUSTAVO BITTENCOURT FIGUEIREDO RODRIGO IZIDORO TININI PEDRO MESQUITA MOURA Data: 15 févr. 2022 Ata de defesa assinada: Afficher le Résumé The spectrum rigidity of wavelength-multiplexed networks (Wavelength Division Multiplexing - WDM) does not allow to accommodate requests proportionally to their bandwidth needs, causing waste in the spectrum. Therefore, Elastic Optical Networks (EON) have proved to be a solution for the future of optical transport networks, bringing flexibility and efficiency in the use of resources. The scalability provided by these networks proves to be an adequate paradigm for the bandwidth requirements for emerging applications on the Internet. Due to the volume of traffic supported by EONs, failure events can cause massive data loss. For this, protection and restoration schemes have been developed in order to minimize such damage. Service Level Agreement (SLA) specifications make it possible for applications to have their resources properly allocated and with their respective service degradation tolerance levels. Higher priority classes of service present greater restrictions to service degradation, while lower priority classes are more flexible to degradation. In this way, the classification of connections and their respective SLA specifications must be taken into account when defining disaster recovery strategies in order to identify the best opportunities that increase the network provisioning or re-provisioning capacity. This work proposes a hybrid survivability scheme (protection and restoration) with bandwidth degradation and restoration delay for low-priority connections transmitted in the backup path of high-priority connections. The proposed algorithm was compared with a path protection strategy with dedicated backup. The results obtained demonstrate that restoration based on service degradation and the use of backup resources reduces the overall network blocking probability and the blocking probability by classes. Furthermore, the proposed strategy increases the ability to restore low-priority connections interrupted by the disaster while maintaining the ability to restore high-priority connections.
2	GABRIEL DAHIA FERNANDES META LEARNING FOR FEW-SHOT ONE-CLASS CLASSIFICATION Leader : MAURICIO PAMPLONA SEGUNDO MEMBRES DE LA BANQUE : FABIO AUGUSTO FARIA MAURICIO PAMPLONA SEGUNDO RUBISLEY DE PAULA LEMES Data: 8 mars 2022 Afficher le Résumé We propose a method that can perform one-class classification given only a small number of examples from the target class and none from the others. We formulate the learning of meaningful features for one-class classification as a meta-learning problem in which the meta-training stage repeatedly simulates one-class classification, using the classification loss of the chosen algorithm to learn a feature representation. To learn these representations, we require only multiclass data from similar tasks. We show how the SVDD method can be used with our method, and also propose a simpler variant based on Prototypical Networks that obtains comparable performance, indicating that learning feature representations directly from data may be more important than which one-class algorithm we choose. We validate our approach by adapting few-shot classification datasets to the few-shot one-class classification scenario, obtaining similar results to the state-of-the-art of traditional one-class classification, and that improves upon that of one-class classification baselines employed in the few-shot setting. Moreover, as a practical application, we employ our method to the biometric task of on-device face verification. In this scenario, it compares unfavorably to the state-of-the-art metric learning technique.
3	DANIEL DAVID FERNANDES An Empirical Investigation of Test Smell in Python Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : LEOPOLDO MOTTA TEIXEIRA RITA SUZANA PITANGUEIRA MACIEL RODRIGO ROCHA GOMES E SOUZA Data: 30 mars 2022 Afficher le Résumé Software testing is an activity in the software development process that looks for defects. Automated testing is composed of code that allows you to run software testing scenarios more quickly, avoiding manual rework. When writing test code, testers may execute bad practices, known as test smells, which are coding patterns that can negatively impact test quality in terms of maintainability, understandability, and defect detection effectiveness. Python became the most widely used programming language in the world in 2020, however, most research on test code quality is conducted for the Java language. The absence of studies regarding Python developers’ perception of test smells, as well as types of test smells not yet detected by tools, can cause damage to software developed in this language. Although the concept of test smells is language independent, studies in one programming language cannot be generalized to the others, because different languages may present different behaviors regarding test smells. This dissertation aims to support Python developers in building and maintaining higher quality test code, providing insight into the occurrences of test smells in Python. To do this, we initially built a dataset with 5303 test files from 90 Python projects collected from GitHub repositories to observe the behavior of test smells in Python. This analysis resulted in the proposal of 4 new test smells with discussions of their potential impacts. In addition, to know the perception of Python developers about test smells, we conducted a survey through questionnaires to understand the perception about software testing of Python developers, which we publish the analysis of the results obtained. As an additional contribution, we developed TEMPY, an open-source 10 test smells detection tool for Python, which achieved 100% accuracy. We evaluated TEMPY through interviews with 10 Python developers, obtaining 99% agreement on the detections pointed out by TEMPY. We hope sharing the findings of this study, Python developers can use the concept of test smell to improve test code quality, thereby supporting software testing activities
4	Alan Teixeira de Oliveira Improving the efficiency of cloud systems through latency-sensitive and best-effort task placement Leader : VINICIUS TAVARES PETRUCCI MEMBRES DE LA BANQUE : GEORGE MARCONI DE ARAUJO LIMA VINICIUS TAVARES PETRUCCI ESBEL TOMÁS VALERO ORELLANA Data: 1 avr. 2022 Afficher le Résumé Cloud computing provides a centralized, service-oriented infrastructure for users around the world. A cloud environment allows running critical applications sensitive to user interaction (such as an online web search), and is also used to run best-effort oriented batch applications (such as audio/video compression, indexing of web pages). Data centers are designed on a large scale to support the various cloud services, distributed across hundreds of thousands of servers, with significant operational and capital expenditures. Thus, optimizing the use of server resources results in significant savings on these systems. As a research opportunity, there is the possibility of exploring idle moments of servers that run critical services, especially during periods of low demand. This research aims to co-locate different types of cloud application tasks (latency-sensitive and best-effort) on the same server in order to increase its utilization, thus improving its efficiency. The work demonstrates that performing co-allocation of latency-sensitive and best-effort tasks provides greater efficiency for cloud servers, as long as the performance of critical application tasks is guaranteed. The work explored the use of specific schedulers for each type of task, latency-sensitive tasks were scheduled and parameterized via real-time scheduling, and batch application tasks used fair best-effort scheduling. The gains in relation to the standard scheduler of the system reached up to 150% when compared to the one proposed in this work, here called EDFCoaloc.
5	PRABHÁT KUMAR DE OLIVEIRA Soft Tissue Deformation Simulation in Orthognathic Surgery Planning Using Position Based Dynamics Leader : ANTONIO LOPES APOLINARIO JUNIOR MEMBRES DE LA BANQUE : ANTONIO LOPES APOLINARIO JUNIOR IEDA MARGARIDA CRUSOE ROCHA REBELLO KARL PHILIPS APAZA AGUERO VINICIUS MOREIRA MELLO Data: 8 avr. 2022 Afficher le Résumé Deformable bodies is a topic that has been studied in the area of computer graphics in recent decades, with several solutions and proposals for simulations of these types of objects. However, there is still a concern with the balance between physical realism and real-time simulation in an interactive way. Some previous works in the medical field use FEM and MSM to perform simulations, but these approaches have limitations. The use of Position-Based Dynamics (PBD), in turn, has been gaining prominence, being able to provide interactions between different types of objects. The purpose of this work focuses on the use of PBD for real-time simulation of soft and rigid tissues in predicting the results of orthognathic surgery.
6	ARISTOTELES ESTEVES MARÇAL DA SILVA Planning in Self-Adaptive Systems: An Approach Based on Model Verification and MAPE-K Leader : ALINE MARIA SANTOS ANDRADE MEMBRES DE LA BANQUE : MARIA VIVIANE MENEZES ADOLFO ALMEIDA DURAN ALINE MARIA SANTOS ANDRADE ALIRIO SANTOS DE SA SANDRO SANTOS ANDRADE Data: 12 avr. 2022 Afficher le Résumé The design and implementation of effective Self-Adaptive Systems (SAS) is a very challenging task, usually due to the existence of complex dynamics between the various elements that make up the system and between these elements and the environment in which they operate. A crucial point, in the SAS project, is the execution of the planning activity (planning): definition of which actions (adaptation plans) must be carried out so that the system adapts and returns to meet the quality levels expected service. Recent research has investigated the use of formal verification techniques to enable the generation of reliable adaptation plans. Through the model checking technique (model checking) and the MAPE-K reference model, we developed the design, implementation and validation of an architecture that defines self-adaptive structures and behaviors using formal models, for the specification planners that generate adaptation plans autonomously, considering self-adaptive properties such as self-organizing and self-healing. The architecture evaluation was carried out through a case study related to the traffic light control problem, with the creation of a planner, in which the adaptation plans were generated, in an off-line way, with the verifier of UPPAAL models and executed in run-time through an implementation of this work. Some application scenarios were simulated, using the SUMO tool, to evaluate the effectiveness of the solutions found. The created planner generated efficient adaptation plans, autonomously, in scenarios where the number of possible adaptation plans was around 2¹². The results indicate that the solutions adopted with our planner were more effective compared to solutions without any self-adaptation approach, when the vehicle flow approached collapse.
7	EBERTY ALVES DA SILVA Pipeline for 3D reconstruction of cultural artifacts using different depth and color cameras Leader : KARL PHILIPS APAZA AGUERO MEMBRES DE LA BANQUE : LUCIANO SILVA ANTONIO LOPES APOLINARIO JUNIOR KARL PHILIPS APAZA AGUERO Data: 12 mai 2022 Afficher le Résumé "In recent years, there have been numerous advances with regard to 3D digitization of cultural heritage, mainly applying a process chain to three-dimensional reconstruction (pipelines) from expensive high-precision 3D scanners. In addition, recent studies seek to reconstruct objects and artifacts using low-cost acquisition devices (for example, the Microsoft Kinect sensor) or using more modern photogrammetry techniques (such as Structure from Motion). Low-cost methods are still limited in the context of 3D reconstruction of cultural artifacts, being a real challenge when it comes to the representativeness and final quality of digitally reconstructed 3D models. Aiming to obtain complete and well-detailed three-dimensional models of cultural artifacts using low-cost technology, this work proposes a 3D reconstruction pipeline that uses point clouds from different sensors, combining captures from a low-cost depth sensor post-processed by imaging techniques. Super-Resolution with high resolution RGB images acquired by an external camera applied in Structure from Motion and Multi-View Stereo algorithms. The main contribution of this work includes the description of a complete pipeline that improves the information acquisition stage and merges data from different sensors. Various phases of the 3D reconstruction pipeline were also specialized to improve the visual quality of the model. The pipeline was evaluated and discussed in terms of the results obtained and demonstrates that it contributes to enabling the use of low-cost 3D reconstruction methodologies in the context of digital cultural heritage. The proposed methodology was developed to be applied to the preservation of pieces from the Museum of Archeology and Ethnology of the Federal University of Bahia (MAE/UFBA), being also tested on table objects that enabled the validation process of the experimental results."
8	MURILO GUERREIRO AROUCA Gamification as a support for participatory mapping and crowdsourcing approaches in the face of Covid-19 Leader : MARCOS ENNES BARRETO MEMBRES DE LA BANQUE : ISA BEATRIZ DA CRUZ NEVES LUSTOSA MARCOS ENNES BARRETO RICARDO LUSTOSA BRITO VANINHA VIEIRA DOS SANTOS Data: 16 mai 2022 Afficher le Résumé Due to the impacts caused by the advent of the new Coronavirus (SARS-CoV-2), the mobilization and popular participation in the face of public health policies are fundamental for facing Covid-19 and mitigating its damages, which is widely affected knowledge, attitudes and practices in relation to the disease. However, in view of the context of people in a state of social vulnerability and with low access to education and information, the absence of participation and engagement in socioeconomically disadvantaged communities presents itself as an obstacle that stands in the way of the objectives proposed by health policies. . Thus, the present work proposes the implementation of the GAFCC theory-oriented gamification design model on the +Lugar platform (Covid-19 version), to support participatory crowdsourcing approaches and participatory mapping in favor of initiatives aimed at coping with Covid-19. -19, mainly the mapping and signaling of disease transmission predictors in real time, through a risk perception approach. In order to assess the acceptance and use of the proposed solution, UTAUT was used to identify the determining factors that have a significant impact on the intention to use the platform. To this end, a validation study was carried out with 20 young people and adolescents living in the Marechal Rondon neighborhood, where a form was applied, consisting of questions related to moderating and determining factors, and behavioral intention. The result of the study pointed out that the respondents have high levels of performance and effort expectation, social influence, facilitating conditions, and behavioral intention to use the platform. However, through the analysis of Spearman's correlation coefficient, it was possible to identify that only the factors “effort expectancy” and “social influence” significantly impact behavioral intention.
9	GESSÉ JUSTINIANO DE OLIVEIRA JÚNIOR Evaluation of Scheduling Algorithms and Energy Savings for FreeRTOS-based Embedded Systems Leader : GEORGE MARCONI DE ARAUJO LIMA MEMBRES DE LA BANQUE : ALLAN EDGARD SILVA FREITAS GEORGE MARCONI DE ARAUJO LIMA VINICIUS TAVARES PETRUCCI Data: 3 juin 2022 Afficher le Résumé Embedded systems, increasingly present in our daily lives, are becoming more complex and powerful due to technological advances in computing and electronics. Real-time embedded systems applications related to the automotive and aeronautical industries are prominent examples of classical engineering disciplines, where the conflicts between costs, short product life cycles and legal requirements regarding reliability, robustness and safety arise. became a problem. Indeed, managing energy effectively and efficiently is crucial for all battery powered devices, for example those applied in autonomous robots, wearable devices, industrial controllers and wireless sensor networks. Determining effective scheduling algorithms in real-time systems where task deadlines are met is a topic widely addressed in the state of the art, always aiming to improve applications in different environments. When the approach applied is via fixed priority scheduling, which does not change as a function of time, the computational cost is lower than dynamic priority scheduling, since it is not necessary to dynamically order the task queue. In embedded systems with small processing power, the use of Earliest-Deadline First (EDF), a dynamic priority scheduling policy, is often less seen, although there are many efforts to use it in this segment, demonstrating that overhead may be feasible. When using the DVFS technique, the frequency and voltage of the CPU supply are manipulated during the execution of the tasks, in order to reduce the power dissipated by the processor. With the use of the DPM technique, the idle components at a given instant of time t, can be turned off or placed in low energy modes in order to reduce the power consumed by the processor when performing some action in the system. The focus of the present work is to adapt FreeRTOS, the real-time operating system, in order to make it capable of dealing with applications that have time constraints and energy consumption. A new energy saving technique based on DPM, with low computational overhead, was created to be used on platforms with few hardware resources.
10	MAYCON EMILY DE FARIAS RIBEIRO DESIGN AND DEVELOPMENT OF A ROBUST COMPUTATIONAL TOOL FOR ELECTROCARDIOGRAPHY Leader : RAIMUNDO JOSE DE ARAUJO MACEDO MEMBRES DE LA BANQUE : LUCIANO REBOUCAS DE OLIVEIRA RAIMUNDO JOSE DE ARAUJO MACEDO ROBERTO FREITAS PARENTE Data: 8 juin 2022 Afficher le Résumé For a long time, electrocardiograms (ECGs) have been used to diagnose heart problems. However, obtaining a satisfactory classification through the technology of digital systems applied to e-health medicine is a difficult task. To study this problem, it was considered that the heart is a reactive system with some possible modes or states of functioning, so it is intuitive to use computational tools such as finite state machines (FEMs) to describe the health status of this human organ. In this paradigm, we propose the Cyber-ECG, a computational model for automatic classification of ECG signals with detection of electrocardiograph or sensor failures. To accomplish this, the Simscape and Stateflow libraries from Simulink were used in the Matlab ecosystem. In addition, a prototype for cardiac pulse detection is presented, using the Medical Internet of Things (IoTM) technology. This prototype consists of an ESP32 board, to collect, aggregate and visualize data on a person's heart pulse in the cloud. The proposed Cyber-ECG tool obtained an accuracy of 84% and 80% when classifying the ARR and NSR classes, respectively. For these same classes, the F1-Score values are approximately 82% and 83%. The precision was 82.5% and the sensitivity was 80%, respectively. Based on these results, the tool
11	MATHEUS THIAGO MARQUES BARBOSA Q-balance: A load balancing scheme based on a Multilayer Perceptron for Fog resources on a Smart Grid Leader : MAYCON LEONE MACIEL PEIXOTO MEMBRES DE LA BANQUE : MAYCON LEONE MACIEL PEIXOTO GUSTAVO BITTENCOURT FIGUEIREDO BRUNO GUAZZELLI BATISTA Data: 14 juin 2022 Afficher le Résumé Data processing in Smart Grids applications can use cloud computing. However, this infrastructure can lead to increased response time in such applications due to the distance between cloud data centers and Smart Meters. In this way, we propose a neural network based approach to manage the computational resources in the fog. In this environment, Q-balance aims to reduce the average response time of applications that use data from Smart Meters through the use of a MultiLayer Perceptron -- (MLP) neural network. MLP predicts the time that a computational resource will process a request from the Smart Meter application. Q-balance uses this information about the forecast to balance the load between available resources, reducing the average response time obtained. The performance evaluation of the experiments showed that the Q-balance reduced the average response time by up to 65% and 79% compared to the algorithms in the literature for fog and cloud respectively.
12	VICTOR MARTINEZ VIDAL PEREIRA Exploiting Linked Data in DBpedia to Reduce Prediction Error in Matrix Factorization Recommenders Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : ADRIANO CÉSAR MACHADO PEREIRA DANILO BARBOSA COIMBRA FREDERICO ARAUJO DURAO RAMON PEREIRA LOPES RODRIGO ROCHA GOMES E SOUZA Data: 21 juin 2022 Afficher le Résumé Recommender Systems provide suggestions for items that are most likely of interest to users. Providing personalized recommendations is a challenge that can be addressed by filtering algorithms among which Collaborative Filtering (CF) has demonstrated much progress in the last few years. By using Matrix Factorization (MF) techniques, CF methods reduce prediction error by using optimization algorithms. However, they usually face problems such as data sparsity and prediction error. Studies point to the use of data available in Semantic Web as a path to improve recommender systems and address the challenges related to CF techniques. Motivated by these premises, the present work developed a data pipeline along with an algorithm that processes the Ratings Matrix combining semantic similarities of Linked Open Data (LOD) and estimates missing rat- ings. The experiments take subsets of three different datasets (Movielens, LastFM and LibraryThing), two semantic similarity metrics, Linked Data Similarity Distance (LDSD) and Resource Similarity (RESIM), and three MF-based algorithms (SVD, SVD++ and NMF). Our experiments reduced sparsity by more than 75% in Movielens subset and 28% in LastFM. Prediction error is reduced in all subsets with statistical confidence using parametric test one-way ANOVA followed by Tukey’s multiple comparison test.
13	MARCOS RICARDO SANTOS OLIVEIRA PSGF (Phase Space Gap Filling): A new method to fill missing values in chaotic time series Leader : RICARDO ARAUJO RIOS MEMBRES DE LA BANQUE : RICARDO ARAUJO RIOS RENATO PORFIRIO ISHII EWALDO EDER CARVALHO SANTANA Data: 7 juil. 2022 Afficher le Résumé The preprocessing step performed to deal with missing or invalid information in datasets is a relevant task in Machine Learning (ML) applications to avoid producing wrong models and make feasible the usage of specific algorithms that do not work in such a condition. In general, missing values occours for different reasons as, for instance, problems in the device used to monitor a system, network issues between monitoring and storage services, and the authentic absence of data. By collecting data in an i.i.d (independent and identically distributed) manner, traditional ML models are able to replace missing values. However, when there are temporal dependencies between collected observations, e.g., time series, such models are unsuitable for not considering the existing relationship in time instants. The treatment of missing data in time series is performed by several techniques such as interpolation methods (e.g. Lagrange, Newton, and Splines) and Singular Spectrum Analysis (SSA). Experiments during this project highlighted that these methods provided poor results when the time series present a chaotic behavior once their attractors in the phase space are not taken into account. Therefore, this work presents a new method that considers Dynamical System and Chaos Theory tools to unfold series from the temporal domain into phase space, making it possible the adoption of ML models to replace missing values. Our results emphasize the importance of this new paradigm to deal with missing values, outperforming the state-of-the-art.
14	VERUSCA CARIN BARROS ROCHA EVOLVING A MAP TO SUPPORT THE MANAGEMENT OF TECHNICAL DEBT RELATED TO SOFTWARE TESTING Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : JOSÉ CARLOS MALDONADO MANOEL GOMES DE MENDONCA NETO RENATO LIMA NOVAIS Data: 27 juil. 2022 Afficher le Résumé Technical Debt (TD) contextualizes a set of tasks that are not performed and are accumulated to be performed later during software development. In software testing, insufficient test coverage, lack of testing, and lack of test case planning are situations in which test debt can be found. In contrast, other types of debt can affect the quality of testing activities. Outdated requirements documentation is an example of a requirements debt situation that can compromise test planning activities. In this scenario, the term test-related debt (TRTD) refers to the problems faced by test professionals due to the presence of TD items. Recently, a conceptual map has been proposed in the technical literature that relates test activities to TD items. This map presents test-related TD causes, indicators and payment practices. However, this map was based on an ad hoc literature review and experimental evidence on its feasibility for use in industry is limited. Despite several works investigating indicators, causes, effects and prevention of TD, there is still a long way to go on how to use this information to support the management of debt items in the context of testing activities. The objective of this work is (i) to present an updated version of a conceptual framework that organizes a set of indicators, causes, effects and preventive practices of TRTD, based on data from the InsighTD project, which is a global questionnaire, which seeks to generate experimental data on TD, its causes, effects and management from software engineering professionals and (ii) to evaluate it for its ease of use, usefulness and possible future use. This evaluation methodology, based on the technology acceptance model (TAM), ran a feasibility study of the maps with 95 participants. As a result of the research, it was observed that the conceptual framework with TD elements can be useful to support the management of TRTD items, as pointed out by 89% of the participants. Most participants also indicated that they would gain productivity, performance, agility and effectiveness using the proposed framework. The framework was also considered easy to learn and use by the study participants.
15	SAMIA CAPISTRANO GOMES Evaluating Nexus Modifications Popularity and Positive Evaluations: An Empirical Study Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : RODRIGO PEREIRA DOS SANTOS EDUARDO SANTANA DE ALMEIDA RODRIGO ROCHA GOMES E SOUZA Data: 18 août 2022 Afficher le Résumé In recent years, computer games have grown in the number of users, application area, and development complexity. In this context, modding stands out, which are changes created by game users (Modders), made available for free through modification communities. One of the largest modification-sharing communities is Nexus Mods, with over 300,000 diverse files available to its users. The Mods tend to increase the life cycle and sales and help add new functionality to games, but few studies have analyzed them. So we studied the mods in order to identify patterns of artifacts focusing on their popularity and quality (positive evaluations). This study identified which dimensions could distinguish popular modifications from unpopular ones and the mods with many positive evaluations from those with few positive evaluations. We also identify the features with significant explanatory power under popularity and positive evaluations of the mods. For this, Logistic models were proposed and analyzed using statistical calculations. We analyze 40,865 popular mods and 40,865 unpopular mods, 40,865 mods with many positive evaluations, and 40,865 with few positive evaluations from the Nexus Mods community through 4 dimensions (Mods Categories, Mods Documentation, Mods Community Contribution, and Modders Characteristics ) and 27 characteristics. As a result, we obtained that all the dimensions studied have the power to distinguish popular modifications from unpopular ones and modifications with many positive evaluations from those with few positive evaluations. Of the 27 features, 23 features have significant explanatory power under mod popularity, and 22 under mod positive evaluations. This study has identified a need to integrate the versions of mods and their version evaluations, enable negative evaluations, create a good description, and add tags to the mods, among other implications and perspectives for the Nexus community, Modders, users, and researchers.
16	JADNA ALMEIDA DA CRUZ GRSPOID: A Point of Interest Recommendation System for Groups using Diversification Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO DANILO BARBOSA COIMBRA ROSALVO FERREIRA DE OLIVEIRA NETO Data: 24 août 2022 Afficher le Résumé In recent years the availability of data on the Web has been driven by the increasing use of social networks and smartphone applications. In addition to textual content, geo-location information is also shared, favoring the emergence of numerous location-based services. The location information of a Point of Interest (POI) can be used to understand the profile of users, their interests, and movements. In this way, it is possible to identify, for example, places of interest for a particular user, and even classify them into categories, such as cafes, universities, bars, malls, etc. This type of data is widely used for Points of Interest Recommender Systems, which aim to assist users in the search for places of interest, whether on a daily basis or during a trip. These systems are traditionally recommended for individuals, however, there are scenarios where individuals gather in groups, thus increasing the complexity of the problem. In addition to the need to find individual preferences, the recommendation must consider the preferences of the group as a whole, which requires the application of a consensus technique. Another obstacle is that non-diversified recommendations tend to always be in the same category, decreasing the group's interest in recommendations from already known Points of Interest. This master's thesis proposes a Points of Interest Recommendation System for Groups using diversity. To evaluate the proposed model, an exhaustive experiment was carried out with 19 groups, some with 3 and others with 5 members. To evaluate the diversified recommendations, precision metrics were used in positions 3, 5, and 10. According to the results, the recommendations for positions 5 and 10 obtained more satisfactory results when diversity was applied. After the experiment with real users, an offline analysis was also performed with variations of the proposed model and aggregation techniques. According to the results obtained, it was possible to verify that the recommendation models with diversity obtained better results than the non-diversified approach in most of the tested configurations.
17	ADEILSON ANTONIO DA SILVA On deceiving malware classification with section injection: attack and defense using deep neural networks. Leader : MAURICIO PAMPLONA SEGUNDO MEMBRES DE LA BANQUE : ANDRÉ BRASIL VIEIRA WYZYKOWSKI KARL PHILIPS APAZA AGUERO MAURICIO PAMPLONA SEGUNDO Data: 22 nov. 2022 Afficher le Résumé We investigate how to modify executable files to deceive malware classification systems. This work's main contribution is a methodology to inject bytes across a malware file randomly and use it both as an attack to decrease classification accuracy but also as a defensive method, augmenting the data available for training. It respects the operating system file format to make sure the malware will still execute after our injection and will not change its behavior. We reproduced five state-of-the-art malware classification approaches to evaluate our injection scheme: one based on GIST+KNN, three CNN variations and one Gated CNN. We performed our experiments on a public dataset with 9,339 malware samples from 25 different families. Our results show that a mere increase of 7% in the malware size causes an accuracy drop between 25% and 40% for malware family classification. They show that a automatic malware classification system may not be as trustworthy as initially reported in the literature. We also evaluate using modified malwares alongside the original ones to increase networks robustness against mentioned attacks. Results show that a combination of reordering malware sections and injecting random data can improve overall performance of the classification. Code available at https://github.com/adeilsonsilva/malware-injection.
18	AMANDA CHAGAS DE OLIVEIRA A Recommendation Model for Groups Using Diversification Techniques Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO DANILO BARBOSA COIMBRA NATASHA CORREIA QUEIROZ LINO Data: 28 nov. 2022 Afficher le Résumé Recommender Systems traditionally recommend items to individual users. However, there are scenarios where individuals gather in groups, and with that arises the need to recommend to groups. Most of these groups form naturally, for example, to watch a movie, have lunch at a restaurant, or even plan a trip. In all these hypotheses, it is possible to use Recommender Systems to offer personalized information to the group as a whole. To do so, it is necessary to consider the individual preferences of the group members, to satisfy them fully, and, in this sense, to use techniques for aggregating this information. Although there are consensus techniques for aggregating information, the recommendations can be repetitive among themselves, as they will always serve the same group profile. This inconvenience sets a precedent for adopting diversity techniques for recommendations to the group. In this work, we investigate how to apply such diversification techniques in group recommendations, based on the members' preferences, to avoid over-specialization of the system and thus keep the group members satisfied in general. To do so, it is necessary to develop a group formation model and then model how the recommendation will be carried out to be able to apply diversification techniques to the items to be recommended. Two experiments were carried out in this research, considering related work as a baseline. In the offline experiment, the result of the proposed model was slightly higher than the baseline, having been 3.8% more accurate and 1.8% more diverse. In the second experiment, the online experiment, the proposed model completely dominated the baseline in terms of accuracy and was slightly inferior in terms of diversity.
19	Jane Barbosa Santos Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : ECIVALDO DE SOUZA MATOS JOBERTO SERGIO BARBOSA MARTINS LEOBINO NASCIMENTO SAMPAIO MARIA CAROLINA DE SOUZA SAMPAIO Data: 7 déc. 2022 Afficher le Résumé
20	LEONAN TEIXEIRA DE OLIVEIRA A latency-sensitive and network cost awareness approach for allocation of component-based applications in a hierarchical fog. Leader : MAYCON LEONE MACIEL PEIXOTO MEMBRES DE LA BANQUE : DANIEL GOUVEIA COSTA LUIZ FERNANDO BITTENCOURT MAYCON LEONE MACIEL PEIXOTO Data: 13 déc. 2022 Afficher le Résumé Over the years, Cloud Computing has provided infrastructure for processing applications and services. However, the distance between Cloud computing resources and Internet of Things (IoT) devices can cause delays in response times for some services and applications that have sensitivity and temporal constraints. In this context, Fog Computing emerges as a computational paradigm that seeks to bring Cloud Computing resources closer to IoT devices, geographically distributing micro-data centers (Cloudlets) and reducing response time. Studies using Fog Computing present solutions for the application allocation, addressing that it is possible to meet different types of requirements. These studies evaluate certain requirements for decision making, such as response time and communication cost. However, it is observed in these studies that as a requirement is prioritized, another is neglected. Thus, this work proposes an approach for allocating modular applications in a Fog Computing architecture with multiple hierarchical levels, which seeks to reduce the response time of latency-sensitive applications and reduce data traffic on the network. To achieve this goal, the proposed approach, Least Impact - X (LI-X), seeks to reduce the idleness of resources at the lowest levels of the fog and takes into account the communication between application modules when deciding on the allocation. LI-X was compared to predecessor studies in a simulated iFogSim environment. The results show that the LI-X was able to overcome these studies in most of the proposed scenarios, reducing the response time and the cost of data communication on the network.
21	Nilvan Santana Souza Intelligent Resource Allocation for Delayed Protection Based on Machine Learning in Elastic Optical Networks Leader : GUSTAVO BITTENCOURT FIGUEIREDO MEMBRES DE LA BANQUE : MARCIA HELENA MOREIRA PAIVA GUSTAVO BITTENCOURT FIGUEIREDO TATIANE NOGUEIRA RIOS Data: 14 déc. 2022 Afficher le Résumé Currently, several Data Centers are interconnected worldwide, providing support to cloud services for processing and storing data generated on the internet. The Elastic Optical Network (EON) presents itself as an infrastructure capable of transmitting the high data rates generated in such Data Center networks, creating the Inter-datacenter Elastic Optical Networks (IDC-EONs). This is because the EON network has high bandwidth. In addition, it can operate at different transmission rates, increasing or decreasing the bandwidth according to demand, which makes the network ideal for supplying the world's data flow. In IDC-EONs, minor interruptions in transmission can result in the loss of a large volume of data. Some data can be critical, so their loss can cause harm to network users, hence the need to use protection strategies in IDC-EONs. From the study and analysis of data, it is possible to detect patterns and behaviors of the IDC-EON network, through the Machine Learning (ML) process, from the learning of a past experience. This work presents 3 resource allocation mechanisms for deferred protection in IDC-EON networks from this context. The mechanisms used in this work were based on data analysis and AM techniques to help decisions related to bandwidth allocation in optical paths in IDC-EON networks.
	Thèses
1	STEFANI SILVA PIRES On learning suitable caching policies in Information-Centric Networks Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : JUSSARA MARQUES DE ALMEIDA DANIEL SADOC MENASCHE CHRISTINA VON FLACH GARCIA CHAVEZ LEOBINO NASCIMENTO SAMPAIO TATIANE NOGUEIRA RIOS Data: 11 févr. 2022 Afficher le Résumé In recent years, Information-centric networking (ICN) has gained attention from the research and industry communities as an efficient and reliable content distribution network paradigm, especially to address content-centric and bandwidth-needed applications together with the heterogeneous requirements of emergent networks, such as the Internet of Things, Vehicular Ad-hoc NETwork, and Mobile Edge Computing. In-network caching is an essential part of ICN architecture design, and the performance of the overall network relies on caching policy efficiency. Therefore, a large number of cache replacement strategies have been proposed to suit the needs of different networks. The literature extensively presents studies on the performance of the replacement schemes in different contexts. The evaluations may present different variations of context characteristics leading to different impacts on the performance of the policies or different results of most suitable policies. Conversely, there is a lack of research efforts to understand how the context characteristics influence policy performance. There is also a lack of initiatives to assist the process of choosing a suitable policy given a specific scenario. In this direction, this thesis address those research gaps by (i) pointing out what is context from the perspective of cache replacement policies and the context characteristics that influence cache behavior, and (ii) proposing a caching meta-policy strategy to assist the choosing process of suitable policies according to the current context. For the context delimitation study, we have conducted an extensive survey of the ICN literature to map reported evidence of different aspects of context regarding the cache replacement schemes. Beyond the contribution of understanding what is context for caching policies, the survey provided a helpful classification of policies based on the context dimensions used to determine the relevance of contents. Moreover, as an investigation of holistic aspects to represent context, and motivated by the emergent area of human-centric networking, we have performed an exploratory case study on a human behavior influence over the policies performance. To accomplish such goal, we carry out a simulation-based study that evaluated the performance of cache replacement policies through clusters formed by users according to their music listening habits. The results fostered the evidence that distinct context aspects have an effect on caching policy performances. Following the context studies, we present a meta-policy strategy capable of learning the most appropriate policy for cache online and dynamically adapting to context variations that leads to changes in which policy is best. The meta-policy benefits from the diversity of policies and its context aspects, decouples the eviction strategy from managing the context information used by the policy, and models the choice of suitable policies as online learning with bandit feedback problem. The meta-policy can support the deployment of a diverse set of self-contained caching policies in different networks. It enables cache routers to work as adaptive systems agnostic to the underlying contexts, such as content request patterns or popularity variations. Experimental results in single and network of caches have shown the meta-policy effectiveness and adaptability to different contexts in synthetic and trace-driven simulations.
2	ANDRÉ BRASIL VIEIRA WYZYKOWSKI IDENTITY-AWARE FINGERPRINT SYNTHESIS USING GENERATIVE ADVERSARIAL NETWORKS Leader : MAURICIO PAMPLONA SEGUNDO MEMBRES DE LA BANQUE : DANILO BARBOSA COIMBRA KARL PHILIPS APAZA AGUERO MAURICIO PAMPLONA SEGUNDO RAONI FLORENTINO DA SILVA TEIXEIRA RODRIGO MINETTO Data: 23 févr. 2022 Afficher le Résumé Today’s legal restrictions that protect biometric data privacy are hampering fingerprint recognition research. For instance, all high-resolution fingerprint databases ceased to be publicly available. Besides, there are no hybrid databases that contain high and medium-resolution images on different sensors. Recent works use generative neural networks (GAN) to replicate the distribution of fingerprints in a given training dataset and synthesize new images. However, these methods are troubled with the replication of undesirable visual characteristics and biases (e.g., fingerprint class distribution) from the training dataset and the inability to generate multiple images of the same identity. To address this problem, we present two novel approaches to synthesize fingerprints. The first is a hybrid approach to synthesize realistic, multiresolution and multisensor fingerprints. This approach can generate high and medium-resolution fingerprints on different sensors, maintaining the same identities. The second is a fingerprint generation method divided into two stage: identity generation and acquisition simulation. Unlike previous works, our first stage models fingerprints focusing on identity cues rather than realism, which we achieve by combining class-specific image generation and inpainting-based restoration. In the second stage, we create multiple samples per identity with the combination of sensor modeling (e.g., area, orientation, deformation) and realistic texturing. We favorably compared our synthetic images to the state-of-the-art in terms of biometric recognition performance and seed image quality analysis. Our experiments suggest that the datasets of synthetic images created by our approaches are analogous to an authentic dataset, which can be crucial for researching and developing large-scale biometric systems without incurring major dataset creation costs and privacy-related risks.
3	DIOGO VINÍCIUS DE SOUSA SILVA Using Clustering Techniques and Markov Chains for Long-Tailed Item Recommender Systems Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO MAYCON LEONE MACIEL PEIXOTO RAFAEL AUGUSTO DE MELO RENATO DE FREITAS BULCÃO NETO PEDRO DE ALCÂNTARA DOS SANTOS NETO Data: 9 juin 2022 Afficher le Résumé Recommendation Systems focus on recommending the relevant items to end-users. Usually, the most important items are the most popular ones. However, the emergence of new forms of content distribution in the most diverse markets implied the emergence of the long tail phenomenon, i.e., the creation of new niche products market. Considering the growth of groups related to niche markets, the study of recommendations in the long tail has become increasingly demanding. Nevertheless, long-tail items naturally tend to generate fewer inputs to information systems such as purchase logs, user feedback, and ratings. As a result, it becomes more difficult to recommend items from the long tail. Motivated by these premisses, the primary purpose of this thesis is to develop and exploit recommendation models capable of leading users to niche items located in the long tail, but at the same time highly relevant ones. For this, two major techniques of clustering and representation of matrices through graphs are explored. The first technique adopts Markov chains to calculate similarities of the nodes of a user-item graph. The second technique applies clustering to the set of items in a dataset. Such a combination aims to give more visibility to the long tail items. Experiments were carried out to evaluate the approach and measure the effectiveness of the recommendations considering the long tail context. Metrics such as recall, diversity, and popularity of the generated recommendations were calculated and compared with techniques whose objectives do not directly cover long-tail recommendations. In addition, a questionnaire applied to experts in the business domain also complemented the evaluation of the proposed approaches. By comparing our proposals against three state-of-the-art baselines, the results show that it is possible to improve the accuracy of the recommendations even by focusing on less popular items, in this case, niche products that form the long tail. The recall in some cases improved by about 27.9%, while the popularity of recommended items has declined. In addition, the recommendations show to contain more diversified items indicating better exploitation of the long tail.
4	JOÃO PAULO PEREIRA DE SÁ CANÁRIO On deep learning features for noisy time series classification Leader : RICARDO ARAUJO RIOS MEMBRES DE LA BANQUE : RICARDO ARAUJO RIOS RUBISLEY DE PAULA LEMES EDUARDO FURTADO DE SIMAS FILHO MARCELO KEESE ALBERTINI GLORIA MILLARAY JULIA CURILEM SALDIAS Data: 4 juil. 2022 Afficher le Résumé The adoption of Deep Neural Network (DNN) methods to solve problems in real-world scenarios has been increasing as the data volume grows. Although such methods present impressive results in supervised learning, it is known that the occurrence of noises modifying the original data behavior can affect the model accuracies and, consequently, the generalization process, which is highly relevant in learning tasks. Several approaches have been proposed to reduce the impact of noise on the final model, varying since the application of preprocessing steps to the design of robust DNN layers. However, we have noticed that such approaches were not systematically assessed to understand how the noise influences have been propagated throughout the DNN architectures. This gap motivated us to design this work, which was focused on modeling noisy data with temporal dependencies, typically referred to as time series or signals. In summary, our main claim was to create a network capable of acting as a noise filter and being easily connected to existing networks. To reach this goal, we have defined a methodology, which was organized into four phases: i) execution of a study about the application of DNNs to model signals collected from a real-world problem; ii) investigation of different preprocessing tools to transform such signals and reduce noise influences; iii) analysis about the impact of increasing/reducing the noise on the final model; and iv) creation of a new DNN that can be embedded into DNN architectures and act as noise filtering layer to keep the overall performances. The first and second phases were achieved in collaboration with researchers from the Universidad de La Frontera, which provided a set of signals directly collected from the Llaima volcano in Chile. The modeling performed on such signals allowed the creation of a new architecture called SeismicNet. By knowing the behavior or such signals, we could create a controlled scenario with different additive noise levels and outputs produced by our original models, thus meeting the third phase. Next, we performed two new studies to understand the impact of noises in our scenario. Firstly, we used statistical tests to confirm the error variation when noise is added to the expected signals. Then, we used XAI (eXplainable Artificial Intelligence) to visually comprehend the noise propagation into the DNN layers. Finally, we were able to finish up the last phase and accomplish our primary goal: the design of a new neural network architecture with embedding noise filtering to suppress the preprocessing phase. Interpreting the obtained results, we understand that this novel approach learned the noisy features better and was capable of delivering stable results apart from the noise level on the signal.
5	MARCO ANTONIO COSTA SIMÕES Learning by Demonstration of Coordinated Plans in Multi-Agent Systems Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : TATIANE NOGUEIRA RIOS RITA SUZANA PITANGUEIRA MACIEL REINALDO AUGUSTO DA COSTA BIANCHI JOÃO ALBERTO FABRO LUIS PAULO GONÇALVES DOS REIS Data: 5 juil. 2022 Afficher le Résumé One of the great challenges in MAS is the creation of cooperative plans to deal with the different scenarios that present themselves in a dynamic, real-time environment composed of teams of mobile robots. In this scenario, an agent of the MAS controls each robot, which needs to make complex decisions in a short time in a coordinated manner with the other robots on its team. Despite the many solutions developed based on multi-agent planning and reinforcement learning, a human expert in the problem domain usually sees opportunities for better cooperative plans in many scenarios where robots underperform. The research presented in this thesis consists of capturing the human expert's knowledge to demonstrate how robot teams can better cooperate in solving the problem they must solve. The human expert can indicate the situations in which a cooperative plan can better solve a given problem by watching the performance of a team of robots in action. Consequently, a dataset for training the agents that control the robots can gather the various human observations. For the development of this research, this work used the environment 3DSSIM and the collection of human demonstrations was carried out through a set of tools developed from the adaptation of existing solutions in the RoboCup community using a strategy of crowdsourcing. In addition, fuzzy clustering was used to gather expert demonstrations (setplays) with the same semantic meaning, even with small differences. With the data organized, this thesis used a reinforcement learning mechanism to learn a classification policy that allows agents to decide which group of setplays is best suited to each situation that presents itself in the environment. The results show the ability of the robot team to evolve, from the learning of the suggested setplays and its use in an appropriate way to the abilities of each robot.
6	JOSÉ ROGÉRIO POGGIO MOREIRA Enterprise Architecture:AN APPROACH TO TRACEABILITY AND SYNCHRONIZATION OF COMPUTATIONAL MODELS Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : CARLOS EDUARDO SALGADO FLÁVIA MARIA SANTORO ANA PATRICIA FONTES MAGALHÃES MASCARENHAS LAIS DO NASCIMENTO SALVADOR RITA SUZANA PITANGUEIRA MACIEL Data: 26 juil. 2022 Afficher le Résumé An Enterprise Architecture (EA) represents the fundamental organization of business processes and Information Technology (IT) infrastructure, serving to capture the essentiality of the business, IT and its evolution, being relevant to protecting the business and maintaining the flexibility and adaptability of the organization. An enterprise architecture is formed by levels such as: the strategic, tactical and IT operational levels. Each level, in turn, has distinct computational models, such as strategic models (strategic level), business process models (tactical level) and systems requirements models (IT operational level). In the context of enterprise architecture modeling, the state of the art presents some gaps such as: (i) the lack of traceability of the elements present in the computational models that make up the EA levels or the visualization, in a poorly understandable way, of the chain of elements that relate; and (ii) the lack of synchronization between the computational models of the strategic, tactical and operational IT levels. Among the negative impacts related to these problems, one can list, for example, the obsolescence of models and the difficulty of carrying out impact analysis and, consequently, decision making, in scenarios of organizational changes. In this scenario, this thesis address the problems of lack of traceability and synchronization of strategic models, business processes and information systems requirements of an enterprise architecture. Thus, in order to provide a reduction or elimination of the negative impacts caused by these problems, an approach is proposed to promote the traceability and synchronization of the computational models of an AE. The proposed approach consists of: (i) a set of meta-models, representing the strategic, tactical and operational levels of the EA; (ii) a traceability model that supports configuration and change management through the use of COBIT and ITIL best practices; and (iii) a synchronization framework that uses the model-driven development (MDD) techniques. The purpose of the proposal is to keep the computational models traceable and updated, allowing: (i) support for visualization and understanding of how a set of models and their elements are related; (ii) support for impact analysis and decision making in change scenarios; (iii) to necessary support to avoid the obsolescence of the models; and (iv) the creation and maintenance of the strategic alignment of the business with IT. To evaluate the work, a survey was carried out, in the corporate context of the public and private sphere, which obtained 51 valid responses. The research was not limited to professionals in the area of information technology, but also involved professionals from strategic and administrative areas. The survey results showed that the proposed approach is understandable, useful and contributes to keeping the models traceable, up-to-date and consistent, as well as enabling the performance of impact analyzes and the strategic alignment of the business with IT.
7	ANA MARIA PARANHOS DE AMORIM A process model to support the development of crowdsourcing projects to motivate the participation of older adults Leader : VANINHA VIEIRA DOS SANTOS MEMBRES DE LA BANQUE : VANINHA VIEIRA DOS SANTOS RITA SUZANA PITANGUEIRA MACIEL ADRIANA SANTAROSA VIVACQUA LUCIANA CARDOSO DE CASTRO SALGADO LETÍCIA DOS SANTOS MACHADO Data: 13 oct. 2022 Afficher le Résumé Crowdsourcing is a new concept associated with problem-solving where the crowd is its main element. Knowing what drives this audience to engage in crowdsourcing projects is a challenge, as is building projects that attract this audience. Our studies focused on two aspects. First, in the investigation of the planning and development of crowdsourcing projects, and second in the identification of the factors that lead people to collaborate with these projects. Crowdsourcing has a dynamic structure where knowledge, problem- solving, and sometimes decision-making can be attributed to unknown members of the crowd. These unique characteristics of crowdsourcing must be considered in its building process. In this thesis, our goal is to investigate crowdsourcing systems from two perspec- tives. From the developer’s perspective, we investigated the construction of crowdsourc- ing. And from the user’s perspective, we investigated the factors that motivate users to collaborate on these projects. To achieve this goal, we carried out several studies, using an exploratory qualitative research method. The investigation of the construction process and the investigation of the users’ motivation followed the same procedures: (i) studies selected by literature review, and (ii) semi-structured interviews with developers and users. As the older adults’ population is constantly growing and is little studied in tech- nological field, our motivation study focused on this public. We conducted interviews with 46 older adults over 60 years of age. The interviews’ analysis suggest that older adults are motivated by dynamic tasks, which stimulate their curiosity, increase their knowledge and skills, tasks that help them to care for their physical and mental health and, mainly, by tasks related to altruistic causes. With the results of the studies, we proposed a process model for the construction of crowdsourcing and a motivation model aimed at older adults, both to be used as a guide by managers and software developers.
8	Simone da Silva Amorim ARCHITECTURAL HEALTH OF SOFTWARE ECOSYSTEMS: A PRACTICE-BASED EVALUATION FRAMEWORK Leader : CHRISTINA VON FLACH GARCIA CHAVEZ MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ MANOEL GOMES DE MENDONCA NETO RITA SUZANA PITANGUEIRA MACIEL IGOR FABIO STEINMACHER ROSANA TERESINHA VACCARE BRAGA Data: 29 nov. 2022 Afficher le Résumé Software ecosystems have emerged as an important technological environment where a common software platform and its applications are developed and maintained by a community. This community is composed of internal and external members working to satisfy their needs. Many ecosystems have evolved for years and achieve success as Apple, Facebook, Android, KDE, Hadoop, and others. They have achieved an effective way of developing products with greater sharing of costs, quality, and innovation when compared to traditional software engineering environments. Software ecosystems have called the attention of the researchers to study how they were formed, developed, and survived for a long time. Many studies have been directed to investigate the concept of "Health'' which represents its ability to evolve and succeed over time. Health indicators were defined to obtain a picture of an ecosystem and to know how they are working. There is also a set of metrics proposed to know the real state of the health of a software ecosystem. Existing studies focus on the definition and measurement of health indicators without addressing other perspectives for studying the health of ecosystems. Regarding the perspective of software architecture, we did not find studies on the intersection between software architecture and ecosystem health. Software architecture plays a crucial role in the operation of ecosystems. Several authors highlight its importance in different aspects such as attributes of quality, variability, and design challenges. However, the influences of software architecture on the health of ecosystems have not been explored by now. Our proposal introduces the concept of Architectural Health of Software Ecosystems which covers parts of ecosystem health influenced by architectural issues. Thus, to assess the architectural health of software ecosystems, in this work, we propose the framework HEVAL/SA, defined considering the architectural practices adopted those influence health indicators in a software ecosystem. The evaluation of these practices will indicate a state for the health of the ecosystem. In addition, we validate HEVAL/SA through case studies in real-world open source software ecosystems. The results of HEVAL/SA can help to identify the strengths and weaknesses of the architectural practices adopted by them. The framework will also help to assess the "weight" of the software architecture on the health of ecosystems.
9	Jaziel Souza Lobo UAware Alert: A Context-Aware Platform for Affordable Early Alerting. Leader : VANINHA VIEIRA DOS SANTOS MEMBRES DE LA BANQUE : VANINHA VIEIRA DOS SANTOS MANOEL GOMES DE MENDONCA NETO MARCOS ROBERTO DA SILVA BORGES CLAUDIA LAGE REBELLO DA MOTTA SORAIA SILVA PRIETCH Data: 6 déc. 2022 Afficher le Résumé Contextualization: Natural disasters lead to damage and loss of life around the world. Early Warning Systems (EWS) are systems that send alerts, in advance, to warn the population about these events. Prediction institutions send information about events that can cause disasters in a standard file format known as CAP. In general, an EWS receives a CAP file, extracts its data, and sends warnings to the population. Most EWS does not consider the specific needs of vulnerable groups, and send the same alert to everyone, whether they are, e.g., visually or hearing impaired people, or are in an area with a higher level of risk for the event. Objective: Given the context, the aim of this study is to purpose an EWS to warn people (vulnerable groups or not) who are in risk areas. Method: This research was carried out through a combination approach of the following methods: Systematic Mapping Study, exploratory studies, and empirical studies. Exploratory studies, through semi-structured interviews, served to achieve more familiarity with the research problem. Based on these studies, a context model, behavioral rules, and an automated process for creating an accessible resources library were proposed, in order to enable the development of an EWS that sends alerts to different groups of users. Finally, the evaluation was carried out from two perspectives: (i) evaluating whether the alerts are delivered to the expected recipients and with the expected personalization; (ii) evaluating the context model and behavioral rules. For evaluation (i) an empirical approach was used through quantitative techniques. For evaluation (ii), a survey was conducted with Civil Defense experts from the states of Brazil. Results: This study presents an architecture and a context model with contextual rules for an EWS that considers people from vulnerable groups (for example deaf or blind people). Finally, it presents an implemented and functional prototype, the UAware Alert, with a management interface for sending alerts and a mobile app for receiving the warning in an accessible way. Conclusion: EWS uses CAP messages and the person’s location to send alerts. But to alert people who have different needs, it is also necessary to consider the profile of the person who will receive the alert and their location. Thus, the context model, behavioral rules, and media generation process of this study enables the development of an EWS, which sends instructions focused on the region where the person is located, either in text or in formats accessible to the needs of each user.
10	DANIEL DOMINGOS ALVES Interaction Design in Distributed Software Development: practices, challenges, recommendations and research gaps Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : FREDERICK MARINUS CONSTANT VAN AMSTEL IVALDIR HONORIO DE FARIAS JUNIOR ARTUR HENRIQUE KRONBAUER CHRISTINA VON FLACH GARCIA CHAVEZ ECIVALDO DE SOUZA MATOS Data: 15 déc. 2022 Afficher le Résumé Human-Computer Interaction (HCI) is an area focused on studies on human interaction with/through computational systems, among which stand out tangent studies on interaction design. In HCI, interaction design has been consolidated as a practice with the potential to support interactive systems projects. On the other hand, more and more organizations are developing (conceiving, designing, building, testing) software with geographically distributed teams. However, interaction design in distributed software development (DSD) has been little explored, mapped, or structured in the scientific literature. Although there are reports in the scientific literature about interaction design in DSD and some proposed solutions, it is unclear how interaction design occurs in DSD projects. This research, therefore, was motivated by the advancement of projects in DSD, the increase in the number of projects and users of free/libre open source software (FLOSS), problems related to human-computer interaction, and interaction design challenges in DSD. In this sense, this research investigated how interaction design has been practiced in DSD projects, seeking to understand its characteristics, challenges, and limitations. The methodology of this research was based on Charles S. Peirce’s semiotic methodeutics, bibliographic research method (ad-hoc literature review and systematic mapping study), and mixed methods research (survey and interview study) to investigate the current state of knowledge and practice on interaction design in DSD, in addition to providing a theoretical foundation for the conception of recommendations for the interaction design in DSD. We hope that the results pointed out by this research contribute to the body of knowledge about interaction design at the research frontier between HCI and DSD by (i) providing an overview of research efforts on interaction design in DSD, (ii) providing an overview of the practice of interaction design in DSD projects, (iii) presenting how interaction design occurs in DSD projects, (iv) identify research gaps and discuss future research directions, (v) conceiving a set of recommendations for interaction design in DSD, and (vi) supporting the involvement of end users in the construction of interactive systems artifacts in DSD projects.

Thèses

TATIANA DE OLIVEIRA SILVA
Dedicated protection and restoration hybrid strategy driven by low-priority connections in EON networks.
Leader : GUSTAVO BITTENCOURT FIGUEIREDO
MEMBRES DE LA BANQUE :
GUSTAVO BITTENCOURT FIGUEIREDO
RODRIGO IZIDORO TININI
PEDRO MESQUITA MOURA
Data: 15 févr. 2022
Ata de defesa assinada:

Afficher le Résumé
The spectrum rigidity of wavelength-multiplexed networks (Wavelength Division Multiplexing - WDM) does not allow to accommodate requests proportionally to their bandwidth needs, causing waste in the spectrum. Therefore, Elastic Optical Networks (EON) have proved to be a solution for the future of optical transport networks, bringing flexibility and efficiency in the use of resources. The scalability provided by these networks proves to be an adequate paradigm for the bandwidth requirements for emerging applications on the Internet. Due to the volume of traffic supported by EONs, failure events can cause massive data loss. For this, protection and restoration schemes have been developed in order to minimize such damage. Service Level Agreement (SLA) specifications make it possible for applications to have their resources properly allocated and with their respective service degradation tolerance levels. Higher priority classes of service present greater restrictions to service degradation, while lower priority classes are more flexible to degradation. In this way, the classification of connections and their respective SLA specifications must be taken into account when defining disaster recovery strategies in order to identify the best opportunities that increase the network provisioning or re-provisioning capacity. This work proposes a hybrid survivability scheme (protection and restoration) with bandwidth degradation and restoration delay for low-priority connections transmitted in the backup path of high-priority connections. The proposed algorithm was compared with a path protection strategy with dedicated backup. The results obtained demonstrate that restoration based on service degradation and the use of backup resources reduces the overall network blocking probability and the blocking probability by classes. Furthermore, the proposed strategy increases the ability to restore low-priority connections interrupted by the disaster while maintaining the ability to restore high-priority connections.

GABRIEL DAHIA FERNANDES
META LEARNING FOR FEW-SHOT ONE-CLASS CLASSIFICATION
Leader : MAURICIO PAMPLONA SEGUNDO
MEMBRES DE LA BANQUE :
FABIO AUGUSTO FARIA
MAURICIO PAMPLONA SEGUNDO
RUBISLEY DE PAULA LEMES
Data: 8 mars 2022

Afficher le Résumé
We propose a method that can perform one-class classification given only a small number of examples from the target class and none from the others. We formulate the learning of meaningful features for one-class classification as a meta-learning problem in which the meta-training stage repeatedly simulates one-class classification, using the classification loss of the chosen algorithm to learn a feature representation. To learn these representations, we require only multiclass data from similar tasks. We show how the SVDD method can be used with our method, and also propose a simpler variant based on Prototypical Networks that obtains comparable performance, indicating that learning feature representations directly from data may be more important than which one-class algorithm we choose. We validate our approach by adapting few-shot classification datasets to the few-shot one-class classification scenario, obtaining similar results to the state-of-the-art of traditional one-class classification, and that improves upon that of one-class classification baselines employed in the few-shot setting. Moreover, as a practical application, we employ our method to the biometric task of on-device face verification. In this scenario, it compares unfavorably to the state-of-the-art metric learning technique.

DANIEL DAVID FERNANDES
An Empirical Investigation of Test Smell in Python
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
LEOPOLDO MOTTA TEIXEIRA
RITA SUZANA PITANGUEIRA MACIEL
RODRIGO ROCHA GOMES E SOUZA
Data: 30 mars 2022

Afficher le Résumé
Software testing is an activity in the software development process that looks for defects. Automated testing is composed of code that allows you to run software testing scenarios more quickly, avoiding manual rework. When writing test code, testers may execute bad practices, known as test smells, which are coding patterns that can negatively impact test quality in terms of maintainability, understandability, and defect detection effectiveness. Python became the most widely used programming language in the world in 2020, however, most research on test code quality is conducted for the Java language. The absence of studies regarding Python developers’ perception of test smells, as well as types of test smells not yet detected by tools, can cause damage to software developed in this language. Although the concept of test smells is language independent, studies in one programming language cannot be generalized to the others, because different languages may present different behaviors regarding test smells. This dissertation aims to support Python developers in building and maintaining higher quality test code, providing insight into the occurrences of test smells in Python. To do this, we initially built a dataset with 5303 test files from 90 Python projects collected from GitHub repositories to observe the behavior of test smells in Python. This analysis resulted in the proposal of 4 new test smells with discussions of their potential impacts. In addition, to know the perception of Python developers about test smells, we conducted a survey through questionnaires to understand the perception about software testing of Python developers, which we publish the analysis of the results obtained. As an additional contribution, we developed TEMPY, an open-source 10 test smells detection tool for Python, which achieved 100% accuracy. We evaluated TEMPY through interviews with 10 Python developers, obtaining 99% agreement on the detections pointed out by TEMPY. We hope sharing the findings of this study, Python developers can use the concept of test smell to improve test code quality, thereby supporting software testing activities

Alan Teixeira de Oliveira
Improving the efficiency of cloud systems through latency-sensitive and best-effort task placement
Leader : VINICIUS TAVARES PETRUCCI
MEMBRES DE LA BANQUE :
GEORGE MARCONI DE ARAUJO LIMA
VINICIUS TAVARES PETRUCCI
ESBEL TOMÁS VALERO ORELLANA
Data: 1 avr. 2022

Afficher le Résumé
Cloud computing provides a centralized, service-oriented infrastructure for users around the world. A cloud environment allows running critical applications sensitive to user interaction (such as an online web search), and is also used to run best-effort oriented batch applications (such as audio/video compression, indexing of web pages). Data centers are designed on a large scale to support the various cloud services, distributed across hundreds of thousands of servers, with significant operational and capital expenditures. Thus, optimizing the use of server resources results in significant savings on these systems. As a research opportunity, there is the possibility of exploring idle moments of servers that run critical services, especially during periods of low demand. This research aims to co-locate different types of cloud application tasks (latency-sensitive and best-effort) on the same server in order to increase its utilization, thus improving its efficiency. The work demonstrates that performing co-allocation of latency-sensitive and best-effort tasks provides greater efficiency for cloud servers, as long as the performance of critical application tasks is guaranteed. The work explored the use of specific schedulers for each type of task, latency-sensitive tasks were scheduled and parameterized via real-time scheduling, and batch application tasks used fair best-effort scheduling. The gains in relation to the standard scheduler of the system reached up to 150% when compared to the one proposed in this work, here called EDFCoaloc.

PRABHÁT KUMAR DE OLIVEIRA
Soft Tissue Deformation Simulation in Orthognathic Surgery Planning Using Position Based Dynamics
Leader : ANTONIO LOPES APOLINARIO JUNIOR
MEMBRES DE LA BANQUE :
ANTONIO LOPES APOLINARIO JUNIOR
IEDA MARGARIDA CRUSOE ROCHA REBELLO
KARL PHILIPS APAZA AGUERO
VINICIUS MOREIRA MELLO
Data: 8 avr. 2022

Afficher le Résumé
Deformable bodies is a topic that has been studied in the area of computer graphics in recent decades, with several solutions and proposals for simulations of these types of objects. However, there is still a concern with the balance between physical realism and real-time simulation in an interactive way. Some previous works in the medical field use FEM and MSM to perform simulations, but these approaches have limitations. The use of Position-Based Dynamics (PBD), in turn, has been gaining prominence, being able to provide interactions between different types of objects. The purpose of this work focuses on the use of PBD for real-time simulation of soft and rigid tissues in predicting the results of orthognathic surgery.

ARISTOTELES ESTEVES MARÇAL DA SILVA
Planning in Self-Adaptive Systems: An Approach Based on Model Verification and MAPE-K
Leader : ALINE MARIA SANTOS ANDRADE
MEMBRES DE LA BANQUE :
MARIA VIVIANE MENEZES
ADOLFO ALMEIDA DURAN
ALINE MARIA SANTOS ANDRADE
ALIRIO SANTOS DE SA
SANDRO SANTOS ANDRADE
Data: 12 avr. 2022

Afficher le Résumé
The design and implementation of effective Self-Adaptive Systems (SAS) is a very challenging task, usually due to the existence of complex dynamics between the various elements that make up the system and between these elements and the environment in which they operate. A crucial point, in the SAS project, is the execution of the planning activity (planning): definition of which actions (adaptation plans) must be carried out so that the system adapts and returns to meet the quality levels expected service. Recent research has investigated the use of formal verification techniques to enable the generation of reliable adaptation plans. Through the model checking technique (model checking) and the MAPE-K reference model, we developed the design, implementation and validation of an architecture that defines self-adaptive structures and behaviors using formal models, for the specification planners that generate adaptation plans autonomously, considering self-adaptive properties such as self-organizing and self-healing. The architecture evaluation was carried out through a case study related to the traffic light control problem, with the creation of a planner, in which the adaptation plans were generated, in an off-line way, with the verifier of UPPAAL models and executed in run-time through an implementation of this work. Some application scenarios were simulated, using the SUMO tool, to evaluate the effectiveness of the solutions found. The created planner generated efficient adaptation plans, autonomously, in scenarios where the number of possible adaptation plans was around 2¹². The results indicate that the solutions adopted with our planner were more effective compared to solutions without any self-adaptation approach, when the vehicle flow approached collapse.

EBERTY ALVES DA SILVA
Pipeline for 3D reconstruction of cultural artifacts using different depth and color cameras
Leader : KARL PHILIPS APAZA AGUERO
MEMBRES DE LA BANQUE :
LUCIANO SILVA
ANTONIO LOPES APOLINARIO JUNIOR
KARL PHILIPS APAZA AGUERO
Data: 12 mai 2022

Afficher le Résumé
"In recent years, there have been numerous advances with regard to 3D digitization of cultural heritage, mainly applying a process chain to
three-dimensional reconstruction (pipelines) from expensive high-precision 3D scanners.
In addition, recent studies seek to reconstruct objects and artifacts using low-cost acquisition devices (for example, the Microsoft Kinect sensor) or using more modern photogrammetry techniques (such as Structure from Motion). Low-cost methods are still limited in the context of 3D reconstruction of cultural artifacts, being a real challenge when it comes to the representativeness and final quality of digitally reconstructed 3D models.
Aiming to obtain complete and well-detailed three-dimensional models of cultural artifacts using low-cost technology, this work proposes a 3D reconstruction pipeline that uses point clouds from different sensors, combining captures from a low-cost depth sensor post-processed by imaging techniques. Super-Resolution with high resolution RGB images acquired by an external camera applied in Structure from Motion and Multi-View Stereo algorithms.
The main contribution of this work includes the description of a complete pipeline that improves the information acquisition stage and merges data from different sensors. Various phases of the 3D reconstruction pipeline were also specialized to improve the visual quality of the model.
The pipeline was evaluated and discussed in terms of the results obtained and demonstrates that it contributes to enabling the use of low-cost 3D reconstruction methodologies in the context of digital cultural heritage. The proposed methodology was developed to be applied to the preservation of pieces from the Museum of Archeology and Ethnology of the Federal University of Bahia (MAE/UFBA), being also tested on table objects that enabled the validation process of the experimental results."

MURILO GUERREIRO AROUCA

Gamification as a support for participatory mapping and crowdsourcing approaches in the face of Covid-19

Leader : MARCOS ENNES BARRETO
MEMBRES DE LA BANQUE :
ISA BEATRIZ DA CRUZ NEVES LUSTOSA
MARCOS ENNES BARRETO
RICARDO LUSTOSA BRITO
VANINHA VIEIRA DOS SANTOS
Data: 16 mai 2022

Afficher le Résumé
Due to the impacts caused by the advent of the new Coronavirus (SARS-CoV-2), the mobilization and popular participation in the face of public health policies are fundamental for facing Covid-19 and mitigating its damages, which is widely affected knowledge, attitudes and practices in relation to the disease. However, in view of the context of people in a state of social vulnerability and with low access to education and information, the absence of participation and engagement in socioeconomically disadvantaged communities presents itself as an obstacle that stands in the way of the objectives proposed by health policies. . Thus, the present work proposes the implementation of the GAFCC theory-oriented gamification design model on the +Lugar platform (Covid-19 version), to support participatory crowdsourcing approaches and participatory mapping in favor of initiatives aimed at coping with Covid-19. -19, mainly the mapping and signaling of disease transmission predictors in real time, through a risk perception approach. In order to assess the acceptance and use of the proposed solution, UTAUT was used to identify the determining factors that have a significant impact on the intention to use the platform. To this end, a validation study was carried out with 20 young people and adolescents living in the Marechal Rondon neighborhood, where a form was applied, consisting of questions related to moderating and determining factors, and behavioral intention. The result of the study pointed out that the respondents have high levels of performance and effort expectation, social influence, facilitating conditions, and behavioral intention to use the platform. However, through the analysis of Spearman's correlation coefficient, it was possible to identify that only the factors “effort expectancy” and “social influence” significantly impact behavioral intention.

GESSÉ JUSTINIANO DE OLIVEIRA JÚNIOR
Evaluation of Scheduling Algorithms and Energy Savings for FreeRTOS-based Embedded Systems
Leader : GEORGE MARCONI DE ARAUJO LIMA
MEMBRES DE LA BANQUE :
ALLAN EDGARD SILVA FREITAS
GEORGE MARCONI DE ARAUJO LIMA
VINICIUS TAVARES PETRUCCI
Data: 3 juin 2022

Afficher le Résumé
Embedded systems, increasingly present in our daily lives, are becoming more complex and powerful due to technological advances in computing and electronics. Real-time embedded systems applications related to the automotive and aeronautical industries are prominent examples of classical engineering disciplines, where the conflicts between costs, short product life cycles and legal requirements regarding reliability, robustness and safety arise. became a problem. Indeed, managing energy effectively and efficiently is crucial for all battery powered devices, for example those applied in autonomous robots, wearable devices, industrial controllers and wireless sensor networks. Determining effective scheduling algorithms in real-time systems where task deadlines are met is a topic widely addressed in the state of the art, always aiming to improve applications in different environments.
When the approach applied is via fixed priority scheduling, which does not change as a function of time, the computational cost is lower than dynamic priority scheduling, since it is not necessary to dynamically order the task queue. In embedded systems with small processing power, the use of Earliest-Deadline First (EDF), a dynamic priority scheduling policy, is often less seen, although there are many efforts to use it in this segment, demonstrating that overhead may be feasible. When using the DVFS technique, the frequency and voltage of the CPU supply are manipulated during the execution of the tasks, in order to reduce the power dissipated by the processor. With the use of the DPM technique, the idle components at a given instant of time t, can be turned off or placed in low energy modes in order to reduce the power consumed by the processor when performing some action in the system. The focus of the present work is to adapt FreeRTOS, the real-time operating system, in order to make it capable of dealing with applications that have time constraints and energy consumption. A new energy saving technique based on DPM, with low computational overhead, was created to be used on platforms with few hardware resources.

MAYCON EMILY DE FARIAS RIBEIRO

DESIGN AND DEVELOPMENT OF A ROBUST COMPUTATIONAL TOOL FOR ELECTROCARDIOGRAPHY

Leader : RAIMUNDO JOSE DE ARAUJO MACEDO
MEMBRES DE LA BANQUE :
LUCIANO REBOUCAS DE OLIVEIRA
RAIMUNDO JOSE DE ARAUJO MACEDO
ROBERTO FREITAS PARENTE
Data: 8 juin 2022

Afficher le Résumé
For a long time, electrocardiograms (ECGs) have been used to diagnose heart problems. However, obtaining a satisfactory classification through the technology of digital systems applied to e-health medicine is a difficult task. To study this problem, it was considered that the heart is a reactive system with some possible modes or states of functioning, so it is intuitive to use computational tools such as finite state machines (FEMs) to describe the health status of this human organ. In this paradigm, we propose the Cyber-ECG, a computational model for automatic classification of ECG signals with detection of electrocardiograph or sensor failures. To accomplish this, the Simscape and Stateflow libraries from Simulink were used in the Matlab ecosystem. In addition, a prototype for cardiac pulse detection is presented, using the Medical Internet of Things (IoTM) technology. This prototype consists of an ESP32 board, to collect, aggregate and visualize data on a person's heart pulse in the cloud. The proposed Cyber-ECG tool obtained an accuracy of 84% and 80% when classifying the ARR and NSR classes, respectively. For these same classes, the F1-Score values are approximately 82% and 83%. The precision was 82.5% and the sensitivity was 80%, respectively. Based on these results, the tool

MATHEUS THIAGO MARQUES BARBOSA
Q-balance: A load balancing scheme based on a Multilayer Perceptron for Fog resources on a Smart Grid
Leader : MAYCON LEONE MACIEL PEIXOTO
MEMBRES DE LA BANQUE :
MAYCON LEONE MACIEL PEIXOTO
GUSTAVO BITTENCOURT FIGUEIREDO
BRUNO GUAZZELLI BATISTA
Data: 14 juin 2022

Afficher le Résumé
Data processing in Smart Grids applications can use cloud computing. However, this infrastructure can lead to increased response time in such applications due to the distance between cloud data centers and Smart Meters. In this way, we propose a neural network based approach to manage the computational resources in the fog. In this environment, Q-balance aims to reduce the average response time of applications that use data from Smart Meters through the use of a MultiLayer Perceptron -- (MLP) neural network. MLP predicts the time that a computational resource will process a request from the Smart Meter application. Q-balance uses this information about the forecast to balance the load between available resources, reducing the average response time obtained. The performance evaluation of the experiments showed that the Q-balance reduced the average response time by up to 65% and 79% compared to the algorithms in the literature for fog and cloud respectively.

VICTOR MARTINEZ VIDAL PEREIRA
Exploiting Linked Data in DBpedia to Reduce Prediction Error in Matrix Factorization Recommenders
Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
ADRIANO CÉSAR MACHADO PEREIRA
DANILO BARBOSA COIMBRA
FREDERICO ARAUJO DURAO
RAMON PEREIRA LOPES
RODRIGO ROCHA GOMES E SOUZA
Data: 21 juin 2022

Afficher le Résumé
Recommender Systems provide suggestions for items that are most likely of interest to users. Providing personalized recommendations is a challenge that can be addressed by filtering algorithms among which Collaborative Filtering (CF) has demonstrated much progress in the last few years. By using Matrix Factorization (MF) techniques, CF methods reduce prediction error by using optimization algorithms. However, they usually face problems such as data sparsity and prediction error. Studies point to the use of data available in Semantic Web as a path to improve recommender systems and address the challenges related to CF techniques. Motivated by these premises, the present work developed a data pipeline along with an algorithm that processes the Ratings Matrix combining semantic similarities of Linked Open Data (LOD) and estimates missing rat- ings. The experiments take subsets of three different datasets (Movielens, LastFM and LibraryThing), two semantic similarity metrics, Linked Data Similarity Distance (LDSD) and Resource Similarity (RESIM), and three MF-based algorithms (SVD, SVD++ and NMF). Our experiments reduced sparsity by more than 75% in Movielens subset and 28% in LastFM. Prediction error is reduced in all subsets with statistical confidence using parametric test one-way ANOVA followed by Tukey’s multiple comparison test.

MARCOS RICARDO SANTOS OLIVEIRA
PSGF (Phase Space Gap Filling): A new method to fill missing values in chaotic time series
Leader : RICARDO ARAUJO RIOS
MEMBRES DE LA BANQUE :
RICARDO ARAUJO RIOS
RENATO PORFIRIO ISHII
EWALDO EDER CARVALHO SANTANA
Data: 7 juil. 2022

Afficher le Résumé
The preprocessing step performed to deal with missing or invalid information in datasets is a relevant task in Machine Learning (ML) applications to avoid producing wrong models and make feasible the usage of specific algorithms that do not work in such a condition. In general, missing values occours for different reasons as, for instance, problems in the device used to monitor a system, network issues between monitoring and storage services, and the authentic absence of data. By collecting data in an i.i.d (independent and identically distributed) manner, traditional ML models are able to replace missing values. However, when there are temporal dependencies between collected observations, e.g., time series, such models are unsuitable for not considering the existing relationship in time instants. The treatment of missing data in time series is performed by several techniques such as interpolation methods (e.g. Lagrange, Newton, and Splines) and Singular Spectrum Analysis (SSA). Experiments during this project highlighted that these methods provided poor results when the time series present a chaotic behavior once their attractors in the phase space are not taken into account. Therefore, this work presents a new method that considers Dynamical System and Chaos Theory tools to unfold series from the temporal domain into phase space, making it possible the adoption of ML models to replace missing values. Our results emphasize the importance of this new paradigm to deal with missing values, outperforming the state-of-the-art.

VERUSCA CARIN BARROS ROCHA
EVOLVING A MAP TO SUPPORT THE MANAGEMENT OF TECHNICAL DEBT RELATED TO SOFTWARE TESTING
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
JOSÉ CARLOS MALDONADO
MANOEL GOMES DE MENDONCA NETO
RENATO LIMA NOVAIS
Data: 27 juil. 2022

Afficher le Résumé
Technical Debt (TD) contextualizes a set of tasks that are not performed and are accumulated to be performed later during software development. In software testing, insufficient test coverage, lack of testing, and lack of test case planning are situations in which test debt can be found. In contrast, other types of debt can affect the quality of testing activities. Outdated requirements documentation is an example of a requirements debt situation that can compromise test planning activities. In this scenario, the term test-related debt (TRTD) refers to the problems faced by test professionals due to the presence of TD items. Recently, a conceptual map has been proposed in the technical literature that relates test activities to TD items. This map presents test-related TD causes, indicators and payment practices. However, this map was based on an ad hoc literature review and experimental evidence on its feasibility for use in industry is limited. Despite several works investigating indicators, causes, effects and prevention of TD, there is still a long way to go on how to use this information to support the management of debt items in the context of testing activities. The objective of this work is (i) to present an updated version of a conceptual framework that organizes a set of indicators, causes, effects and preventive practices of TRTD, based on data from the InsighTD project, which is a global questionnaire, which seeks to generate experimental data on TD, its causes, effects and management from software engineering professionals and (ii) to evaluate it for its ease of use, usefulness and possible future use. This evaluation methodology, based on the technology acceptance model (TAM), ran a feasibility study of the maps with 95 participants. As a result of the research, it was observed that the conceptual framework with TD elements can be useful to support the management of TRTD items, as pointed out by 89% of the participants. Most participants also indicated that they would gain productivity, performance, agility and effectiveness using the proposed framework. The framework was also considered easy to learn and use by the study participants.

SAMIA CAPISTRANO GOMES
Evaluating Nexus Modifications Popularity and Positive Evaluations: An Empirical Study
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
RODRIGO PEREIRA DOS SANTOS
EDUARDO SANTANA DE ALMEIDA
RODRIGO ROCHA GOMES E SOUZA
Data: 18 août 2022

Afficher le Résumé
In recent years, computer games have grown in the number of users, application area, and development complexity. In this context, modding stands out, which are changes created by game users (Modders), made available for free through modification communities. One of the largest modification-sharing communities is Nexus Mods, with over 300,000 diverse files available to its users.
The Mods tend to increase the life cycle and sales and help add new functionality to games, but few studies have analyzed them. So we studied the mods in order to identify patterns of artifacts focusing on their popularity and quality (positive evaluations).
This study identified which dimensions could distinguish popular modifications from unpopular ones and the mods with many positive evaluations from those with few positive evaluations. We also identify the features with significant explanatory power under popularity and positive evaluations of the mods. For this, Logistic models were proposed and analyzed using statistical calculations. We analyze 40,865 popular mods and 40,865 unpopular mods, 40,865 mods with many positive evaluations, and 40,865 with few positive evaluations from the Nexus Mods community through 4 dimensions (Mods Categories, Mods Documentation, Mods Community Contribution, and Modders Characteristics ) and 27 characteristics.
As a result, we obtained that all the dimensions studied have the power to distinguish popular modifications from unpopular ones and modifications with many positive evaluations from those with few positive evaluations. Of the 27 features, 23 features have significant explanatory power under mod popularity, and 22 under mod positive evaluations. This study has identified a need to integrate the versions of mods and their version evaluations, enable negative evaluations, create a good description, and add tags to the mods, among other implications and perspectives for the Nexus community, Modders, users, and researchers.

JADNA ALMEIDA DA CRUZ
GRSPOID: A Point of Interest Recommendation System for Groups using Diversification
Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
DANILO BARBOSA COIMBRA
ROSALVO FERREIRA DE OLIVEIRA NETO
Data: 24 août 2022

Afficher le Résumé
In recent years the availability of data on the Web has been driven by the increasing use of social networks and smartphone applications. In addition to textual content, geo-location information is also shared, favoring the emergence of numerous location-based services. The location information of a Point of Interest (POI) can be used to understand the profile of users, their interests, and movements. In this way, it is possible to identify, for example, places of interest for a particular user, and even classify them into categories, such as cafes, universities, bars, malls, etc. This type of data is widely used for Points of Interest Recommender Systems, which aim to assist users in the search for places of interest, whether on a daily basis or during a trip. These systems are traditionally recommended for individuals, however, there are scenarios where individuals gather in groups, thus increasing the complexity of the problem. In addition to the need to find individual preferences, the recommendation must consider the preferences of the group as a whole, which requires the application of a consensus technique. Another obstacle is that non-diversified recommendations tend to always be in the same category, decreasing the group's interest in recommendations from already known Points of Interest. This master's thesis proposes a Points of Interest Recommendation System for Groups using diversity. To evaluate the proposed model, an exhaustive experiment was carried out with 19 groups, some with 3 and others with 5 members. To evaluate the diversified recommendations, precision metrics were used in positions 3, 5, and 10. According to the results, the recommendations for positions 5 and 10 obtained more satisfactory results when diversity was applied. After the experiment with real users, an offline analysis was also performed with variations of the proposed model and aggregation techniques. According to the results obtained, it was possible to verify that the recommendation models with diversity obtained better results than the non-diversified approach in most of the tested configurations.

ADEILSON ANTONIO DA SILVA
On deceiving malware classification with section injection: attack and defense using deep neural networks.
Leader : MAURICIO PAMPLONA SEGUNDO
MEMBRES DE LA BANQUE :
ANDRÉ BRASIL VIEIRA WYZYKOWSKI
KARL PHILIPS APAZA AGUERO
MAURICIO PAMPLONA SEGUNDO
Data: 22 nov. 2022

Afficher le Résumé
We investigate how to modify executable files to deceive malware classification systems. This work's main contribution is a methodology to inject bytes across a malware file randomly and use it both as an attack to decrease classification accuracy but also as a defensive method, augmenting the data available for training. It respects the operating system file format to make sure the malware will still execute after our injection and will not change its behavior. We reproduced five state-of-the-art malware classification approaches to evaluate our injection scheme: one based on GIST+KNN, three CNN variations and one Gated CNN. We performed our experiments on a public dataset with 9,339 malware samples from 25 different families. Our results show that a mere increase of 7% in the malware size causes an accuracy drop between 25% and 40% for malware family classification. They show that a automatic malware classification system may not be as trustworthy as initially reported in the literature. We also evaluate using modified malwares alongside the original ones to increase networks robustness against mentioned attacks. Results show that a combination of reordering malware sections and injecting random data can improve overall performance of the classification. Code available at https://github.com/adeilsonsilva/malware-injection.

AMANDA CHAGAS DE OLIVEIRA
A Recommendation Model for Groups Using Diversification Techniques
Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
DANILO BARBOSA COIMBRA
NATASHA CORREIA QUEIROZ LINO
Data: 28 nov. 2022

Afficher le Résumé
Recommender Systems traditionally recommend items to individual users. However, there are scenarios where individuals gather in groups, and with that arises the need to recommend to groups. Most of these groups form naturally, for example, to watch a movie, have lunch at a restaurant, or even plan a trip. In all these hypotheses, it is possible to use Recommender Systems to offer personalized information to the group as a whole. To do so, it is necessary to consider the individual preferences of the group members, to satisfy them fully, and, in this sense, to use techniques for aggregating this information. Although there are consensus techniques for aggregating information, the recommendations can be repetitive among themselves, as they will always serve the same group profile. This inconvenience sets a precedent for adopting diversity techniques for recommendations to the group. In this work, we investigate how to apply such diversification techniques in group recommendations, based on the members' preferences, to avoid over-specialization of the system and thus keep the group members satisfied in general. To do so, it is necessary to develop a group formation model and then model how the recommendation will be carried out to be able to apply diversification techniques to the items to be recommended. Two experiments were carried out in this research, considering related work as a baseline. In the offline experiment, the result of the proposed model was slightly higher than the baseline, having been 3.8% more accurate and 1.8% more diverse. In the second experiment, the online experiment, the proposed model completely dominated the baseline in terms of accuracy and was slightly inferior in terms of diversity.

Jane Barbosa Santos
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
ECIVALDO DE SOUZA MATOS
JOBERTO SERGIO BARBOSA MARTINS
LEOBINO NASCIMENTO SAMPAIO
MARIA CAROLINA DE SOUZA SAMPAIO
Data: 7 déc. 2022

Afficher le Résumé

LEONAN TEIXEIRA DE OLIVEIRA
A latency-sensitive and network cost awareness approach for allocation of component-based applications in a hierarchical fog.
Leader : MAYCON LEONE MACIEL PEIXOTO
MEMBRES DE LA BANQUE :
DANIEL GOUVEIA COSTA
LUIZ FERNANDO BITTENCOURT
MAYCON LEONE MACIEL PEIXOTO
Data: 13 déc. 2022

Afficher le Résumé
Over the years, Cloud Computing has provided infrastructure for processing applications and services. However, the distance between Cloud computing resources and Internet of Things (IoT) devices can cause delays in response times for some services and applications that have sensitivity and temporal constraints. In this context, Fog Computing emerges as a computational paradigm that seeks to bring Cloud Computing resources closer to IoT devices, geographically distributing micro-data centers (Cloudlets) and reducing response time. Studies using Fog Computing present solutions for the application allocation, addressing that it is possible to meet different types of requirements. These studies evaluate certain requirements for decision making, such as response time and communication cost. However, it is observed in these studies that as a requirement is prioritized, another is neglected. Thus, this work proposes an approach for allocating modular applications in a Fog Computing architecture with multiple hierarchical levels, which seeks to reduce the response time of latency-sensitive applications and reduce data traffic on the network. To achieve this goal, the proposed approach, Least Impact - X (LI-X), seeks to reduce the idleness of resources at the lowest levels of the fog and takes into account the communication between application modules when deciding on the allocation. LI-X was compared to predecessor studies in a simulated iFogSim environment. The results show that the LI-X was able to overcome these studies in most of the proposed scenarios, reducing the response time and the cost of data communication on the network.

Nilvan Santana Souza
Intelligent Resource Allocation for Delayed Protection Based on Machine Learning in Elastic Optical Networks
Leader : GUSTAVO BITTENCOURT FIGUEIREDO
MEMBRES DE LA BANQUE :
MARCIA HELENA MOREIRA PAIVA
GUSTAVO BITTENCOURT FIGUEIREDO
TATIANE NOGUEIRA RIOS
Data: 14 déc. 2022

Afficher le Résumé
Currently, several Data Centers are interconnected worldwide, providing support to cloud services for processing and storing data generated on the internet. The Elastic Optical Network (EON) presents itself as an infrastructure capable of transmitting the high data rates generated in such Data Center networks, creating the Inter-datacenter Elastic Optical Networks (IDC-EONs). This is because the EON network has high bandwidth. In addition, it can operate at different transmission rates, increasing or decreasing the bandwidth according to demand, which makes the network ideal for supplying the world's data flow. In IDC-EONs, minor interruptions in transmission can result in the loss of a large volume of data. Some data can be critical, so their loss can cause harm to network users, hence the need to use protection strategies in IDC-EONs. From the study and analysis of data, it is possible to detect patterns and behaviors of the IDC-EON network, through the Machine Learning (ML) process, from the learning of a past experience. This work presents 3 resource allocation mechanisms for deferred protection in IDC-EON networks from this context. The mechanisms used in this work were based on data analysis and AM techniques to help decisions related to bandwidth allocation in optical paths in IDC-EON networks.

Thèses

STEFANI SILVA PIRES
On learning suitable caching policies in Information-Centric Networks
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
JUSSARA MARQUES DE ALMEIDA
DANIEL SADOC MENASCHE
CHRISTINA VON FLACH GARCIA CHAVEZ
LEOBINO NASCIMENTO SAMPAIO
TATIANE NOGUEIRA RIOS
Data: 11 févr. 2022

Afficher le Résumé
In recent years, Information-centric networking (ICN) has gained attention from the research and industry communities as an efficient and reliable content distribution network paradigm, especially to address content-centric and bandwidth-needed applications together with the heterogeneous requirements of emergent networks, such as the Internet of Things, Vehicular Ad-hoc NETwork, and Mobile Edge Computing. In-network caching is an essential part of ICN architecture design, and the performance of the overall network relies on caching policy efficiency. Therefore, a large number of cache replacement strategies have been proposed to suit the needs of different networks. The literature extensively presents studies on the performance of the replacement schemes in different contexts. The evaluations may present different variations of context characteristics leading to different impacts on the performance of the policies or different results of most suitable policies. Conversely, there is a lack of research efforts to understand how the context characteristics influence policy performance. There is also a lack of initiatives to assist the process of choosing a suitable policy given a specific scenario. In this direction, this thesis address those research gaps by (i) pointing out what is context from the perspective of cache replacement policies and the context characteristics that influence cache behavior, and (ii) proposing a caching meta-policy strategy to assist the choosing process of suitable policies according to the current context. For the context delimitation study, we have conducted an extensive survey of the ICN literature to map reported evidence of different aspects of context regarding the cache replacement schemes. Beyond the contribution of understanding what is context for caching policies, the survey provided a helpful classification of policies based on the context dimensions used to determine the relevance of contents. Moreover, as an investigation of holistic aspects to represent context, and motivated by the emergent area of human-centric networking, we have performed an exploratory case study on a human behavior influence over the policies performance. To accomplish such goal, we carry out a simulation-based study that evaluated the performance of cache replacement policies through clusters formed by users according to their music listening habits. The results fostered the evidence that distinct context aspects have an effect on caching policy performances. Following the context studies, we present a meta-policy strategy capable of learning the most appropriate policy for cache online and dynamically adapting to context variations that leads to changes in which policy is best. The meta-policy benefits from the diversity of policies and its context aspects, decouples the eviction strategy from managing the context information used by the policy, and models the choice of suitable policies as online learning with bandit feedback problem. The meta-policy can support the deployment of a diverse set of self-contained caching policies in different networks. It enables cache routers to work as adaptive systems agnostic to the underlying contexts, such as content request patterns or popularity variations. Experimental results in single and network of caches have shown the meta-policy effectiveness and adaptability to different contexts in synthetic and trace-driven simulations.

ANDRÉ BRASIL VIEIRA WYZYKOWSKI
IDENTITY-AWARE FINGERPRINT SYNTHESIS USING GENERATIVE ADVERSARIAL NETWORKS
Leader : MAURICIO PAMPLONA SEGUNDO
MEMBRES DE LA BANQUE :
DANILO BARBOSA COIMBRA
KARL PHILIPS APAZA AGUERO
MAURICIO PAMPLONA SEGUNDO
RAONI FLORENTINO DA SILVA TEIXEIRA
RODRIGO MINETTO
Data: 23 févr. 2022

Afficher le Résumé
Today’s legal restrictions that protect biometric data privacy are hampering fingerprint recognition research. For instance, all high-resolution fingerprint databases ceased to be publicly available. Besides, there are no hybrid databases that contain high and medium-resolution images on different sensors. Recent works use generative neural networks (GAN) to replicate the distribution of fingerprints in a given training dataset and synthesize new images. However, these methods are troubled with the replication of undesirable visual characteristics and biases (e.g., fingerprint class distribution) from the training dataset and the inability to generate multiple images of the same identity. To address this problem, we present two novel approaches to synthesize fingerprints. The first is a hybrid approach to synthesize realistic, multiresolution and multisensor fingerprints. This approach can generate high and medium-resolution fingerprints on different sensors, maintaining the same identities. The second is a fingerprint generation method divided into two stage: identity generation and acquisition simulation. Unlike previous works, our first stage models fingerprints focusing on identity cues rather than realism, which we achieve by combining class-specific image generation and inpainting-based restoration. In the second stage, we create multiple samples per identity with the combination of sensor modeling (e.g., area, orientation, deformation) and realistic texturing. We favorably compared our synthetic images to the state-of-the-art in terms of biometric recognition performance and seed image quality analysis. Our experiments suggest that the datasets of synthetic images created by our approaches are analogous to an authentic dataset, which can be crucial for researching and developing large-scale biometric systems without incurring major dataset creation costs and privacy-related risks.

DIOGO VINÍCIUS DE SOUSA SILVA

Using Clustering Techniques and Markov Chains for Long-Tailed Item Recommender Systems

Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
MAYCON LEONE MACIEL PEIXOTO
RAFAEL AUGUSTO DE MELO
RENATO DE FREITAS BULCÃO NETO
PEDRO DE ALCÂNTARA DOS SANTOS NETO
Data: 9 juin 2022

Afficher le Résumé
Recommendation Systems focus on recommending the relevant items to end-users. Usually, the most important items are the most popular ones. However, the emergence of new forms of content distribution in the most diverse markets implied the emergence of the long tail phenomenon, i.e., the creation of new niche products market. Considering the growth of groups related to niche markets, the study of recommendations in the long tail has become increasingly demanding. Nevertheless, long-tail items naturally tend to generate fewer inputs to information systems such as purchase logs, user feedback, and ratings. As a result, it becomes more difficult to recommend items from the long tail. Motivated by these premisses, the primary purpose of this thesis is to develop and exploit recommendation models capable of leading users to niche items located in the long tail, but at the same time highly relevant ones. For this, two major techniques of clustering and representation of matrices through graphs are explored. The first technique adopts Markov chains to calculate similarities of the nodes of a user-item graph. The second technique applies clustering to the set of items in a dataset. Such a combination aims to give more visibility to the long tail items. Experiments were carried out to evaluate the approach and measure the effectiveness of the recommendations considering the long tail context. Metrics such as recall, diversity, and popularity of the generated recommendations were calculated and compared with techniques whose objectives do not directly cover long-tail recommendations. In addition, a questionnaire applied to experts in the business domain also complemented the evaluation of the proposed approaches. By comparing our proposals against three state-of-the-art baselines, the results show that it is possible to improve the accuracy of the recommendations even by focusing on less popular items, in this case, niche products that form the long tail. The recall in some cases improved by about 27.9%, while the popularity of recommended items has declined. In addition, the recommendations show to contain more diversified items indicating better exploitation of the long tail.

JOÃO PAULO PEREIRA DE SÁ CANÁRIO
On deep learning features for noisy time series classification
Leader : RICARDO ARAUJO RIOS
MEMBRES DE LA BANQUE :
RICARDO ARAUJO RIOS
RUBISLEY DE PAULA LEMES
EDUARDO FURTADO DE SIMAS FILHO
MARCELO KEESE ALBERTINI
GLORIA MILLARAY JULIA CURILEM SALDIAS
Data: 4 juil. 2022

Afficher le Résumé
The adoption of Deep Neural Network (DNN) methods to solve problems in real-world scenarios has been increasing as the data volume grows. Although such methods present impressive results in supervised learning, it is known that the occurrence of noises modifying the original data behavior can affect the model accuracies and, consequently, the generalization process, which is highly relevant in learning tasks. Several approaches have been proposed to reduce the impact of noise on the final model, varying since the application of preprocessing steps to the design of robust DNN layers. However, we have noticed that such approaches were not systematically assessed to understand how the noise influences have been propagated throughout the DNN architectures. This gap motivated us to design this work, which was focused on modeling noisy data with temporal dependencies, typically referred to as time series or signals. In summary, our main claim was to create a network capable of acting as a noise filter and being easily connected to existing networks. To reach this goal, we have defined a methodology, which was organized into four phases: i) execution of a study about the application of DNNs to model signals collected from a real-world problem; ii) investigation of different preprocessing tools to transform such signals and reduce noise influences; iii) analysis about the impact of increasing/reducing the noise on the final model; and iv) creation of a new DNN that can be embedded into DNN architectures and act as noise filtering layer to keep the overall performances. The first and second phases were achieved in collaboration with researchers from the Universidad de La Frontera, which provided a set of signals directly collected from the Llaima volcano in Chile. The modeling performed on such signals allowed the creation of a new architecture called SeismicNet. By knowing the behavior or such signals, we could create a controlled scenario with different additive noise levels and outputs produced by our original models, thus meeting the third phase. Next, we performed two new studies to understand the impact of noises in our scenario. Firstly, we used statistical tests to confirm the error variation when noise is added to the expected signals. Then, we used XAI (eXplainable Artificial Intelligence) to visually comprehend the noise propagation into the DNN layers. Finally, we were able to finish up the last phase and accomplish our primary goal: the design of a new neural network architecture with embedding noise filtering to suppress the preprocessing phase. Interpreting the obtained results, we understand that this novel approach learned the noisy features better and was capable of delivering stable results apart from the noise level on the signal.

MARCO ANTONIO COSTA SIMÕES

Learning by Demonstration of Coordinated Plans in Multi-Agent Systems

Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
TATIANE NOGUEIRA RIOS
RITA SUZANA PITANGUEIRA MACIEL
REINALDO AUGUSTO DA COSTA BIANCHI
JOÃO ALBERTO FABRO
LUIS PAULO GONÇALVES DOS REIS
Data: 5 juil. 2022

Afficher le Résumé
One of the great challenges in MAS is the creation of cooperative plans to deal with the different scenarios that present themselves in a dynamic, real-time environment composed of teams of mobile robots. In this scenario, an agent of the MAS controls each robot, which needs to make complex decisions in a short time in a coordinated manner with the other robots on its team. Despite the many solutions developed based on multi-agent planning and reinforcement learning, a human expert in the problem domain usually sees opportunities for better cooperative plans in many scenarios where robots underperform. The research presented in this thesis consists of capturing the human expert's knowledge to demonstrate how robot teams can better cooperate in solving the problem they must solve. The human expert can indicate the situations in which a cooperative plan can better solve a given problem by watching the performance of a team of robots in action.
Consequently, a dataset for training the agents that control the robots can gather the various human observations. For the development of this research, this work used the environment 3DSSIM and the collection of human demonstrations was carried out through a set of tools developed from the adaptation of existing solutions in the RoboCup community using a strategy of crowdsourcing. In addition, fuzzy clustering was used to gather expert demonstrations (setplays) with the same semantic meaning, even with small differences. With the data organized, this thesis used a reinforcement learning mechanism to learn a classification policy that allows agents to decide which group of setplays is best suited to each situation that presents itself in the environment. The results show the ability of the robot team to evolve, from the learning of the suggested setplays and its use in an appropriate way to the abilities of each robot.

JOSÉ ROGÉRIO POGGIO MOREIRA
Enterprise Architecture:AN APPROACH TO TRACEABILITY AND SYNCHRONIZATION OF COMPUTATIONAL MODELS
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
CARLOS EDUARDO SALGADO
FLÁVIA MARIA SANTORO
ANA PATRICIA FONTES MAGALHÃES MASCARENHAS
LAIS DO NASCIMENTO SALVADOR
RITA SUZANA PITANGUEIRA MACIEL
Data: 26 juil. 2022

Afficher le Résumé
An Enterprise Architecture (EA) represents the fundamental organization of business processes and Information Technology (IT) infrastructure, serving to capture the essentiality of the business, IT and its evolution, being relevant to protecting the business and maintaining the flexibility and adaptability of the organization.
An enterprise architecture is formed by levels such as: the strategic, tactical and IT operational levels. Each level, in turn, has distinct computational models, such as strategic models (strategic level), business process models (tactical level) and systems requirements models (IT operational level).
In the context of enterprise architecture modeling, the state of the art presents some gaps such as: (i) the lack of traceability of the elements present in the computational models that make up the EA levels or the visualization, in a poorly understandable way, of the chain of elements that relate; and (ii) the lack of synchronization between the computational models of the strategic, tactical and operational IT levels.
Among the negative impacts related to these problems, one can list, for example, the obsolescence of models and the difficulty of carrying out impact analysis and, consequently, decision making, in scenarios of organizational changes.
In this scenario, this thesis address the problems of lack of traceability and synchronization of strategic models, business processes and information systems requirements of an enterprise architecture.
Thus, in order to provide a reduction or elimination of the negative impacts caused by these problems, an approach is proposed to promote the traceability and synchronization of the computational models of an AE. The proposed approach consists of: (i) a set of meta-models, representing the strategic, tactical and operational levels of the EA; (ii) a traceability model that supports configuration and change management through the use of COBIT and ITIL best practices; and (iii) a synchronization framework that uses the model-driven development (MDD) techniques.
The purpose of the proposal is to keep the computational models traceable and updated, allowing: (i) support for visualization and understanding of how a set of models and their elements are related; (ii) support for impact analysis and decision making in change scenarios; (iii) to necessary support to avoid the obsolescence of the models; and (iv) the creation and maintenance of the strategic alignment of the business with IT.
To evaluate the work, a survey was carried out, in the corporate context of the public and private sphere, which obtained 51 valid responses. The research was not limited to professionals in the area of information technology, but also involved professionals from strategic and administrative areas. The survey results showed that the proposed approach is understandable, useful and contributes to keeping the models traceable, up-to-date and consistent, as well as enabling the performance of impact analyzes and the strategic alignment of the business with IT.

ANA MARIA PARANHOS DE AMORIM
A process model to support the development of crowdsourcing projects to motivate the participation of older adults
Leader : VANINHA VIEIRA DOS SANTOS
MEMBRES DE LA BANQUE :
VANINHA VIEIRA DOS SANTOS
RITA SUZANA PITANGUEIRA MACIEL
ADRIANA SANTAROSA VIVACQUA
LUCIANA CARDOSO DE CASTRO SALGADO
LETÍCIA DOS SANTOS MACHADO
Data: 13 oct. 2022

Afficher le Résumé
Crowdsourcing is a new concept associated with problem-solving where the crowd is its main element. Knowing what drives this audience to engage in crowdsourcing projects is a challenge, as is building projects that attract this audience. Our studies focused on two aspects. First, in the investigation of the planning and development of crowdsourcing projects, and second in the identification of the factors that lead people to collaborate with these projects. Crowdsourcing has a dynamic structure where knowledge, problem- solving, and sometimes decision-making can be attributed to unknown members of the crowd. These unique characteristics of crowdsourcing must be considered in its building process. In this thesis, our goal is to investigate crowdsourcing systems from two perspec- tives. From the developer’s perspective, we investigated the construction of crowdsourc- ing. And from the user’s perspective, we investigated the factors that motivate users to collaborate on these projects. To achieve this goal, we carried out several studies, using an exploratory qualitative research method. The investigation of the construction process and the investigation of the users’ motivation followed the same procedures: (i) studies selected by literature review, and (ii) semi-structured interviews with developers and users. As the older adults’ population is constantly growing and is little studied in tech- nological field, our motivation study focused on this public. We conducted interviews with 46 older adults over 60 years of age. The interviews’ analysis suggest that older adults are motivated by dynamic tasks, which stimulate their curiosity, increase their knowledge and skills, tasks that help them to care for their physical and mental health and, mainly, by tasks related to altruistic causes. With the results of the studies, we proposed a process model for the construction of crowdsourcing and a motivation model aimed at older adults, both to be used as a guide by managers and software developers.

Simone da Silva Amorim
ARCHITECTURAL HEALTH OF SOFTWARE ECOSYSTEMS: A PRACTICE-BASED EVALUATION FRAMEWORK
Leader : CHRISTINA VON FLACH GARCIA CHAVEZ
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
MANOEL GOMES DE MENDONCA NETO
RITA SUZANA PITANGUEIRA MACIEL
IGOR FABIO STEINMACHER
ROSANA TERESINHA VACCARE BRAGA
Data: 29 nov. 2022

Afficher le Résumé
Software ecosystems have emerged as an important technological environment where a common software platform and its applications are developed and maintained by a community. This community is composed of internal and external members working to satisfy their needs. Many ecosystems have evolved for years and achieve success as Apple, Facebook, Android, KDE, Hadoop, and others. They have achieved an effective way of developing products with greater sharing of costs, quality, and innovation when compared to traditional software engineering environments.

Software ecosystems have called the attention of the researchers to study how they were formed, developed, and survived for a long time. Many studies have been directed to investigate the concept of "Health'' which represents its ability to evolve and succeed over time. Health indicators were defined to obtain a picture of an ecosystem and to know how they are working. There is also a set of metrics proposed to know the real state of the health of a software ecosystem. Existing studies focus on the definition and measurement of health indicators without addressing other perspectives for studying the health of ecosystems.

Regarding the perspective of software architecture, we did not find studies on the intersection between software architecture and ecosystem health. Software architecture plays a crucial role in the operation of ecosystems. Several authors highlight its importance in different aspects such as attributes of quality, variability, and design challenges. However, the influences of software architecture on the health of ecosystems have not been explored by now. Our proposal introduces the concept of Architectural Health of Software Ecosystems which covers parts of ecosystem health influenced by architectural issues.

Thus, to assess the architectural health of software ecosystems, in this work, we propose the framework HEVAL/SA, defined considering the architectural practices adopted those influence health indicators in a software ecosystem. The evaluation of these practices will indicate a state for the health of the ecosystem. In addition, we validate HEVAL/SA through case studies in real-world open source software ecosystems. The results of HEVAL/SA can help to identify the strengths and weaknesses of the architectural practices adopted by them. The framework will also help to assess the "weight" of the software architecture on the health of ecosystems.

Jaziel Souza Lobo
UAware Alert: A Context-Aware Platform for Affordable Early Alerting.
Leader : VANINHA VIEIRA DOS SANTOS
MEMBRES DE LA BANQUE :
VANINHA VIEIRA DOS SANTOS
MANOEL GOMES DE MENDONCA NETO
MARCOS ROBERTO DA SILVA BORGES
CLAUDIA LAGE REBELLO DA MOTTA
SORAIA SILVA PRIETCH
Data: 6 déc. 2022

Afficher le Résumé
Contextualization: Natural disasters lead to damage and loss of life around the world. Early Warning Systems (EWS) are systems that send alerts, in advance, to warn the population about these events. Prediction institutions send information about events that can cause disasters in a standard file format known as CAP. In general, an EWS receives a CAP file, extracts its data, and sends warnings to the population. Most EWS does not consider the specific needs of vulnerable groups, and send the same alert to everyone, whether they are, e.g., visually or hearing impaired people, or are in an area with a higher level of risk for the event. Objective: Given the context, the aim of this study is to purpose an EWS to warn people (vulnerable groups or not) who are in risk areas. Method: This research was carried out through a combination approach of the following methods: Systematic Mapping Study, exploratory studies, and empirical studies. Exploratory studies, through semi-structured interviews, served to achieve more familiarity with the research problem. Based on these studies, a context model, behavioral rules, and an automated process for creating an accessible resources library were proposed, in order to enable the development of an EWS that sends alerts to different groups of users. Finally, the evaluation was carried out from two perspectives: (i) evaluating whether the alerts are delivered to the expected recipients and with the expected personalization; (ii) evaluating the context model and behavioral rules. For evaluation (i) an empirical approach was used through quantitative techniques. For evaluation (ii), a survey was conducted with Civil Defense experts from the states of Brazil. Results: This study presents an architecture and a context model with contextual rules for an EWS that considers people from vulnerable groups (for example deaf or blind people). Finally, it presents an implemented and functional prototype, the UAware Alert, with a management interface for sending alerts and a mobile app for receiving the warning in an accessible way. Conclusion: EWS uses CAP messages and the person’s location to send alerts. But to alert people who have different needs, it is also necessary to consider the profile of the person who will receive the alert and their location. Thus, the context model, behavioral rules, and media generation process of this study enables the development of an EWS, which sends instructions focused on the region where the person is located, either in text or in formats accessible to the needs of each user.

DANIEL DOMINGOS ALVES
Interaction Design in Distributed Software Development: practices, challenges, recommendations and research gaps
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
FREDERICK MARINUS CONSTANT VAN AMSTEL
IVALDIR HONORIO DE FARIAS JUNIOR
ARTUR HENRIQUE KRONBAUER
CHRISTINA VON FLACH GARCIA CHAVEZ
ECIVALDO DE SOUZA MATOS
Data: 15 déc. 2022

Afficher le Résumé
Human-Computer Interaction (HCI) is an area focused on studies on human interaction with/through computational systems, among which stand out tangent studies on interaction design. In HCI, interaction design has been consolidated as a practice with the potential to support interactive systems projects. On the other hand, more and more organizations are developing (conceiving, designing, building, testing) software with geographically distributed teams. However, interaction design in distributed software development (DSD) has been little explored, mapped, or structured in the scientific literature. Although there are reports in the scientific literature about interaction design in DSD and some proposed solutions, it is unclear how interaction design occurs in DSD projects. This research, therefore, was motivated by the advancement of projects in DSD, the increase in the number of projects and users of free/libre open source software (FLOSS), problems related to human-computer interaction, and interaction design challenges in DSD. In this sense, this research investigated how interaction design has been practiced in DSD projects, seeking to understand its characteristics, challenges, and limitations. The methodology of this research was based on Charles S. Peirce’s semiotic methodeutics, bibliographic research method (ad-hoc literature review and systematic mapping study), and mixed methods research (survey and interview study) to investigate the current state of knowledge and practice on interaction design in DSD, in addition to providing a theoretical foundation for the conception of recommendations for the interaction design in DSD. We hope that the results pointed out by this research contribute to the body of knowledge about interaction design at the research frontier between HCI and DSD by (i) providing an overview of research efforts on interaction design in DSD, (ii) providing an overview of the practice of interaction design in DSD projects, (iii) presenting how interaction design occurs in DSD projects, (iv) identify research gaps and discuss future research directions, (v) conceiving a set of recommendations for interaction design in DSD, and (vi) supporting the involvement of end users in the construction of interactive systems artifacts in DSD projects.

	Thèses
1	MOARA SOUSA BRITO LESSA On the Selection of Open Source Software Projects for Software Engineering Education Leader : CHRISTINA VON FLACH GARCIA CHAVEZ MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ IGOR FABIO STEINMACHER RODRIGO ROCHA GOMES E SOUZA Data: 5 févr. 2021 Afficher le Résumé Context: Teaching Software Engineering (SE) in higher education is a challenging task. A solid theoretical base should be provided to students, including basic SE concepts (requirements, design, tests, among others), and topics related to project management, human-computer interaction, configuration and release management. Practical experience, communication skills and professionalism are dimensions that should also be considered, including teamwork and ethics, to prepare students for a career in the software industry. Free/open source software (FLOSS) has been used in Software Engineering Education (SEE) as a viable strategy to address the need for training students' technical and non-technical skills and prepare them to deal with the challenges posed by the evolving software industry. Some studies report that the pedagogical use of FLOSS projects has brought benefits but also challenges that may hinder their effective adoption in SEE. A common challenge is project selection, due to the diversity of available FLOSS projects and the complexity of the task. In general, the teacher must search for, manually or with the support of different tools (not necessarily designed for use in Education), one or more projects for pedagogical use in the course. Several criteria can be considered for the selection of a FLOSS project for pedagogical use in SEE, including project size, number of tests, programming language, community size and their best practices. Problem: The selection of FLOSS projects for pedagogical use in SEE requires effort and experience, and there are few tools that support such complex task while providing clear and well-defined selection criteria to be used by the teacher. Thus, the burden of selecting FLOSS projects can hinder their use and adoption in the context of SEE. Objective: This research aimed to investigate the usefulness of an automated approach to support the teacher in the selection of open source projects guided by criteria that take into account socio-technical aspects of the FLOSS project, for pedagogical use in SEE higher education. Reasearch Methods: A literature review was carried out to identify criteria already in use for the selection of FLOSS projects. Then, a set of selection criteria was identified, documented, operationalized and implemented in an open source tool called FlossSearch.Edu. The first version of the tool was evaluated by students and professors from different Brazilian higher education institutions. Initially, students used FlossSearch.Edu, in the context of a SE discipline, to select FLOSS projects based on criteria defined by the teacher, and then evaluated the use of the tool through a survey with questions based on the Technology Acceptance Model (TAM). In a qualitative and quantitative study, teachers already familiar with the pedagogical use of FLOSS projects, used FlossSearch.Edu in an individual session, recorded and guided by predefined scenarios, while reporting their impressions out loud (``Think Aloud''protocol). At the end of the sessions, teachers evaluated the use of the tool using a TAM questionnaire. Findings: In the context of selecting FLOSS projects in which FlossSearch.Edu was used, the tool was well evaluated. The majority of students who used FlossSearch.Edu in the classroom and teachers who participated in the second assessment said that the tool is useful, easy to use and that they intend to use it in the future. Several suggestions for improvement were received and should guide the evolution of the tool and further studies.
2	VITOR MORAES ARANHA SCREEN-SPACE VIRTUAL POINT LIGHT PROPAGATION FOR REAL-TIME GLOBAL ILLUMINATION Leader : ANTONIO LOPES APOLINARIO JUNIOR MEMBRES DE LA BANQUE : ANTONIO LOPES APOLINARIO JUNIOR KARL PHILIPS APAZA AGUERO RICARDO GUERRA MARROQUIM Data: 1 mars 2021 Afficher le Résumé Proper light-transport simulation adds both realism and aesthetically pleasing effects to virtual 3D scenes. However, the cost of computing complex light interactions is prohibitive for real-time applications. Indirect diffuse lighting is a low frequency component of global illumination that can greatly enhance the quality of an image. Virtual point lights are commonly used to reproduce diffuse bounces by tracing light paths through the scene and creating proxy light sources at the intersections between a path and geometry. In this work we extend clusterization-based methods for Virtual Point Lights, allowing for the reproduction of up to two bounces of light with a projection-aware sampling method in real-time. We show that plausible images can be obtained in real-time rates for low-to-mid end commodity GPUs.
3	FRANKLIN DE JESUS SILVA Highly-configurable systems in software startups: unveiling the white label model Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : IVAN DO CARMO MACHADO RITA SUZANA PITANGUEIRA MACIEL ADRIANO BESSA ALBUQUERQUE Data: 25 mars 2021 Afficher le Résumé Startups are small companies that seek to explore new businesses by embodying new technologies to different markets through innovation. Startups' Ecosystems exist to provide a supportive environment for these companies, being a valuable source of networking and knowledge to them. White label software projects are developed by Startups and known in Ecosystems as highly adaptable products, capable of generating new products faster than stand-alone applications, ensuring the best cost x benefit. Startups are divided into stages, and they face different challenges depending on the current stage. In the early stages, they barely plan their development activities but rather assess the market's needs and find users for their initial product, which leaves technical debt. White Label software projects suffer more from these issues since Startups do not address advanced code reuse techniques widely known by the academy. In the late stages, startups need to solve the technical debt managed in the early stages and need even more scalable processes to grow their business that involves White Label Software projects. This work seeks to unveil the concept of White Label software projects and their feasibility for software startups; investigate if and how advanced code reuse techniques, such as highly-configurable systems, could be used as opportunity-lever for software startups; build an adaptable framework for helping White Label software projects manage technical debt and face its challenges. In order to accomplish the aforementioned goals, we applied three empirical studies under different public. A semi-structured interview with startups using White Label software projects, an online questionnaire applied for stakeholders from ecosystems of innovation, and a Multivocal Literature Review with gray literature articles over the internet about White Label software projects. The yielded results presented a compelling portrait of how software startups have dealt with software reuse in their daily practices, with particular attention on how White Label software development has been explored in their projects. The data gathered allowed the creation of the framework above for mobile White Label software projects. In addition, this work gives a path of improvement that can be applied for startups working with White Label software projects under different stages, besides giving researchers a set of open challenges to be studied in future work.
4	Luciane de Andrade Meconi BOA – BusinessProcess Oriented Architecture: Uma Proposta para Publicação, Busca/Descoberta e Composição de Processos de Negócio em SoS Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : FLAVIO OQUENDO LAIS DO NASCIMENTO SALVADOR RITA SUZANA PITANGUEIRA MACIEL Data: 21 juin 2021 Afficher le Résumé Os sistemas de software têm apresentado um crescimento exponencial, tornando-se cada vez maiores e complexos. Os sistemas complexos são sistemas caracterizados pela ausência de autoridade central, não são projetados a partir de uma especificação conhecida, mas envolvem diferentes partes interessadas em criar sistemas funcionais para novas finalidades. Apresentam também uma variedade de comportamentos, incluindo a auto-organização, propriedades emergentes, comportamento não-linear e, com frequência, contraintuitivos. Um sistema complexo com independência gerencial e operacional dos componentes é considerado um Sistema de Sistemas (SoS), não importando a complexidade individual dos seus subsistemas. O SoS é uma nova classe de sistemas, resultante, muitas vezes, da interação de vários sistemas, independente da operação. Um estilo arquitetônico padrão amplamente aplicado aos SoS é a SOA (Arquitetura Orientada a Serviço). Apesar de SOA ser vista como uma solução promissora para viabilizar a organização e utilização das funções distribuídas sob o domínio de diferentes proprietários de SoS, os recursos de software são acondicionados como serviços. A granularidade fina dos serviços pode dificultar a composição de um SoS, tendo em vista a grande quantidade de relacionamentos e conexões necessárias entre os sistemas constituintes para que componham serviços para prover suas funcionalidades. Essa dificuldade é ainda maior em SoS colaborativo, uma vez que a entrada e saída dos sistemas constituintes ocorre de maneira dinâmica. A elevação do nível de serviços para processos já vem sendo utilizada para diminuir a complexidade no gerenciamento de sistemas, processos e pessoas de uma mesma organização. As arquiteturas de referência no contexto SoS visam apoiar a padronização de interfaces dos sistemas constituintes, auxiliando às empresas no desenho de seus sistemas e, consequentemente, viabilizando a interoperabilidade entre estes sistemas. Algumas ferramentas conhecidas no mercado como BPMS, de domínio público ou privado, são utilizadas para apoiar o gerenciamento de processos sob a ótica do BPM, porém estas ferramentas não se mostram adaptadas ao domínio SoS, sendo desenvolvidas para atender às necessidades internas das organizações. Esta dissertação apresenta uma abordagem arquitetural orientada a processos de negócio, voltada para o domínio de Sistema de Sistemas, denominada BOA (Business Process Oriented Architecture). O objetivo foi propor, especificar, implementar e avaliar aspectos de uma Arquitetura Orientada a Processos de Negócio, específica para publicação, descoberta/busca e composição de processos de negócio. A BOA é composta por uma definição de uma linguagem de descrição de processos de negócio (BPDL) e pela implementação de uma ferramenta de publicação, busca/descoberta e composição de processo de negócio, denominada neste trabalho como Plataforma BOA Suite. Para avaliação da nossa proposta foi realizado um estudo exploratório, com abordagem qualitativa, utilizando como ferramenta a entrevista. Os resultados deste estudo evidenciaram que os participantes, a partir de cenários de avaliação e da interação com a BOA Suite, conseguiram realizar a publicação, busca/descoberta e composição de processos de negócio, evidenciando, assim, a viabilidade da proposta. Como contribuição, espera-se diminuir o nível de complexidade da composição por processos de negócio e auxiliar no gerenciamento da integração dos sistemas participantes de um SoS.
5	DIEGO CORREA DA SILVA Exploring Weighted Calibration, Balancing, and Metrics for Justice in Recommender Systems Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO TATIANE NOGUEIRA RIOS LEANDRO BALBY MARINHO Data: 1 juil. 2021 Afficher le Résumé Recommendation systems are tools used to suggest items that might be of interest to users. These systems are based on the user's preference history to generate a list of suggestions that are more similar to the items in the user's history, aiming at better accuracy and less error. It is expected that when recommending an item, the user gives a feedback to the system, indicating if he liked or how much he liked the recommended item. User interaction with the system enables a better understanding of users' tastes, which over time, add more and more items to their preference profile. The recommendation based on the similarity of the item with the preferences, seeking the best precision can cause side effects in the list, such as: overspecialization of the recommendations in a certain core of items, little diversity of categories and imbalance of category or gender. Thus, this dissertation aims to explore calibration, which is a means to produce recommendations that are relevant to users and at the same time consider all areas of their preferences, seeking to avoid disproportion in the recommendation list. For this, ways of weighing the balance between the relevance of the recommendations and the calibration based on measures of divergence are discussed. The hypothesis is that calibration can positively contribute to fairer recommendations according to user preference. The research is carried out through a broad approach that contemplates nine recommendation algorithms applied in the film and music domains, analyzing three of divergence measures, two custom balance weights and two balances between relevance-calibration. The assessment is analyzed with widely used metrics as well as proposed metrics. The results indicate that calibration has positive effects both for recommendation accuracy and fairness with user preferences, creating recommendation lists that respect all areas. The results also indicate which is the best combination to obtain the best performance when applying the calibration proposals.
6	NILDO CEZARIO DA SILVA JUNIOR A Multimethod Study on Understanding Test Smells in the Brazilian Software Industry Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : IVAN DO CARMO MACHADO RITA SUZANA PITANGUEIRA MACIEL CARLA ILANE MOREIRA BEZERRA Data: 21 juil. 2021 Afficher le Résumé Software is present in everyday life and has aroused increasing interest in society. Its development process encompasses activities from understanding the requirements to delivering the executable software and its subsequent maintenance and evolution. However, the software could have defects, leading to losses of a large order of magnitude. Software testing is an effective strategy for early identification of defects, that is, before delivery to the end-user, and a consequent reduction in damage and associated costs. Testing involves activities, manual or automated, of analyzing the operational behavior of software systems, considering the input data set and output validation. The prevalence of automated test mechanisms is notorious, given time, effort, and cost savings in test development and execution. However, while creating automated tests, code may be implemented from inappropriate standards, resulting in code that is difficult to understand or even hinder maintenance activities. Such anti-patterns are called test smells. The literature presents studies aimed at analyzing the effects of test smells in the test code and tools that automate the detection and correction of test smells. However, little is known about the knowledge of industry professionals about this concept. Thus, this study aims to empirically analyze the knowledge related to test smells from professionals’ perspectives in the Brazilian software industry. We seek to identify programming practices that influence the insertion of test smells in the test code and analyze the existence of procedures aimed at preventing, identifying, and correcting test smells. To accomplish this goal, we divided the research into four stages: bibliographic survey, application of a survey with professionals from the software industry, interviewing with a selection of these professionals, and finally, developing a grounded theory. The survey included the participation of 60 professionals, and it was possible to infer the practices most frequently adopted in practice and a preliminary assessment of the impact of professional experience on the use of these practices. In the interview study, carried out with 50 professionals, it was possible to assess the level of knowledge about the concept of test smells. Data-based theory identified five areas related to improving the quality of automated testing: competency, technical alignment, test development, quality improvement, and tool use. The presentation of the level of knowledge about test smells and the quality verification practices of software tests are the main contributions of this research, which still foresees possible future investigations.
7	RAMON ARAÚJO GOMES Antecipanting Technical Debt Items in Model Driven Development Projects Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : RITA SUZANA PITANGUEIRA MACIEL CLAUDIO NOGUEIRA SANT ANNA UIRÁ KULESZA Data: 19 août 2021 Afficher le Résumé ABSTRACT Model Driven Development (MDD) and the Technical Debt (TD) metaphor are software engineering approaches that look for promoting quality of systems under development. While MDD does it through the use of models as primary artifacts in the software development process, TD management does it by correctly dealing with problems that, out of control, can impair system maintenance and evolution. Most research on TD focuses on application code as primary TD sources. In an MDD project, however, dealing with technical debt only on the source code may not be an adequate strategy because code generation is often done at a later stage than creating models, and then dealing with TD only in source code can lead to unnecessary interest payments due to unmanaged debts. Recent works concluded that MDD project codes are not technical debt free, making it necessary to investigate the possibility and bene ts of applying TD identi cation techniques in earlier stages of the development process, such as in modeling phases. This work intends to analyze whether it is possible to use source code technical debt detection strategies to identify TD on code-generating models in the context of model-driven development projects. For this, we investigated source code TD detection techniques were evaluated with the purpose of adapting them to be used in the model abstraction level. A catalog of nine di erent model technical debt items for platform-independent codegenerating models was speci ed, with detection strategies that can automate the TD identi cation process in the models. Then, the proposed detection strategies were implemented in a tool that allows both speci cation and automatic identi cation of so-called model smells in EMF models. A preliminary study was performed to evaluate and re ne the proposed detection strategies by using 3 opensource projects versioned on Github. Then, a deeper experimental study was performed in order to evaluate the detection strategies real e ectiveness in anticipating the technical debt identi cation, detecting them in the code-generating models. A total of 9 di erent projects, with 36 EMF models and more than 400 thousand lines of code were used in this evaluation, all opensource, git versioned and developed with EMF technology. Results showed that the evaluated detection strategies are able to anticipate a great amount of source code technical debt in the code-generating models. However, di erent catalog items presented di erent anticipation e ectiveness levels and various levels of occurrences in the studied projects. A discussion was performed for each evaluated item in order to explain the obtained results. Finally, aspects were obtained that may guide future works in improvements and also extension of the catalog.
8	EDERSON DOS SANTOS ASSUNÇÃO Investigating the incidence of code smells in methods of design patterns Leader : RODRIGO ROCHA GOMES E SOUZA MEMBRES DE LA BANQUE : CLAUDIO NOGUEIRA SANT ANNA RODRIGO ROCHA GOMES E SOUZA RICARDO TERRA NUNES BUENO VILLELA Data: 1 sept. 2021 Afficher le Résumé Design patterns are reusable solutions that can be applied to solve specific problems of software projects. However, developers may not implement patterns adequately on some occasions. This can lead to the appearance of code smells. Code smells (or smells) are fragments of code that can indicate potential design flaws. Co-occurrences are situations in which design patterns and smells exist simultaneously in the same code unit (e.g., classes and methods of Java-based software projects). In case of such phenomenon occurs frequently it can reveal opportunities to investigate if a bad practice in the application of patterns is the cause of the occurrence of smells. As a consequence, refining the way how developers apply design patterns may be a good measure to avoid code smells. In this context, we present two experimental studies that aim at examining cases of co-occurrence of patterns and smells. Both studies required the application of detection tools to evaluate Java-based software projects. With our first study, we seek to comprehend the frequency by which code smells co-occur with design patterns. We also report the most common cases of co-occurrence. To achieve this, we identified instances of smells in methods of design patterns of 25 software systems. We also manually inspected source code fragments to obtain useful information about pairs of patterns and smells. Among other findings, we perceived that methods that take part in the implementation of the Adapter pattern are prone to contain smells, e.g., the Feature Envy smell, although it can be argued that the detection of this smell in this context is a false positive. Our second study has the purpose of analyzing the evolution of open-source software projects. We wanted to provide insights about the co-occurrences along with the evolution of the projects. As a result, we found out that the Template Method pattern is more inclined to appear concomitantly with smells through time.
9	LEILA DE CARVALHO COSTA FORMAL VERIFICATION OF SYSTEMS OF SYSTEMS (SOS) MODELED IN BPMN CHOREOGRAPHY DIAGRAMS - PREVIOUS DETECTION OF TYPICAL RUNTIME ERRORS Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : FLAVIO OQUENDO LAIS DO NASCIMENTO SALVADOR RITA SUZANA PITANGUEIRA MACIEL Data: 14 sept. 2021 Afficher le Résumé System of systems (SoS) is composed of a set of systems, which interact together for a common goal. Thereby, it tends to be larger and more complex than traditional systems. To address the question of the inherent complexity of this kind of complex systems, they are frequently modeled using the Business Process Modeling and Notation (BPMN), which is a standard modeling notation for business processes. The choreography diagram, introduced in version BPMN 2.0, provides suitable concepts for representing the interactions among constituents of a SoS. However, models created using this notation may contain errors, some of which can be detected at design-time and others only at runtime. Syntax errors are easily detected with the assistance of modeling tools. Nevertheless, the absence of a formal semantics for BPMN makes it harder to identify runtime errors, as for example, deadlocks, livelocks and other safety properties in BPMN choreography diagrams. These errors are challenging to detect and may lead to an improper operation or even a system lockup. In this context, a method for identifying runtime errors consists of translating the BPMN diagram into a formal model that can be analyzed in a model checker. Thereby we present an approach to build a formal model for the BPMN choreography diagrams in terms of the formal language ⇡-ADL, which is properly designed for the specification of dynamic architectures, an intrinsic characteristic of SoS. Therefore, we define the mapping of the elements of the choreography diagram to ⇡-ADL, in order to obtain its formal description in ⇡-ADL. Such ⇡-ADL models allow its formal verification using a specific model checker, enabling the prior detection of runtime errors in the modeled system.
10	BRUNNA CAROLINE DO CARMO MOURÃO MDSS: A model to support the development of sustainable software systems 80 / 5000 Resultados de tradução MDSS: A model to support the development of sustainable software systems Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : SHEILA DOS SANTOS REINEHR CLAUDIO NOGUEIRA SANT ANNA IVAN DO CARMO MACHADO Data: 5 oct. 2021 Afficher le Résumé Thinking sustainably is an urgent need for the planet and an obligation for all areas of knowledge. Thus, the information technology area has been looking for ways to contribute to sustainability by investigating improvements related to the efficiency of energy consumption by software and hardware. The difficulty of this theme is due to the importance and the complexity of adapting traditional software development processes to new procedures, not to mention that human activities have been increasingly dependent on software, with the recent prevalence of developed software systems for the domain of mobile applications. In this context, a significant challenge lies in investigating methods that increase software sustainability, thus reducing energy consumption by the software, which minimizes the impact of technology on natural resources. Software sustainability can be analyzed from different perspectives (social, economic, technical, and environmental). Thus, before identifying, measuring, and managing the energy consumption of software, it is necessary to understand the concepts and definitions coined in the literature, existing tools, the level of knowledge on the part of professionals, and the proposed techniques and methods. This work developed a model for sustainable software development and organized a body of knowledge on sustainability in Software Engineering based on studies in the literature, considering concepts/definitions, methods/models, and working professionals' opinions. To achieve the goals mentioned above, the research followed two lines of work: (1) carrying out systematic mapping of the literature complemented by a survey for the collection of evidence that allowed to establish an overview of sustainable software engineering; (2) Structuring the identified information to develop the model intended to support teams in developing sustainable software, which took place through qualitative studies with industry professionals.
11	SARA MENDES OLIVEIRA LIMA Thematic synthesis on the adoption of regression testing techniques in software projects for the Android platform Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : ALCEMIR RODRIGUES SANTOS IVAN DO CARMO MACHADO RODRIGO ROCHA GOMES E SOUZA Data: 13 oct. 2021 Afficher le Résumé The mobile device market has grown exponentially in recent years, as the need for apps that strive for quality and that offer features that increase user retention. In recent years, a growing number of studies have provided solutions to problems inherent to the demand mentioned above. The software testing activity plays an essential role in the software quality assurance process. In particular, regression tests are presented as a viable strategy to deal with the complexity and constant evolution of applications since its main objective is to ensure that changes made between versions of a product do not change the system's behavior. Although the literature has devoted efforts to developing new regression testing techniques for Android - the most popular operating system for mobile devices - existing studies do not address how professionals use testing techniques. This investigation aims to carry out a thematic synthesis on the adoption of regression testing techniques in software projects for the Android platform. We organized the research in four steps: (i) carrying out a structured review of the technical literature on regression testing of Android applications, (ii) applying a survey, (iii) carrying out an identifier with professionals from the industry, and (iv) the construction of a thematic synthesis. The survey got 100 responses and provided preliminary insight into how the testing process is performed during and after application maintenance, how automated regression testing of Android applications is in practice, and why the professionals do not perform regression testing after updating applications. In the interview study, we inquired 16 industry professionals. This study contributed to the research as it could unveil the level of knowledge of professionals and its relationship with test automation. In addition, we could identify the most used languages in the development of Android applications and the tools used for testing automation. Furthermore, the study provided us with a preliminary understanding of how professionals perform regression testing in practice. Finally, the thematic synthesis presents a model resulting from the comparison of evidence gathered from different data sources - literature, survey and declares about the use of regression test techniques. These studies bring an insight into the subject and provide possible future investigations.
12	OTÁVIO GONÇALVEZ VICENTE RIBEIRO FILHO A new Artificial Neural Network architecture based on end-to-end approaches to classify signals Leader : RICARDO ARAUJO RIOS MEMBRES DE LA BANQUE : RICARDO ARAUJO RIOS RUBISLEY DE PAULA LEMES MOACIR ANTONELLI PONTI Data: 10 nov. 2021 Afficher le Résumé The increasing computer power, the introduction of specialized processors designed to accelerate graphics rendering, and the high availability of new data have supported the adoption of artificial neural networks (ANN) to solve real-world problems in different areas, such as computer vision natural language processing. The application of such networks to deal with data characterized by temporal dependencies, as signals (time series), produces better results when a preprocessing stage is considered to extract implicit information. In this work, by taking into account the different preprocessing methods, we have noticed that wavelet transforms have been widely used due to, in general, the remarkable results. However, such transformations have two main drawbacks by modeling signals. Firstly, it is necessary to define a set of parameters to extract better information from different levels of scale and resolution. Secondly, the application of such transformations requires an initial step to convert, in batch, all analyzed signals before starting the network training. In our scenario, such limitations were overcome by our new ANN architecture, which analyzes signals and yields as output a set of characteristics similar to those produced by wavelet transforms. Therefore, our ANN can be combined with others to model signals without requiring an execution a priori of any wavelet transformation. The proposed ANN was assessed against a pre-trained network to classify signals from two real-world applications: local field potential (LFP) from monitoring monkey brains and seismic signals from the Llaima volcano. The final accuracy obtained to model both applications using traditional preprocessing, and our approach were, respectively: (i) LFP -- 0.655 and 0.649, and (ii) Llaima -- 0.976 and 0.974.
13	ANDERSON DA SILVA BRITO SACRAMENTO A computational method for closed domain automatic event extraction in Portuguese Leader : MARLO VIEIRA DOS SANTOS E SOUZA MEMBRES DE LA BANQUE : PAULO MIGUEL TORRES DUARTE QUARESMA MARLO VIEIRA DOS SANTOS E SOUZA RENATA VIEIRA Data: 23 nov. 2021 Afficher le Résumé The registration and digital publication of texts in natural language is part of different domains of human activity. Among the information recorded, events are a significant part of the content, since, regardless of the domain, new events continue to happen, and those considered relevant are recorded. With the spread of information technologies, the ease of access to these records has increased the supply of registered events and also the demand for knowledge to aid decision making, derived from new applications of a large volume of information increasingly globalized and instantaneous. The task of event extraction aims to automate the extraction of events from text by identifying the event type and its participants and attributes. This is an information extraction task, which is part of the area of natural language processing. There is no known solution with a performance equal or superior to that of a human for this task in any natural language. In the specific case of the Portuguese language, there is not only a lack of a solution, but there is a need for methods and linguistic resources for the development and evaluation of those methods (via learning from data, i.e. machine learning). To the best of our knowledge, the few works that were published did not perform event extraction in a closed domain configuration, when one knows the structure of the events that one aims to extract. In this research project proposal, we intend to develop a method for automatic event extraction in a closed domain configuration in Portuguese sentences and to build or adapt a corpus in Portuguese that supports the development of this method. This work presents a method for event extraction in a closed domain configuration in Portuguese language sentences based on deep learning using contextualized word embeddings. Furthermore, to support the development and evaluation of the proposed method, we construct a Portuguese corpus with annotations of events in closed domain.
14	JOSELITO MOTA JÚNIOR An evidence-based study on issue labeling in Github-based repositories Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : IVAN DO CARMO MACHADO CLAUDIO NOGUEIRA SANT ANNA JOSÉ AMÂNCIO MACEDO SANTOS Data: 7 déc. 2021 Afficher le Résumé The open-source software community has grown in size and importance over the years. As a consequence, the number of project contributors has increased considerably. The capability of open-source project repositories to accommodate issue reports is essential. An issue report encompasses a large set of data that describes the necessary changes a software should handle. As developers need detailed information to reproduce and find them, incomplete information is a severe problem that may influence triage and defect detection leading to delays in project maintenance. Issue trackers commonly use the labeling method to add extra details to issues. Knowing the importance of labels, this dissertation focus on investigating the usage, creation, and similarities in the context of the issue lifecycle in both maintenance and evolution in the repository issue trackers of the largest and most popular code hosting platform, Github. In addition, it analyzes the number of labeled and unlabeled issues in the repository and the connection between the issues' components, with an analysis focused on the lifecycle. The results indicate a significant correlation between repositories with many issues and the creation of labels, but not all repositories use them. 64.58\% of the repositories insert new labels as the project evolves. 73.14\% repositories applied on issues the Github standard labels. We also found an influence of primary issue fields such as title, description, and comments in most issue labels, impacting the creation and labeling issues. These numbers show that issue labeling is of prominent relevance for project maintenance and evolution. It provides developers with an easy and convenient way to inform about an incoming issue reported by systems users.
15	Wallas Fróes de Oliveira ProgLab: On Provisioning QoS Through Programmable Labels in Residue-Defined Networking Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : LEOBINO NASCIMENTO SAMPAIO MAYCON LEONE MACIEL PEIXOTO RODOLFO DA SILVA VILLAÇA Data: 9 déc. 2021 Afficher le Résumé Residues Defined Network (RDN) have the ability to use labels for end-to-end routing in autonomous systems. RDN is characterized by dispensing with the use of routing tables at the core of the network. In this way, packets are forwarded based on the remainder of the division between the label found in the packet and the local key found in the core switch. Initially, RDN was intended to improve latency in SDNs, since the switches at the core of the network do not communicate with the controller. On the other hand, a range of solutions adopts the Quality of Service (QoS) based on the use of MPLS, which allows providing differentiated services to different traffic classes and patterns and generating considerable benefits for network communications. However, such solutions have high OPEX and CAPEX costs.Motivated by the capacity of RDNs, this dissertation brings an alternative method for using the labels in the core of the network, in addition to trying to solve the problem of scalability and flexibility found in the proposed KeyFlow solution and offer new functionalities for this technology.Through the P4 language, a new flexible, scalable and reprogrammable protocol for an RDN was implemented, in order to implement the model for provisioning QoS by programmable labels, ProgLab {Programmable Labels}.ProgLab implements a new method for reading labels by core switches, which uses the residual number system to define the output ports and queue priority, thus providing QoS to the RDN. To validate the proposal, emulation tests were performed with QoS and Quality of Experience (QoE) metrics to evaluate and demonstrate the feasibility of the model.The results demonstrated the viability of this proposal for the provision of QoS in core networks that use the RNS as packet forwarding, showing that in the future an RDN can become a low-cost alternative for packet forwarding with QoS services.
16	CÍNTIA DA COSTA SOUZA DOSN-PROV: Model and Services for Data Provenance in Decentralized Social Networks. Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : RITA CRISTINA GALARRAGA BERARDI CASSIO VINICIUS SERAFIM PRAZERES MARLO VIEIRA DOS SANTOS E SOUZA Data: 10 déc. 2021 Afficher le Résumé The information on social networks cannot always verify the origin, the paths described, and the registered modifications. Issues such as these generate uncertainty and distrust in processes and interactions in OSNs. With the emergence of DOSNs and the increase in the number of active users in these networks, it becomes essential to develop effective solutions regarding the provenance of decentralized data. The provenance is a factor of considerable importance because it is possible to evaluate the information's authenticity, reliability, and relevance through its results. Tracing and capture of provenance are critical tasks for DOSNs since they produce answers about the steps go through by the information. The large volume of data, the speed of generation and sharing of information, and the decentralized storage strategy make the provenance of DOSNs a non-trivial task. Thus, this work proposes the PROV-DOSN provenance ontological model, a specific model for the DOSNs domain, based on the PROV-O specification from W3C. In addition, this work also proposes services based on the DOSN-PROV to support the capture and tracking of provenance in DOSNs. We evaluated DOSN-PROV in two steps to demonstrate compliance with the proposed domain. Finally, the services underwent a performance evaluation, and their results indicated acceptable response times for the capture and tracking tasks.
17	BEATRIZ SANTANA FAGUNDES SOUZA DE LIMA A Study about the Influence of Text Specificity in the Perceived Helpfulness Classification of Online Reviews Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : TATIANE NOGUEIRA RIOS RICARDO MARCONDES MARCACINI THIAGO ALEXANDRE SALGUEIRO PARDO Data: 15 déc. 2021 Afficher le Résumé Online reviews are valuable sources of information to support the decision-making process, both for individuals and companies. Nevertheless, the large volume of reviews that have a low quality hinders the process of gathering helpful information from those reviews. Several retailers’ websites provide a voting system to allow customers to evaluate other product reviews as helpful or not. However, those votes are often biased and most of the reviews do not receive any votes at all. Besides that, several websites do not even have this voting mechanism or any other component for organizing the reviews in terms of their helpfulness. Therefore, classifying reviews according to their helpfulness has paramount importance in facilitating access to truly informative content. In this context, previous studies have unveiled several features and architectures that are beneficial for the perceived helpfulness prediction. In the present work, we argue that text specificity, defined as the level of details expressed in a text, can influence the perception of review helpfulness and, consequently, can also be a novel useful linguistic aspect for modeling the helpfulness prediction. We proposed two approaches to incorporate the specificity aspect into helpfulness classification models: i) using hand-crafted features based on text specificity and ii) using the review specificity prediction as an auxiliary task in a Multitask Learning (MTL) setting. First, we conducted an unsupervised domain adaptation approach [Ko, Durrett and Li 2019] to label text specificity scores on sentences from online reviews automatically. To evaluate the different trained models using this approach, we proposed a measure named Specificity Prediction Evaluation (SPE), which is based on the assumption that, on average, reliable specific sentences tend to be longer than reliable general sentences [Li and Nenkova 2015]. For the 18 collections of online reviews used in all of our experiments, we could achieve more reliable specificity predictions, according to SPE, by varying only the training set size and the number of training epochs. Finally, we performed experiments to assess the performance of the helpfulness classification models using two neural architectures: Convolutional Neural Network (CNN) [Kim 2014] and Bidirectional Encoder Representations from Transformers (BERT) [Devlin et al. 2019]. In summary, using balanced datasets, the perceived helpfulness classification models, embodied with text specificity- either as features or MTL - showed significantly higher precision results in comparison to a popular SVM baseline when using CNN. With BERT, the experiments showed that MTL outperformed the single-task models for most of the 18 datasets and both accuracy and precision were improved compared to the SVM baseline.
18	JOSÉ RONALDO LELES JUNIOR DOSN-PRIV:: Model and Service for Privacy in Decentralized Online Social Networks Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : DILVAN DE ABREU MOREIRA CASSIO VINICIUS SERAFIM PRAZERES FREDERICO ARAUJO DURAO Data: 21 déc. 2021 Afficher le Résumé The intensive use of Online Social Networks (OSNs) has been caused users to worry about data privacy. Given the centralized nature of these platforms, and since each has a particular storage mechanism, authentication, and access control, its users do not have the control and the right over their data. Thus, users can not easily switch between similar platforms or even transfer data from a platform to another. This implies, among other things, in a threat to privacy, since such users depend on the interests of the service provider responsible for OSN's administration. As a strategy for the decentralization of the OSNs and consequently as a solution to the privacy problems in these environments, the so-called Decentralized Online Social Networks (DOSNs) have emerged. Unlike OSNs, DOSNs are decentralized content management platforms in the sense that they do not use of centralized service providers. Although DOSNs address some of the privacy issues encountered in OSNs, the DOSNs also pose important challenges to be considered, for example, with regard to access control to user profile information with a lower level of granularity. In this context, this work proposes the development of an ontological model and a service to support privacy in DOSNs. The proposed model will be formalized through an OWL ontology that should describe the main concepts of access control for privacy in DOSNs and their relationships, while the service will consume the model to apply access control according to the policies represented in the model. The Dosn-Priv model was evaluated in two phases to verify its compliance with the proposed domain. Finally, the DosnPrivServ privacy service underwent a performance evaluation and its results were satisfactory regarding the response time of access control requests.
19	LUCINÉIA BATISTA DE SOUZA ORGANIZATION OF A SET OF EXPERIMENTAL FINDINGS ON TECHNICAL DEBT RELATED TO TEST Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : MANOEL GOMES DE MENDONCA NETO JOSÉ AMÂNCIO MACEDO SANTOS THIAGO SOUTO MENDES Data: 23 déc. 2021 Afficher le Résumé Context: The term Technical Debt (TD) refers to activities deferred during the development of the software project. TD can be compared to a financial debt whose debtor has the option of paying the principal amount of the debt with interest. Several initiatives have sought to identify the causes and effects associated with TD. Among them, the InsightTD project has carried out a survey in several countries to collect evidence on the state of TD practice. One of the results obtained by the project was the list of causes and effects of TD reported by software professionals. TD can be incurred at any stage of development, such as software testing. This activity aims to ensure the quality of the software being developed. However, both the project and the technical literature have not addressed the TD items that occur in software testing activities. Objective: To investigate, from the point of view of software testing professionals, the types of TD associated with test-related TD, the causes that lead to it and the effects of its existence. Method: A replication of the survey of the InsighTD project was carried out with software testing professionals in Brazil. In addition, responses from software professionals collected in the first execution of the survey were used. Thus, 54 responses from participants in the Brazilian software industry were collected and analyzed, quantitatively and qualitatively. Result: This study highlights the point of view of testing professionals on TD. It was noticed that test-related debt items are related to different types of TD, such as documentation, code, and the test debt itself. Sixty-one causes of TD were identified and organized into eight categories. The main causes identified were deadlines, outdated/incomplete documentation and inadequate planning. Forty-five TD effects were identified and grouped into six categories. The main effects are delivery delays, rework and poor external quality. Conclusion: The list of causes and effects along with their categories was organized into a set of cause and effect diagrams that can help software professionals in managing TD.
	Thèses
1	CLEBER JORGE LIRA DE SANTANA AN ARCHITECTURE BASED ON REACTIVE MICROSERVICES FOR DEVELOPING RELIABLE IOT APPLICATIONS Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : CASSIO VINICIUS SERAFIM PRAZERES CHRISTINA VON FLACH GARCIA CHAVEZ NABOR DAS CHAGAS MENDONÇA PAULO DE FIGUEIREDO PIRES THAÍS VASCONCELOS BATISTA Data: 9 mars 2021 Afficher le Résumé The Internet of Things has changed the scope of the Internet and it has become a network connecting a myriad of devices of different types. The integration of these heterogeneous devices fosters novel services and applications, generates value-added information and actionable knowledge for the end user. Developing IoT applications and services and fulfill their quality of service requirements, such as confidentiality, interoperability, scalability, evolvability, adaptability, and reliability pose multiple challenges. These challenges are imposed mainly by the ultra large scale of the IoT, the heterogeneous nature of applications and devices, and the highly dynamic environment. Nowadays, the microservices pattern has been successfully applied by companies such as Netflix and SoundCloud to address some of these issues in the context of cloud computing applications. However, in the IoT field, the wide adoption of this Microservices still faces some obstacles to be solved in order to achieve the potential benefits of such approach. The goal of this work is to propose an architecture based on reactive Microservices for the development of IoT applications. The proposed architecture consists of a set of software components, tailored to meet the typical requirements of IoT applications that demand high availability. In addition, the proposal includes a software platform that materializes several of its components and help, at runtime, to meet the availability requirement of IoT applications. We applied our proposal in a real-world scenario in the field of Smart Agriculture and in a simulated scenario in the field of Smart Water. The experimental results conclude that our approach improves availability, minimizes latency and maximizes throughput, and increases accuracy significantly, in comparison to another study. We also carried out a performance evaluation from two perspectives. The first perspective is related to the evaluation of the IoT application through different deployment policies, which aimed to show how the architecture components used in microservices can impact the use of computational resources. The second perspective is related to the evaluation of the IoT application that uses the components of the proposed architecture in a scenario in which we increased the traffic of IoT data. The results of the performance evaluation in the two perspectives show that the IoT application remained functional during the analysis performed.
2	JOÃO PAULO DIAS DE ALMEIDA Exploiting Open Data for Improving Spatial Keyword Query Applications Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : CARLOS ANDRÉ GUIMARÃES FERRAZ CARLOS EDUARDO SANTOS PIRES DANIELA BARREIRO CLARO FREDERICO ARAUJO DURAO VANINHA VIEIRA DOS SANTOS Data: 3 mai 2021 Afficher le Résumé Nowadays information is an essential resource in various sectors of the economy. The popularization of social networks, smartphone applications, and online services has increased the volume of data available online. Among this extensive amount of information, there is a specific data type called spatial data. It represents a physical object using its spatial coordinates (e.g. latitude and longitude). Spatial data is critical in a large number of application domains (e.g. land use, transportation plan). For instance, the user can find points of interest (POIs) or be warned of critical situations by spatial data applications like web search engines or emergency response applications. It’s been asserted that 80% of all data business has some locational reference. Spatial queries are widely employed to manipulate spatial data more efficiently. However, the user has a crucial role in the spatial information retrieval process when querying the needed information. For decades, researchers have proposed several techniques to aid users in expressing their information needs, such as Boolean models, pattern matching operators, and query expansion. Despite the existence of relevant alternatives in the field, there is still a lack of solutions applied to keyword preference queries. The Spatial Keyword Preference Query (SKPQ) arises as a potential solution to assist users in finding POIs. SKPQ selects POI based on the description of features in their neighborhood. In essence, the user defines a spatial (i.e. radius) and textual (i.e. query keywords) constraint to be satisfied. In this context, this thesis aims at proposing strategies to improve SKPQ results. The contribution is threefold. First, two Linked Open Data (LOD) repositories (i.e. DBpedia and LinkedGeoData) are exploited to improve the features description. The feature description in LOD contains more information than traditional spatial databases, leading to a more detailed description. Second, the query results are personalized to present the best POIs for the underlying user. By exploiting reviews on POIs, the system identifies the object that best satisfies the user and re-order the rank with respect to the user preference. Third, we model the user preference in visiting locations near to each other using a probabilistic function. This function is incorporated into the ranking function to retrieve POIs considering this user preference. We evaluate each technique employed in this proposal separately. The first technique achieves a relative NDCG improvement of 20% when using random query keywords. Also, it finds POIs where SKPQ is unable to find. The second technique further improves the relative NDCG by 92%. Finally, the third technique improves the rank consistency achieving a Tau performance of 52%.
3	JÔNATAS FERREIRA BASTOS UNDERSTANDING TEST EVOLUTION: FROM HIGHLY-CONFIGURABLE SYSTEMS TO SOFTWARE ECOSYSTEMS Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : EDUARDO SANTANA DE ALMEIDA CHRISTINA VON FLACH GARCIA CHAVEZ RODRIGO ROCHA GOMES E SOUZA GUSTAVO HENRIQUE LIMA PINTO MARCELO BEZERRA D'AMORIM Data: 4 mai 2021 Afficher le Résumé Software evolution is inevitable if the systems are planned to survive in the long-term. Equally, well-understood is the necessity of having a good test suite available to ensure the quality of the current state of the software system and to ease future changes. This is especially true in the context of reusable systems since they are planned to attend for a long time a specific market niche and need to support a large number of configuration options. However, developing and maintaining a test suite is time-intensive and costly. This situation is challenging for the projects: on the one hand, tests are essential for the success of software; on the other hand, tests become a severe burden during maintenance. Even though a substantial body of literature has studied testing in reusable environments, test evolution analysis has not been addressed. In general, researchers have looked into analyzing test strategies, dynamic test selection techniques, and co-evolution of tests along with other systems artifacts. This thesis intends to improve the test evolution body of knowledge in reusable systems, investigating characteristics that indicate the effort to develop and maintain the test suite and unveiling how the reusable aspects affect the tests. The set of evidence can help researchers and practitioners to better planning the test development and evolution. This way, we employed a multi-method approach to develop the understanding of test evolution in configurable systems and unveil evidence on the topic from various sources. In the first phase of the research program, we provided an overview of the existing research related to this thesis’s subjects and presented related work to our investigation. The second phase was composed of four empirical studies. First, we performed a case study to analyze the test evolution of a large configurable system. Next, we performed a comparative study to evaluate the test evolution in 18 open-source projects from various sizes and domains in configurable systems and their similarities and differences to 18 Single Systems (SS) projects. Third, we conducted an extended study to analyze the test suite evolution in another category of reusable systems to verify whether some observations are recurring and gather new data that support the findings. Finally, we surveyed test contributors to investigate the test evolution from the development point of view and improve the findings in previous stages. This work collected a set of findings of test evolution, and these findings were strengthened by using different research methods. Our work provided a better understanding of test evolution in configurable systems by documenting evidence observed in open-source projects and test contributors. Moreover, in this Thesis, we synthesized the gathered evidence and identified open issues in this research topic. These findings are an important step to establish guidelines for addressing test evolution in configurable systems.
4	FLAVIA MARISTELA SANTOS NASCIMENTO A Flexible Framework to Schedule Hard and Soft Tasks in Multiprocessor Real-Time Systems Leader : GEORGE MARCONI DE ARAUJO LIMA MEMBRES DE LA BANQUE : GEORGE MARCONI DE ARAUJO LIMA VINICIUS TAVARES PETRUCCI DANIEL MOSSÉ KONSTANTINOS BLETSAS LILIANA CUCU-GROSJEAN Data: 28 juin 2021 Afficher le Résumé The increasing number of modern applications, that rely on the processor capacity of multicore CPUs, has been pushing the area of real-time systems towards multiprocessor architecture. In fact, such applications are present in a wide variety of areas, such as automotive, avionics and personal electronics industries, most of them being composed of both time critical (hard) and non-critical (soft) services. Indeed, for multiprocessor real-time systems that deal with both hard and soft tasks, one is interested in minimizing the response time of the soft tasks, while not jeopardizing timeliness of hard tasks. Several scheduling approaches have been proposed to deal with hard and soft tasks on a single processor, while for a multiprocessor environment such approaches are less common and are typically extensions of the mechanisms described for uniprocessor systems. This work describes a flexible scheduling approach designed for multiprocessor real-time systems composed of both hard and soft tasks. According to the proposed approach, hard tasks are assigned to processors in an off-line manner and do not migrate during their execution. As for soft tasks, two main configurations are possible. They may be placed either in each processor's local queue, for which migration is not allowed, or in a global queue, being accessed by all processors. The scheduling approach described in this thesis is strictly linked to the concept of servers, which are scheduling proxies responsible for scheduling the tasks (or other servers) assigned to them. Two special servers are described, whose definition, rules and correctness proof are also presented in this thesis. Together, these servers are able to (a) encapsulate soft tasks so that their execution does not put at risk the timing requirements of hard tasks and to (b) reclaim system unused slack to favor the execution of soft tasks. By doing so, the average response time of soft tasks are improved, without jeopardizing the timeliness of hard tasks. It is shown that by using the scheduling approach described in this thesis, in both configurations, improves the QoS-related metrics for soft tasks, such as deadline miss ratio and average response time, without compromising timeliness guarantees for hard tasks. The experiments has shown a reduction of up to $27\%$ in the deadline miss ratio and a reduction of up to $14\%$ in the average response time of the soft tasks.Such results indicate that the available processing capacity left unused by hard tasks can be effectively used for executing soft tasks.Additionally, the proposed approach has shown to be flexible enough to be adapted to other EDF-based scheduling policies at low cost..
5	RAFAEL LIMA COSTA Tactful Networking as a cornerstone for Opportunistic Human-Aware D2D Communication Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : ALINE CARNEIRO VIANA EDUARDO COELHO CERQUEIRA FABIOLA GONCALVES PEREIRA GREVE LEOBINO NASCIMENTO SAMPAIO RICARDO ARAUJO RIOS Data: 3 août 2021 Afficher le Résumé Among the mobile network challenges, there is data traffic growth, an increasing number of connected devices, spectrum limitations, and costly infrastructure updates. In this context, opportunistic D2D communication appears as a solution to offload data, increase capillarity, deliver content in specific scenarios (e.g., emergency) and foster innovative applications and services. Previous standards like LTE discussed D2D broadly. Still, recent research remarks that 5G/6G will be D2D's real enablers, given the possibility of using human-behavior big data from users' devices. Previous opportunistic D2D forwarding algorithms extracted mobility characteristics for improving content delivery cost-effectiveness. Nevertheless, most initiatives dealt with traditional metrics due to constraints, such as limited availability of real traces or the lack of a human-centered networking vision. Next-generation solutions need a more in-depth vision of human aspects hidden into datasets. Moreover, scenarios practical to mobile carriers lack evaluation. To this end, this thesis~\textit{guides the reader through the whole process for building a novel Tactful Opportunistic COmmunicaTion Strategy (TOOTS)}. TOOTS leverages wireless encounter patterns, temporal, spatial, geographic, and direction awareness to improve cost-effectiveness content delivery in a mobile networks scenario. The proposal consists of: learning human-aspect state-of-art best practices; achieving insights to improve strategy's performance; using, proposing, and analyzing human-aware metrics; combining metrics and insights into the strategy targeting an improved performance in a more realistic mobile scenario; and finally, evaluating the strategy through its strengths and shortcomings. This thesis shows that TOOTS improved the performance of an opportunistic content delivery scenario in terms of overhead, delivery rate, and latency by following this process. First, to achieve the results, we survey the human aspect in networking solutions for over a decade. We found that there is an evolution in how the human links to networking challenges. This broader human-aware vision culminates into the Tactful Networking perspective, which we introduce to follow, including discussions concerning application examples and insights. We complete the literature review by discussing state-of-art opportunistic forwarding strategies, including contributions, gaps, and open issues. Second, we discuss a framework for enhancing raw human data hidden into datasets. A case study with the MACACO dataset validates the framework through trace characterization results and analysis. Following, we characterize MACACO, NCCU, and GRM datasets to bring insights and to validate TOOTS. The presented and analyzed strategy's metrics come from such datasets' characterization results and insights. Third, we introduce TOOTS, targeting cost-effective content delivery in a mobile scenario. Afterward, we present the formal definition for choosing the disseminator nodes problem (TOOTS' 1st phase). As motivation, we analyze the overhead and latency of the Epidemic forwarding. Subsequently, we introduce TOOTS' two phases: a Tactful Dissemination Policy and a forwarding algorithm. Finally, we compare TOOTS' effectiveness on its phases with enhanced Store-wait-forward, Epidemic, and Bubble Rap algorithms through real and synthetic traces. Among the main outcomes, in a scenario with a 30~m range, TOOTS reached a 100\% delivery ratio with 28\% and 73\% reduced delivery latency, and with 16\% and 27\% reduced overhead, respectively, in the real and synthetic datasets. With a restricted (10~m) range, TOOTS was the fastest strategy and the only one that delivered 100\% of the contents within the real trace. Consequently, this thesis contributes to addressing the opportunistic content delivery problem in cellular networks by providing a cost-effective human-aware opportunistic solution. The strategy can be applied in different scenarios to assist the operator in delivering content without not necessarily using their legacy network, e.g., by exploring the capillarity of their mobile users, for data offloading and other applications.
6	MARCOS BARBOSA DÓSEA Design-Sensitive Metric Thresholds based on Design Roles Leader : CLAUDIO NOGUEIRA SANT ANNA MEMBRES DE LA BANQUE : CLAUDIO NOGUEIRA SANT ANNA CHRISTINA VON FLACH GARCIA CHAVEZ MANOEL GOMES DE MENDONCA NETO MAURÍCIO FINAVARO ANICHE MARCELO DE ALMEIDA MAIA Data: 16 août 2021 Afficher le Résumé State-of-the-art techniques and Automated Static Analysis Tools (ASATs) for identifying code smells rely on metric-based assessment. However, most of these techniques have low accuracy. One possible reason is that source code elements, such as methods implemented according to different design decisions, are assessed through the same generic threshold for each metric. Other possible reason is that these metric thresholds are usually derived from classes driven by different design decisions. Using generic metric thresholds that do not consider the design context of each evaluated class can generate many false positives and false negatives for software developers. Our goal is to propose design-sensitive techniques to derive contextual metric thresholds. Our primary hypothesis is that using the design role played by each system class to define this context may point out more relevant code smells to software developers. We conducted some empirical studies to define the proposed techniques. Firstly, we performed a large-scale survey that showed that practitioners recognize difficulties in fitting ASATs into the software development process. They also claim that there is no routine for application. One possible reason practitioners recognize that most of these tools use a single metric threshold, which might not be adequate to evaluate all system classes. Secondly, we conducted an empirical study to investigate whether fine-grained design decisions also influence the distribution of software metrics and, therefore, should be considered to derive metric thresholds. Our findings show that the distribution of metrics is sensitive to the following design decisions: (i) design role of the class (ii) used libraries, (iii) coding style, (iv) exception handling, and (v) logging and debugging code mechanisms. We used these findings to propose two new techniques to derive design-sensitive metric thresholds using the class design role as context. Then, we carried out two large-scale empirical studies to evaluate them. The first study showed that our proposed techniques improved precision according to developers' perceptions. Since it is impossible and tiring to perform a complete source code quality assessment with developers, we conducted a second study mining the evolution of software projects from popular architectural domains. We found that our techniques improved recall to point out methods effectively refactored during software evolution.
7	ERIC BERNARDES CHAGAS BARROS Jemadar-AI: A Fog-Based Framework for Microgrid Power Management Leader : MAYCON LEONE MACIEL PEIXOTO MEMBRES DE LA BANQUE : MAYCON LEONE MACIEL PEIXOTO GUSTAVO BITTENCOURT FIGUEIREDO LEOBINO NASCIMENTO SAMPAIO LUIZ FERNANDO BITTENCOURT EDWARD DAVID MORENO ORDONEZ Data: 23 sept. 2021 Afficher le Résumé The constant increase in demand for electricity and the reduction of conventional means of production point to a future energy crisis. One of the factors that impact the growing demand is uncontrolled consumption, generating an increasing energy dispatch to meet the consumer's needs. Thus, production needs to use a greater amount of energy sources, especially during peak consumption hours. To control demand, demand management techniques are being used by consumers to reduce peak periods and energy bills. Likewise, energy production can combine the use of renewable and non-renewable energy sources with a focus on reducing non-renewable energy production. In this scenario, intelligent energy production requires automated control to avoid energy losses. Currently, several researchers propose the use of optimization algorithms for electrical equipment scheduling as a demand management solution, so the use of equipment be displaced to avoid peak energy periods. Research has also been proposed on the use of the Proportional-Integral-Derivative (PID) control to balance renewable and non-renewable production. However, few researchers consider fog computing with the help of the cloud to perform the processing of these algorithms. Thus, this work proposes the JEMADAR-AI framework to deal with the data traffic of electrical networks. JEMADAR-AI trains two neural networks based on reinforcement learning: (i) the first one to performs equipment scheduling; (ii) the second adjusts the PID controller dynamically. For this, JEMADAR-AI uses an architecture that combines fog computing with cloud computing to reduce request response times for equipment scheduling and power adjustment requests. The results obtained through the performance evaluation showed that the use of JEMADAR-AI reduces up to 19% the electrical power at peak hours and 21.6% the values of the energy bills when compared with scenarios that do not perform any optimization.
8	LAECIO ARAUJO COSTA SapeS: An architecture for evaluating academic performance in distance education based on Learning Analytics and Ontologies Leader : MARLO VIEIRA DOS SANTOS E SOUZA MEMBRES DE LA BANQUE : MARLO VIEIRA DOS SANTOS E SOUZA ECIVALDO DE SOUZA MATOS FREDERICO LUIZ GONÇALVES DE FREITAS LEÔNIDAS DE OLIVEIRA BRANDÃO PAULEANY SIMOES DE MORAIS Data: 28 sept. 2021 Afficher le Résumé Distance Learning, along with the evolution of Information and Communication Technology (ICT), has higher enrollment rates than traditional education. With this growth, educators started to manage classes with a more significant number of geographically distributed students. Challenges arise for teachers regarding how to monitor academic progress based on the fulfillment of planned educational objectives and assessing students' academic performance, supervising the acquisition of knowledge. Tracking students' learning experiences in Learning Management Systems (LMS) to assess academic performance and student progress is one of the most intense and exhausting challenges for educators due to the amount of educational data produced and the various learning indicators available on the LMS. Thus, this research aims to investigate how an approach to monitoring and evaluating the student's academic trajectory in the LMS, called SapeS, can positively impact the teaching and learning process in Distance Learning. The proposed solution aims to process educational data from the LMS, extract useful knowledge, and make information available to educators and students with techniques of Learning Analytics and Computational Ontologies, guided by Taxonomy of Educational Objectives. The solution was developed based on the Design Science Research (DSR) methodology and applied in a real context for evaluation with the population (n=24). Through the analysis carried out it was verified that there was a strong adherence to the use of the proposed tool and a significant difference between the data collected at different moments of the investigation. There was a greater engagement, motivation, and participation in the academic activities of the investigated course and greater attention from the teachers regarding the evaluation of the student's trajectory, in addition to checking and adapting the pedagogical planning in relation to the student's situation, and the planned educational objectives.
9	JEAN CLEMISSON SANTOS ROSA The coordination of collaboration in SPIDe: mechanisms and practices to support interaction co-design Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : ECIVALDO DE SOUZA MATOS DUARTE NUNO JARDIM NUNES RODRIGO FREESE GONZATTO LUCIANA APARECIDA MARTINEZ ZAINA TAYANA UCHÔA CONTE Data: 25 nov. 2021 Afficher le Résumé The creation of interactive computational artifacts should reflect the needs, interests, restrictions, ways of working, and cultural aspects of their potential users whenever possible. SPIDe (Semio-Participatory Interaction Design) was conceived to reflect these aspects in the developed computational product through collaborative design practices (codesign). In this sense, the literature indicates that codesign practices favor the product's identity concerning its potential users. Codesign expresses a collaborative perspective for design, in the sense that people influenced by the creation and use of technologies must collaborate in its creation and development. Collaboration has three well-defined dimensions in the literature, according to the 3C Model: cooperation, communication, and coordination. The cooperation concerns the joint operation of users in a shared space making use of communication; in turn, communication consists of the exchange of messages for mutual and shared construction based on the management carried out by the coordination; coordination defines the activities and the set of objects that will be manipulated, in addition to defining the media and the mode of transmission of messages. Although SPIDe is characterized as an interaction codesign process, several aspects are to be investigated and improved. One of them concerns coordination. Therefore, in this research, we seek to identify how to promote the coordination of collaboration in SPIDe. To this, we investigated mechanisms and practices to support and promote the coordination of collaboration in SPIDe. Methodologically, we conducted multiple case studies with different data collection approaches, such as systematic review and narrative of literature, semi-structured interviews, logbooks, conversation, and Technology Acceptance Model questionnaire and Design Science Research. The thematic analysis based on the data produced supported the improvements made in SPIDe and reported in this thesis report, such as the guidelines for conducting the braindraw technique, the construction of the epistemic tool SPIDe\_Canvas and the construction of a platform for conducting online and distributed from SPIDe. In addition to improving SPIDe, the results of this research can help professionals and researchers build interactive computational artifacts in collaboration with potential users in a fun, dynamic, and effective way.
10	JEFFERSON FONTINELE DA SILVA Paying attention to the boundaries in semantic image segmentation Leader : LUCIANO REBOUCAS DE OLIVEIRA MEMBRES DE LA BANQUE : LUCIANO REBOUCAS DE OLIVEIRA MICHELE FÚLVIA ANGELO ALEXANDRE DA COSTA E SILVA FRANCO FLAVIO DE BARROS VIDAL RICARDO DA SILVA TORRES Data: 20 déc. 2021 Afficher le Résumé Image segmentation consists of assigning a label to each pixel in the image in such a way that pixels belonging to the same objects in the image must have the same labels. The segmented area of an object must span all pixels up to the limits (boundaries) with the other objects. The boundary region can provide helpful information for the segmentation process, as it marks a discontinuity that can define a segment limit. However, segmentation methods commonly suffer to explore boundary information and consequently to segment this region. This is so mainly due to the proximity between image regions containing different labels. In view of that, we propose to investigate how to take into account the boundary information when semantically segmenting an image object. Our first contribution is a graph-based image segmenter, called interactive dynamic programming (IDP)-expansion. This is a weakly-supervised method that requires a seed into each object targeted to be segmented in the image, subsequently minimizing an energy function to obtain the image labels. IDP-expansion explores dynamic programming to initialize an alpha expansion algorithm over superpixels to improve boundary information in a segmentation process. Over the Berkeley segmentation data set, our experiments showed that IDP-expansion is 51.2% faster than a traditional alpha-expansion based segmentation. Although IDP-expansion has shown to be faster, it suffers from two matters: A mandatory seed initialization and the lack of semantic information. This further led us to develop a supervised convolutional neural network architecture to semantically explore boundary information. Our novel method, called DS-FNet, uses two streams integrated in an end-to-end convolutional network to combine segmentation and boundary information based on an attention-aware mechanism. To evaluate DS-FNet, we initially conducted experiments on general-purpose (Pascal Context) and traffic (Cityscapes, CamVid, and Mapillary Vistas) image data sets, having the mean intersection over union (mIoU) as the reference metric. DS-FNet outperformed ten segmentation networks in the Pascal Context, Cityscapes, and CamVid data sets. In the Mapillary Vistas data set, DS-FNet achieved second place when compared to five other methods. A second round of experiments was performed to evaluate the generalization of our proposed method on challenging medical imaging data sets, containing several kidney biopsy whole slide images (WSIs). The data sets used to evaluate the second version of our network were HubMAP, WSI Fiocruz, and a subset of Neptune data set, all considering glomerulus segmentation. After training DS-FNet only over HubMAP data set, containing periodic acid-Schiff (PAS)-stained WSIs with only non-injured glomeruli, we found that our network was capable to segment glomeruli on WSIs stained by other methods (hematoxylin-eosin (HE), periodic acid-methenamine silver (PAMS), trichrome (TRI), and silver (SIL)). The results of these latter experiments show that our model is more robust than other models based on U-Net architecture. All the experiments and analyses presented in this work demonstrated that the explicit and adequate consideration of boundary information improves the results over non-boundary segmentation methods.

Thèses

MOARA SOUSA BRITO LESSA
On the Selection of Open Source Software Projects for Software Engineering Education
Leader : CHRISTINA VON FLACH GARCIA CHAVEZ
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
IGOR FABIO STEINMACHER
RODRIGO ROCHA GOMES E SOUZA
Data: 5 févr. 2021

Afficher le Résumé
Context: Teaching Software Engineering (SE) in higher education is a challenging task. A solid theoretical base should be provided to students, including basic SE concepts (requirements, design, tests, among others), and topics related to project management, human-computer interaction, configuration and release management. Practical experience, communication skills and professionalism are dimensions that should also be considered, including teamwork and ethics, to prepare students for a career in the software industry.

Free/open source software (FLOSS) has been used in Software Engineering Education (SEE) as a viable strategy to address the need for training students' technical and non-technical skills and prepare them to deal with the challenges posed by the evolving software industry.
Some studies report that the pedagogical use of FLOSS projects has brought benefits but also challenges that may hinder their effective adoption in SEE.

A common challenge is project selection, due to the diversity of available FLOSS projects and the complexity of the task. In general, the teacher must search for, manually or with the support of different tools (not necessarily designed for use in Education), one or more projects for pedagogical use in the course. Several criteria can be considered for the selection of a FLOSS project for pedagogical use in SEE, including project size, number of tests, programming language, community size and their best practices.

Problem: The selection of FLOSS projects for pedagogical use in SEE requires effort and experience, and there are few tools that support such complex task while providing clear and well-defined selection criteria to be used by the teacher. Thus, the burden of selecting FLOSS projects can hinder their use and adoption in the context of SEE.

Objective: This research aimed to investigate the usefulness of an automated approach to support the teacher in the selection of open source projects guided by criteria that take into account socio-technical aspects of the FLOSS project, for pedagogical use in SEE higher education.

Reasearch Methods: A literature review was carried out to identify criteria already in use for the selection of FLOSS projects. Then, a set of selection criteria was identified, documented, operationalized and implemented in an open source tool called FlossSearch.Edu. The first version of the tool was evaluated by students and professors from different Brazilian higher education institutions. Initially, students used FlossSearch.Edu, in the context of a SE discipline, to select FLOSS projects based on criteria defined by the teacher, and then evaluated the use of the tool through a survey with questions based on the Technology Acceptance Model (TAM). In a qualitative and quantitative study, teachers already familiar with the pedagogical use of FLOSS projects, used FlossSearch.Edu in an individual session, recorded and guided by predefined scenarios, while reporting their impressions out loud (``Think Aloud''protocol). At the end of the sessions, teachers evaluated the use of the tool using a TAM questionnaire.

Findings: In the context of selecting FLOSS projects in which FlossSearch.Edu was used, the tool was well evaluated. The majority of students who used FlossSearch.Edu in the classroom and teachers who participated in the second assessment said that the tool is useful, easy to use and that they intend to use it in the future. Several suggestions for improvement were received and should guide the evolution of the tool and further studies.

VITOR MORAES ARANHA
SCREEN-SPACE VIRTUAL POINT LIGHT PROPAGATION FOR REAL-TIME GLOBAL ILLUMINATION
Leader : ANTONIO LOPES APOLINARIO JUNIOR
MEMBRES DE LA BANQUE :
ANTONIO LOPES APOLINARIO JUNIOR
KARL PHILIPS APAZA AGUERO
RICARDO GUERRA MARROQUIM
Data: 1 mars 2021

Afficher le Résumé
Proper light-transport simulation adds both realism and aesthetically pleasing effects to virtual 3D scenes. However, the cost of computing complex light interactions is prohibitive for real-time applications. Indirect diffuse lighting is a low frequency component of global illumination that can greatly enhance the quality of an image. Virtual point lights are commonly used to reproduce diffuse bounces by tracing light paths through the scene and creating proxy light sources at the intersections between a path and geometry. In this work we extend clusterization-based methods for Virtual Point Lights, allowing for the reproduction of up to two bounces of light with a projection-aware sampling method in real-time. We show that plausible images can be obtained in real-time rates for low-to-mid end commodity GPUs.

FRANKLIN DE JESUS SILVA
Highly-configurable systems in software startups: unveiling the white label model
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
IVAN DO CARMO MACHADO
RITA SUZANA PITANGUEIRA MACIEL
ADRIANO BESSA ALBUQUERQUE
Data: 25 mars 2021

Afficher le Résumé
Startups are small companies that seek to explore new businesses by embodying new technologies to different markets through innovation. Startups' Ecosystems exist to provide a supportive environment for these companies, being a valuable source of networking and knowledge to them. White label software projects are developed by Startups and known in Ecosystems as highly adaptable products, capable of generating new products faster than stand-alone applications, ensuring the best cost x benefit. Startups are divided into stages, and they face different challenges depending on the current stage. In the early stages, they barely plan their development activities but rather assess the market's needs and find users for their initial product, which leaves technical debt. White Label software projects suffer more from these issues since Startups do not address advanced code reuse techniques widely known by the academy. In the late stages, startups need to solve the technical debt managed in the early stages and need even more scalable processes to grow their business that involves White Label Software projects. This work seeks to unveil the concept of White Label software projects and their feasibility for software startups; investigate if and how advanced code reuse techniques, such as highly-configurable systems, could be used as opportunity-lever for software startups; build an adaptable framework for helping White Label software projects manage technical debt and face its challenges. In order to accomplish the aforementioned goals, we applied three empirical studies under different public. A semi-structured interview with startups using White Label software projects, an online questionnaire applied for stakeholders from ecosystems of innovation, and a Multivocal Literature Review with gray literature articles over the internet about White Label software projects. The yielded results presented a compelling portrait of how software startups have dealt with software reuse in their daily practices, with particular attention on how White Label software development has been explored in their projects. The data gathered allowed the creation of the framework above for mobile White Label software projects. In addition, this work gives a path of improvement that can be applied for startups working with White Label software projects under different stages, besides giving researchers a set of open challenges to be studied in future work.

Luciane de Andrade Meconi
BOA – BusinessProcess Oriented Architecture: Uma Proposta para Publicação, Busca/Descoberta e Composição de Processos de Negócio em SoS
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
FLAVIO OQUENDO
LAIS DO NASCIMENTO SALVADOR
RITA SUZANA PITANGUEIRA MACIEL
Data: 21 juin 2021

Afficher le Résumé
Os sistemas de software têm apresentado um crescimento exponencial, tornando-se cada vez maiores e complexos. Os sistemas complexos são sistemas caracterizados pela ausência de autoridade central, não são projetados a partir de uma especificação conhecida, mas envolvem diferentes partes interessadas em criar sistemas funcionais para novas finalidades. Apresentam também uma variedade de comportamentos, incluindo a auto-organização, propriedades emergentes, comportamento não-linear e, com frequência, contraintuitivos. Um sistema complexo com independência gerencial e operacional dos componentes é considerado um Sistema de Sistemas (SoS), não importando a complexidade individual dos seus subsistemas. O SoS é uma nova classe de sistemas, resultante, muitas vezes, da interação de vários sistemas, independente da operação. Um estilo arquitetônico padrão amplamente aplicado aos SoS é a SOA (Arquitetura Orientada a Serviço). Apesar de SOA ser vista como uma solução promissora para viabilizar a organização e utilização das funções distribuídas sob o domínio de diferentes proprietários de SoS, os recursos de software são acondicionados como serviços. A granularidade fina dos serviços pode dificultar a composição de um SoS, tendo em vista a grande quantidade de relacionamentos e conexões necessárias entre os sistemas constituintes para que componham serviços para prover suas funcionalidades. Essa dificuldade é ainda maior em SoS colaborativo, uma vez que a entrada e saída dos sistemas constituintes ocorre de maneira dinâmica. A elevação do nível de serviços para processos já vem sendo utilizada para diminuir a complexidade no gerenciamento de sistemas, processos e pessoas de uma mesma organização. As arquiteturas de referência no contexto SoS visam apoiar a padronização de interfaces dos sistemas constituintes, auxiliando às empresas no desenho de seus sistemas e, consequentemente, viabilizando a interoperabilidade entre estes sistemas. Algumas ferramentas conhecidas no mercado como BPMS, de domínio público ou privado, são utilizadas para apoiar o gerenciamento de processos sob a ótica do BPM, porém estas ferramentas não se mostram adaptadas ao domínio SoS, sendo desenvolvidas para atender às necessidades internas das organizações. Esta dissertação apresenta uma abordagem arquitetural orientada a processos de negócio, voltada para o domínio de Sistema de Sistemas, denominada BOA (Business Process Oriented Architecture). O objetivo foi propor, especificar, implementar e avaliar aspectos de uma Arquitetura Orientada a Processos de Negócio, específica para publicação, descoberta/busca e composição de processos de negócio. A BOA é composta por uma definição de uma linguagem de descrição de processos de negócio (BPDL) e pela implementação de uma ferramenta de publicação, busca/descoberta e composição de processo de negócio, denominada neste trabalho como Plataforma BOA Suite. Para avaliação da nossa proposta foi realizado um estudo exploratório, com abordagem qualitativa, utilizando como ferramenta a entrevista. Os resultados deste estudo evidenciaram que os participantes, a partir de cenários de avaliação e da interação com a BOA Suite, conseguiram realizar a publicação, busca/descoberta e composição de processos de negócio, evidenciando, assim, a viabilidade da proposta. Como contribuição, espera-se diminuir o nível de complexidade da composição por processos de negócio e auxiliar no gerenciamento da integração dos sistemas participantes de um SoS.

DIEGO CORREA DA SILVA

Exploring Weighted Calibration, Balancing, and Metrics for Justice in Recommender Systems

Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
TATIANE NOGUEIRA RIOS
LEANDRO BALBY MARINHO
Data: 1 juil. 2021

Afficher le Résumé
Recommendation systems are tools used to suggest items that might be of interest to users. These systems are based on the user's preference history to generate a list of suggestions that are more similar to the items in the user's history, aiming at better accuracy and less error. It is expected that when recommending an item, the user gives a feedback to the system, indicating if he liked or how much he liked the recommended item. User interaction with the system enables a better understanding of users' tastes, which over time, add more and more items to their preference profile. The recommendation based on the similarity of the item with the preferences, seeking the best precision can cause side effects in the list, such as: overspecialization of the recommendations in a certain core of items, little diversity of categories and imbalance of category or gender. Thus, this dissertation aims to explore calibration, which is a means to produce recommendations that are relevant to users and at the same time consider all areas of their preferences, seeking to avoid disproportion in the recommendation list. For this, ways of weighing the balance between the relevance of the recommendations and the calibration based on measures of divergence are discussed. The hypothesis is that calibration can positively contribute to fairer recommendations according to user preference. The research is carried out through a broad approach that contemplates nine recommendation algorithms applied in the film and music domains, analyzing three of divergence measures, two custom balance weights and two balances between relevance-calibration. The assessment is analyzed with widely used metrics as well as proposed metrics. The results indicate that calibration has positive effects both for recommendation accuracy and fairness with user preferences, creating recommendation lists that respect all areas. The results also indicate which is the best combination to obtain the best performance when applying the calibration proposals.

NILDO CEZARIO DA SILVA JUNIOR
A Multimethod Study on Understanding Test Smells in the Brazilian Software Industry
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
IVAN DO CARMO MACHADO
RITA SUZANA PITANGUEIRA MACIEL
CARLA ILANE MOREIRA BEZERRA
Data: 21 juil. 2021

Afficher le Résumé
Software is present in everyday life and has aroused increasing interest in society. Its development process encompasses activities from understanding the requirements to delivering the executable software and its subsequent maintenance and evolution. However, the software could have defects, leading to losses of a large order of magnitude. Software testing is an effective strategy for early identification of defects, that is, before delivery to the end-user, and a consequent reduction in damage and associated costs. Testing involves activities, manual or automated, of analyzing the operational behavior of software systems, considering the input data set and output validation. The prevalence of automated test mechanisms is notorious, given time, effort, and cost savings in test development and execution. However, while creating automated tests, code may be implemented from inappropriate standards, resulting in code that is difficult to understand or even hinder maintenance activities. Such anti-patterns are called test smells. The literature presents studies aimed at analyzing the effects of test smells in the test code and tools that automate the detection and correction of test smells. However, little is known about the knowledge of industry professionals about this concept. Thus, this study aims to empirically analyze the knowledge related to test smells from professionals’ perspectives in the Brazilian software industry. We seek to identify programming practices that influence the insertion of test smells in the test code and analyze the existence of procedures aimed at preventing, identifying, and correcting test smells. To accomplish this goal, we divided the research into four stages: bibliographic survey, application of a survey with professionals from the software industry, interviewing with a selection of these professionals, and finally, developing a grounded theory. The survey included the participation of 60 professionals, and it was possible to infer the practices most frequently adopted in practice and a preliminary assessment of the impact of professional experience on the use of these practices. In the interview study, carried out with 50 professionals, it was possible to assess the level of knowledge about the concept of test smells. Data-based theory identified five areas related to improving the quality of automated testing: competency, technical alignment, test development, quality improvement, and tool use. The presentation of the level of knowledge about test smells and the quality verification practices of software tests are the main contributions of this research, which still foresees possible future investigations.

RAMON ARAÚJO GOMES
Antecipanting Technical Debt Items in Model Driven Development Projects
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
RITA SUZANA PITANGUEIRA MACIEL
CLAUDIO NOGUEIRA SANT ANNA
UIRÁ KULESZA
Data: 19 août 2021

Afficher le Résumé
ABSTRACT

Model Driven Development (MDD) and the Technical Debt (TD) metaphor are software

engineering approaches that look for promoting quality of systems under development.

While MDD does it through the use of models as primary artifacts in the software development

process, TD management does it by correctly dealing with problems that, out

of control, can impair system maintenance and evolution. Most research on TD focuses

on application code as primary TD sources. In an MDD project, however, dealing with

technical debt only on the source code may not be an adequate strategy because code

generation is often done at a later stage than creating models, and then dealing with TD

only in source code can lead to unnecessary interest payments due to unmanaged debts.

Recent works concluded that MDD project codes are not technical debt free, making it

necessary to investigate the possibility and bene ts of applying TD identi cation techniques

in earlier stages of the development process, such as in modeling phases. This

work intends to analyze whether it is possible to use source code technical debt detection

strategies to identify TD on code-generating models in the context of model-driven development

projects. For this, we investigated source code TD detection techniques were

evaluated with the purpose of adapting them to be used in the model abstraction level.

A catalog of nine di erent model technical debt items for platform-independent codegenerating

models was speci ed, with detection strategies that can automate the TD

identi cation process in the models. Then, the proposed detection strategies were implemented

in a tool that allows both speci cation and automatic identi cation of so-called

model smells in EMF models. A preliminary study was performed to evaluate and re ne

the proposed detection strategies by using 3 opensource projects versioned on Github.

Then, a deeper experimental study was performed in order to evaluate the detection strategies

real e ectiveness in anticipating the technical debt identi cation, detecting them

in the code-generating models. A total of 9 di erent projects, with 36 EMF models and

more than 400 thousand lines of code were used in this evaluation, all opensource, git

versioned and developed with EMF technology. Results showed that the evaluated detection

strategies are able to anticipate a great amount of source code technical debt in the

code-generating models. However, di erent catalog items presented di erent anticipation

e ectiveness levels and various levels of occurrences in the studied projects. A discussion

was performed for each evaluated item in order to explain the obtained results. Finally,

aspects were obtained that may guide future works in improvements and also extension

of the catalog.

EDERSON DOS SANTOS ASSUNÇÃO
Investigating the incidence of code smells in methods of design patterns
Leader : RODRIGO ROCHA GOMES E SOUZA
MEMBRES DE LA BANQUE :
CLAUDIO NOGUEIRA SANT ANNA
RODRIGO ROCHA GOMES E SOUZA
RICARDO TERRA NUNES BUENO VILLELA
Data: 1 sept. 2021

Afficher le Résumé
Design patterns are reusable solutions that can be applied to solve specific problems of software projects. However, developers may not implement patterns adequately on some occasions. This can lead to the appearance of code smells. Code smells (or smells) are fragments of code that can indicate potential design flaws. Co-occurrences are situations in which design patterns and smells exist simultaneously in the same code unit (e.g., classes and methods of Java-based software projects). In case of such phenomenon occurs frequently it can reveal opportunities to investigate if a bad practice in the application of patterns is the cause of the occurrence of smells. As a consequence, refining the way how developers apply design patterns may be a good measure to avoid code smells. In this context, we present two experimental studies that aim at examining cases of co-occurrence of patterns and smells. Both studies required the application of detection tools to evaluate Java-based software projects. With our first study, we seek to comprehend the frequency by which code smells co-occur with design patterns. We also report the most common cases of co-occurrence. To achieve this, we identified instances of smells in methods of design patterns of 25 software systems. We also manually inspected source code fragments to obtain useful information about pairs of patterns and smells. Among other findings, we perceived that methods that take part in the implementation of the Adapter pattern are prone to contain smells, e.g., the Feature Envy smell, although it can be argued that the detection of this smell in this context is a false positive. Our second study has the purpose of analyzing the evolution of open-source software projects. We wanted to provide insights about the co-occurrences along with the evolution of the projects. As a result, we found out that the Template Method pattern is more inclined to appear concomitantly with smells through time.

LEILA DE CARVALHO COSTA
FORMAL VERIFICATION OF SYSTEMS OF SYSTEMS (SOS) MODELED IN BPMN CHOREOGRAPHY DIAGRAMS

- PREVIOUS DETECTION OF TYPICAL RUNTIME ERRORS
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
FLAVIO OQUENDO
LAIS DO NASCIMENTO SALVADOR
RITA SUZANA PITANGUEIRA MACIEL
Data: 14 sept. 2021

Afficher le Résumé
System of systems (SoS) is composed of a set of systems, which interact together for

a common goal. Thereby, it tends to be larger and more complex than traditional systems.

To address the question of the inherent complexity of this kind of complex systems,

they are frequently modeled using the Business Process Modeling and Notation (BPMN),

which is a standard modeling notation for business processes. The choreography diagram,

introduced in version BPMN 2.0, provides suitable concepts for representing the interactions

among constituents of a SoS. However, models created using this notation may

contain errors, some of which can be detected at design-time and others only at runtime.

Syntax errors are easily detected with the assistance of modeling tools. Nevertheless,

the absence of a formal semantics for BPMN makes it harder to identify runtime errors,

as for example, deadlocks, livelocks and other safety properties in BPMN choreography

diagrams. These errors are challenging to detect and may lead to an improper operation

or even a system lockup. In this context, a method for identifying runtime errors consists

of translating the BPMN diagram into a formal model that can be analyzed in a model

checker. Thereby we present an approach to build a formal model for the BPMN choreography

diagrams in terms of the formal language ⇡-ADL, which is properly designed for

the specification of dynamic architectures, an intrinsic characteristic of SoS. Therefore,

we define the mapping of the elements of the choreography diagram to ⇡-ADL, in order

to obtain its formal description in ⇡-ADL. Such ⇡-ADL models allow its formal verification

using a specific model checker, enabling the prior detection of runtime errors in the

modeled system.

BRUNNA CAROLINE DO CARMO MOURÃO
MDSS: A model to support the development of sustainable software systems

80 / 5000

Resultados de tradução

MDSS: A model to support the development of sustainable software systems
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
SHEILA DOS SANTOS REINEHR
CLAUDIO NOGUEIRA SANT ANNA
IVAN DO CARMO MACHADO
Data: 5 oct. 2021

Afficher le Résumé
Thinking sustainably is an urgent need for the planet and an obligation for all areas of knowledge. Thus, the information technology area has been looking for ways to contribute to sustainability by investigating improvements related to the efficiency of energy consumption by software and hardware. The difficulty of this theme is due to the importance and the complexity of adapting traditional software development processes to new procedures, not to mention that human activities have been increasingly dependent on software, with the recent prevalence of developed software systems for the domain of mobile applications.

In this context, a significant challenge lies in investigating methods that increase software sustainability, thus reducing energy consumption by the software, which minimizes the impact of technology on natural resources.

Software sustainability can be analyzed from different perspectives (social, economic, technical, and environmental). Thus, before identifying, measuring, and managing the energy consumption of software, it is necessary to understand the concepts and definitions coined in the literature, existing tools, the level of knowledge on the part of professionals, and the proposed techniques and methods.

This work developed a model for sustainable software development and organized a body of knowledge on sustainability in Software Engineering based on studies in the literature, considering concepts/definitions, methods/models, and working professionals' opinions.

To achieve the goals mentioned above, the research followed two lines of work: (1) carrying out systematic mapping of the literature complemented by a survey for the collection of evidence that allowed to establish an overview of sustainable software engineering; (2) Structuring the identified information to develop the model intended to support teams in developing sustainable software, which took place through qualitative studies with industry professionals.

SARA MENDES OLIVEIRA LIMA
Thematic synthesis on the adoption of regression testing techniques in software projects for the Android platform
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
ALCEMIR RODRIGUES SANTOS
IVAN DO CARMO MACHADO
RODRIGO ROCHA GOMES E SOUZA
Data: 13 oct. 2021

Afficher le Résumé
The mobile device market has grown exponentially in recent years, as the need for apps that strive for quality and that offer features that increase user retention. In recent years, a growing number of studies have provided solutions to problems inherent to the demand mentioned above. The software testing activity plays an essential role in the software quality assurance process. In particular, regression tests are presented as a viable strategy to deal with the complexity and constant evolution of applications since its main objective is to ensure that changes made between versions of a product do not change the system's behavior. Although the literature has devoted efforts to developing new regression testing techniques for Android - the most popular operating system for mobile devices - existing studies do not address how professionals use testing techniques. This investigation aims to carry out a thematic synthesis on the adoption of regression testing techniques in software projects for the Android platform. We organized the research in four steps: (i) carrying out a structured review of the technical literature on regression testing of Android applications, (ii) applying a survey, (iii) carrying out an identifier with professionals from the industry, and (iv) the construction of a thematic synthesis. The survey got 100 responses and provided preliminary insight into how the testing process is performed during and after application maintenance, how automated regression testing of Android applications is in practice, and why the professionals do not perform regression testing after updating applications. In the interview study, we inquired 16 industry professionals. This study contributed to the research as it could unveil the level of knowledge of professionals and its relationship with test automation. In addition, we could identify the most used languages in the development of Android applications and the tools used for testing automation. Furthermore, the study provided us with a preliminary understanding of how professionals perform regression testing in practice. Finally, the thematic synthesis presents a model resulting from the comparison of evidence gathered from different data sources - literature, survey and declares about the use of regression test techniques. These studies bring an insight into the subject and provide possible future investigations.

OTÁVIO GONÇALVEZ VICENTE RIBEIRO FILHO
A new Artificial Neural Network architecture based on end-to-end approaches to classify signals
Leader : RICARDO ARAUJO RIOS
MEMBRES DE LA BANQUE :
RICARDO ARAUJO RIOS
RUBISLEY DE PAULA LEMES
MOACIR ANTONELLI PONTI
Data: 10 nov. 2021

Afficher le Résumé
The increasing computer power, the introduction of specialized processors designed to accelerate graphics rendering, and the high availability of new data have supported the adoption of artificial neural networks (ANN) to solve real-world problems in different areas, such as computer vision natural language processing. The application of such networks to deal with data characterized by temporal dependencies, as signals (time series), produces better results when a preprocessing stage is considered to extract implicit information. In this work, by taking into account the different preprocessing methods, we have noticed that wavelet transforms have been widely used due to, in general, the remarkable results. However, such transformations have two main drawbacks by modeling signals. Firstly, it is necessary to define a set of parameters to extract better information from different levels of scale and resolution. Secondly, the application of such transformations requires an initial step to convert, in batch, all analyzed signals before starting the network training. In our scenario, such limitations were overcome by our new ANN architecture, which analyzes signals and yields as output a set of characteristics similar to those produced by wavelet transforms. Therefore, our ANN can be combined with others to model signals without requiring an execution a priori of any wavelet transformation. The proposed ANN was assessed against a pre-trained network to classify signals from two real-world applications: local field potential (LFP) from monitoring monkey brains and seismic signals from the Llaima volcano. The final accuracy obtained to model both applications using traditional preprocessing, and our approach were, respectively: (i) LFP -- 0.655 and 0.649, and (ii) Llaima -- 0.976 and 0.974.

ANDERSON DA SILVA BRITO SACRAMENTO
A computational method for closed domain automatic event extraction in Portuguese
Leader : MARLO VIEIRA DOS SANTOS E SOUZA
MEMBRES DE LA BANQUE :
PAULO MIGUEL TORRES DUARTE QUARESMA
MARLO VIEIRA DOS SANTOS E SOUZA
RENATA VIEIRA
Data: 23 nov. 2021

Afficher le Résumé
The registration and digital publication of texts in natural language is part of different domains of human activity. Among the information recorded, events are a significant part of the content, since, regardless of the domain, new events continue to happen, and those considered relevant are recorded. With the spread of information technologies, the ease of access to these records has increased the supply of registered events and also the demand for knowledge to aid decision making, derived from new applications of a large volume of information increasingly globalized and instantaneous. The task of event extraction aims to automate the extraction of events from text by identifying the event type and its participants and attributes. This is an information extraction task, which is part of the area of natural language processing. There is no known solution with a performance equal or superior to that of a human for this task in any natural language. In the specific case of the Portuguese language, there is not only a lack of a solution, but there is a need for methods and linguistic resources for the development and evaluation of those methods (via learning from data, i.e. machine learning). To the best of our knowledge, the few works that were published did not perform event extraction in a closed domain configuration, when one knows the structure of the events that one aims to extract. In this research project proposal, we intend to develop a method for automatic event extraction in a closed domain configuration in Portuguese sentences and to build or adapt a corpus in Portuguese that supports the development of this method. This work presents a method for event extraction in a closed domain configuration in Portuguese language sentences based on deep learning using contextualized word embeddings. Furthermore, to support the development and evaluation of the proposed method, we construct a Portuguese corpus with annotations of events in closed domain.

JOSELITO MOTA JÚNIOR
An evidence-based study on issue labeling in Github-based repositories
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
IVAN DO CARMO MACHADO
CLAUDIO NOGUEIRA SANT ANNA
JOSÉ AMÂNCIO MACEDO SANTOS
Data: 7 déc. 2021

Afficher le Résumé
The open-source software community has grown in size and importance over the years. As a consequence, the number of project contributors has increased considerably. The capability of open-source project repositories to accommodate issue reports is essential. An issue report encompasses a large set of data that describes the necessary changes a software should handle. As developers need detailed information to reproduce and find them, incomplete information is a severe problem that may influence triage and defect detection leading to delays in project maintenance. Issue trackers commonly use the labeling method to add extra details to issues. Knowing the importance of labels, this dissertation focus on investigating the usage, creation, and similarities in the context of the issue lifecycle in both maintenance and evolution in the repository issue trackers of the largest and most popular code hosting platform, Github. In addition, it analyzes the number of labeled and unlabeled issues in the repository and the connection between the issues' components, with an analysis focused on the lifecycle. The results indicate a significant correlation between repositories with many issues and the creation of labels, but not all repositories use them. 64.58\% of the repositories insert new labels as the project evolves. 73.14\% repositories applied on issues the Github standard labels. We also found an influence of primary issue fields such as title, description, and comments in most issue labels, impacting the creation and labeling issues. These numbers show that issue labeling is of prominent relevance for project maintenance and evolution. It provides developers with an easy and convenient way to inform about an incoming issue reported by systems users.

Wallas Fróes de Oliveira
ProgLab: On Provisioning QoS Through Programmable Labels in Residue-Defined Networking
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
LEOBINO NASCIMENTO SAMPAIO
MAYCON LEONE MACIEL PEIXOTO
RODOLFO DA SILVA VILLAÇA
Data: 9 déc. 2021

Afficher le Résumé
Residues Defined Network (RDN) have the ability to use labels for end-to-end routing in autonomous systems. RDN is characterized by dispensing with the use of routing tables at the core of the network. In this way, packets are forwarded based on the remainder of the division between the label found in the packet and the local key found in the core switch. Initially, RDN was intended to improve latency in SDNs, since the switches at the core of the network do not communicate with the controller. On the other hand, a range of solutions adopts the Quality of Service (QoS) based on the use of MPLS, which allows providing differentiated services to different traffic classes and patterns and generating considerable benefits for network communications. However, such solutions have high OPEX and CAPEX costs.Motivated by the capacity of RDNs, this dissertation brings an alternative method for using the labels in the core of the network, in addition to trying to solve the problem of scalability and flexibility found in the proposed KeyFlow solution and offer new functionalities for this technology.Through the P4 language, a new flexible, scalable and reprogrammable protocol for an RDN was implemented, in order to implement the model for provisioning QoS by programmable labels, ProgLab {Programmable Labels}.ProgLab implements a new method for reading labels by core switches, which uses the residual number system to define the output ports and queue priority, thus providing QoS to the RDN. To validate the proposal, emulation tests were performed with QoS and Quality of Experience (QoE) metrics to evaluate and demonstrate the feasibility of the model.The results demonstrated the viability of this proposal for the provision of QoS in core networks that use the RNS as packet forwarding, showing that in the future an RDN can become a low-cost alternative for packet forwarding with QoS services.

CÍNTIA DA COSTA SOUZA
DOSN-PROV: Model and Services for Data Provenance in Decentralized Social Networks.
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
RITA CRISTINA GALARRAGA BERARDI
CASSIO VINICIUS SERAFIM PRAZERES
MARLO VIEIRA DOS SANTOS E SOUZA
Data: 10 déc. 2021

Afficher le Résumé
The information on social networks cannot always verify the origin, the paths described, and the registered modifications. Issues such as these generate uncertainty and distrust in processes and interactions in OSNs. With the emergence of DOSNs and the increase in the number of active users in these networks, it becomes essential to develop effective solutions regarding the provenance of decentralized data. The provenance is a factor of considerable importance because it is possible to evaluate the information's authenticity, reliability, and relevance through its results. Tracing and capture of provenance are critical tasks for DOSNs since they produce answers about the steps go through by the information. The large volume of data, the speed of generation and sharing of information, and the decentralized storage strategy make the provenance of DOSNs a non-trivial task. Thus, this work proposes the PROV-DOSN provenance ontological model, a specific model for the DOSNs domain, based on the PROV-O specification from W3C. In addition, this work also proposes services based on the DOSN-PROV to support the capture and tracking of provenance in DOSNs. We evaluated DOSN-PROV in two steps to demonstrate compliance with the proposed domain. Finally, the services underwent a performance evaluation, and their results indicated acceptable response times for the capture and tracking tasks.

BEATRIZ SANTANA FAGUNDES SOUZA DE LIMA
A Study about the Influence of Text Specificity in the Perceived Helpfulness Classification of Online Reviews
Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
TATIANE NOGUEIRA RIOS
RICARDO MARCONDES MARCACINI
THIAGO ALEXANDRE SALGUEIRO PARDO
Data: 15 déc. 2021

Afficher le Résumé
Online reviews are valuable sources of information to support the decision-making process, both for individuals and companies. Nevertheless, the large volume of reviews that have a low quality hinders the process of gathering helpful information from those reviews. Several retailers’ websites provide a voting system to allow customers to evaluate other product reviews as helpful or not. However, those votes are often biased and most of the reviews do not receive any votes at all. Besides that, several websites do not even have this voting mechanism or any other component for organizing the reviews in terms of their helpfulness. Therefore, classifying reviews according to their helpfulness has paramount importance in facilitating access to truly informative content. In this context, previous studies have unveiled several features and architectures that are beneficial for the perceived helpfulness prediction. In the present work, we argue that text specificity, defined as the level of details expressed in a text, can influence the perception of review helpfulness and, consequently, can also be a novel useful linguistic aspect for modeling the helpfulness prediction. We proposed two approaches to incorporate the specificity aspect into helpfulness classification models: i) using hand-crafted features based on text specificity and ii) using the review specificity prediction as an auxiliary task in a Multitask Learning (MTL) setting. First, we conducted an unsupervised domain adaptation approach [Ko, Durrett and Li 2019] to label text specificity scores on sentences from online reviews automatically. To evaluate the different trained models using this approach, we proposed a measure named Specificity Prediction Evaluation (SPE), which is based on the assumption that, on average, reliable specific sentences tend to be longer than reliable general sentences [Li and Nenkova 2015]. For the 18 collections of online reviews used in all of our experiments, we could achieve more reliable specificity predictions, according to SPE, by varying only the training set size and the number of training epochs. Finally, we performed experiments to assess the performance of the helpfulness classification models using two neural architectures: Convolutional Neural Network (CNN) [Kim 2014] and Bidirectional Encoder Representations from Transformers (BERT) [Devlin et al. 2019]. In summary, using balanced datasets, the perceived helpfulness classification models, embodied with text specificity- either as features or MTL - showed significantly higher precision results in comparison to a popular SVM baseline when using CNN. With BERT, the experiments showed that MTL outperformed the single-task models for most of the 18 datasets and both accuracy and precision were improved compared to the SVM baseline.

JOSÉ RONALDO LELES JUNIOR
DOSN-PRIV:: Model and Service for Privacy in Decentralized Online Social Networks
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
DILVAN DE ABREU MOREIRA
CASSIO VINICIUS SERAFIM PRAZERES
FREDERICO ARAUJO DURAO
Data: 21 déc. 2021

Afficher le Résumé
The intensive use of Online Social Networks (OSNs) has been caused users to worry about data privacy. Given the centralized nature of these platforms, and since each has a particular storage mechanism, authentication, and access control, its users do not have the control and the right over their data. Thus, users can not easily switch between similar platforms or even transfer data from a platform to another. This implies, among other things, in a threat to privacy, since such users depend on the interests of the service provider responsible for OSN's administration. As a strategy for the decentralization of the OSNs and consequently as a solution to the privacy problems in these environments, the so-called Decentralized Online Social Networks (DOSNs) have emerged. Unlike OSNs, DOSNs are decentralized content management platforms in the sense that they do not use of centralized service providers. Although DOSNs address some of the privacy issues encountered in OSNs, the DOSNs also pose important challenges to be considered, for example, with regard to access control to user profile information with a lower level of granularity. In this context, this work proposes the development of an ontological model and a service to support privacy in DOSNs. The proposed model will be formalized through an OWL ontology that should describe the main concepts of access control for privacy in DOSNs and their relationships, while the service will consume the model to apply access control according to the policies represented in the model. The Dosn-Priv model was evaluated in two phases to verify its compliance with the proposed domain. Finally, the DosnPrivServ privacy service underwent a performance evaluation and its results were satisfactory regarding the response time of access control requests.

LUCINÉIA BATISTA DE SOUZA
ORGANIZATION OF A SET OF EXPERIMENTAL FINDINGS ON TECHNICAL DEBT RELATED TO
TEST
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
MANOEL GOMES DE MENDONCA NETO
JOSÉ AMÂNCIO MACEDO SANTOS
THIAGO SOUTO MENDES
Data: 23 déc. 2021

Afficher le Résumé
Context: The term Technical Debt (TD) refers to activities deferred during the development of the software project. TD can be compared to a financial debt whose debtor has the option of paying the principal amount of the debt with interest. Several initiatives have sought to identify the causes and effects associated with TD. Among them, the InsightTD project has carried out a survey in several countries to collect evidence on the state of TD practice. One of the results obtained by the project was the list of causes and effects of TD reported by software professionals. TD can be incurred at any stage of development, such as software testing. This activity aims to ensure the quality of the software being developed. However, both the project and the technical literature have not addressed the TD items that occur in software testing activities. Objective: To investigate, from the point of view of software testing professionals, the types of TD associated with test-related TD, the causes that lead to it and the effects of its existence. Method: A replication of the survey of the InsighTD project was carried out with software testing professionals in Brazil. In addition, responses from software professionals collected in the first execution of the survey were used. Thus, 54 responses from participants in the Brazilian software industry were collected and analyzed, quantitatively and qualitatively. Result: This study highlights the point of view of testing professionals on TD. It was noticed that test-related debt items are related to different types of TD, such as documentation, code, and the test debt itself. Sixty-one causes of TD were identified and organized into eight categories. The main causes identified were deadlines, outdated/incomplete documentation and inadequate planning. Forty-five TD effects were identified and grouped into six categories. The main effects are delivery delays, rework and poor external quality. Conclusion: The list of causes and effects along with their categories was organized into a set of cause and effect diagrams that can help software professionals in managing TD.

Thèses

CLEBER JORGE LIRA DE SANTANA
AN ARCHITECTURE BASED ON REACTIVE MICROSERVICES FOR DEVELOPING RELIABLE IOT APPLICATIONS
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
CASSIO VINICIUS SERAFIM PRAZERES
CHRISTINA VON FLACH GARCIA CHAVEZ
NABOR DAS CHAGAS MENDONÇA
PAULO DE FIGUEIREDO PIRES
THAÍS VASCONCELOS BATISTA
Data: 9 mars 2021

Afficher le Résumé
The Internet of Things has changed the scope of the Internet and it has become a network connecting a myriad of devices of different types. The integration of these heterogeneous devices fosters novel services and applications, generates value-added information and actionable knowledge for the end user. Developing IoT applications and services and fulfill their quality of service requirements, such as confidentiality, interoperability, scalability, evolvability, adaptability, and reliability pose multiple challenges. These challenges are imposed mainly by the ultra large scale of the IoT, the heterogeneous nature of applications and devices, and the highly dynamic environment. Nowadays, the microservices pattern has been successfully applied by companies such as Netflix and SoundCloud to address some of these issues in the context of cloud computing applications. However, in the IoT field, the wide adoption of this Microservices still faces some obstacles to be solved in order to achieve the potential benefits of such approach. The goal of this work is to propose an architecture based on reactive Microservices for the development of IoT applications. The proposed architecture consists of a set of software components, tailored to meet the typical requirements of IoT applications that demand high availability. In addition, the proposal includes a software platform that materializes several of its components and help, at runtime, to meet the availability requirement of IoT applications. We applied our proposal in a real-world scenario in the field of Smart Agriculture and in a simulated scenario in the field of Smart Water. The experimental results conclude that our approach improves availability, minimizes latency and maximizes throughput, and increases accuracy significantly, in comparison to another study. We also carried out a performance evaluation from two perspectives. The first perspective is related to the evaluation of the IoT application through different deployment policies, which aimed to show how the architecture components used in microservices can impact the use of computational resources. The second perspective is related to the evaluation of the IoT application that uses the components of the proposed architecture in a scenario in which we increased the traffic of IoT data. The results of the performance evaluation in the two perspectives show that the IoT application remained functional during the analysis performed.

JOÃO PAULO DIAS DE ALMEIDA
Exploiting Open Data for Improving Spatial Keyword Query Applications
Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
CARLOS ANDRÉ GUIMARÃES FERRAZ
CARLOS EDUARDO SANTOS PIRES
DANIELA BARREIRO CLARO
FREDERICO ARAUJO DURAO
VANINHA VIEIRA DOS SANTOS
Data: 3 mai 2021

Afficher le Résumé
Nowadays information is an essential resource in various sectors of the economy. The popularization of social networks, smartphone applications, and online services has increased the volume of data available online. Among this extensive amount of information, there is a specific data type called spatial data. It represents a physical object using its spatial coordinates (e.g. latitude and longitude). Spatial data is critical in a large number of application domains (e.g. land use, transportation plan). For instance, the user can find points of interest (POIs) or be warned of critical situations by spatial data applications like web search engines or emergency response applications. It’s been asserted that 80% of all data business has some locational reference. Spatial queries are widely employed to manipulate spatial data more efficiently. However, the user has a crucial role in the spatial information retrieval process when querying the needed information. For decades, researchers have proposed several techniques to aid users in expressing their information needs, such as Boolean models, pattern matching operators, and query expansion. Despite the existence of relevant alternatives in the field, there is still a lack of solutions applied to keyword preference queries. The Spatial Keyword Preference Query (SKPQ) arises as a potential solution to assist users in finding POIs. SKPQ selects POI based on the description of features in their neighborhood. In essence, the user defines a spatial (i.e. radius) and textual (i.e. query keywords) constraint to be satisfied. In this context, this thesis aims at proposing strategies to improve SKPQ results. The contribution is threefold. First, two Linked Open Data (LOD) repositories (i.e. DBpedia and LinkedGeoData) are exploited to improve the features description. The feature description in LOD contains more information than traditional spatial databases, leading to a more detailed description. Second, the query results are personalized to present the best POIs for the underlying user. By exploiting reviews on POIs, the system identifies the object that best satisfies the user and re-order the rank with respect to the user preference. Third, we model the user preference in visiting locations near to each other using a probabilistic function. This function is incorporated into the ranking function to retrieve POIs considering this user preference. We evaluate each technique employed in this proposal separately. The first technique achieves a relative NDCG improvement of 20% when using random query keywords. Also, it finds POIs where SKPQ is unable to find. The second technique further improves the relative NDCG by 92%. Finally, the third technique improves the rank consistency achieving a Tau performance of 52%.

JÔNATAS FERREIRA BASTOS
UNDERSTANDING TEST EVOLUTION: FROM HIGHLY-CONFIGURABLE SYSTEMS TO SOFTWARE ECOSYSTEMS
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
EDUARDO SANTANA DE ALMEIDA
CHRISTINA VON FLACH GARCIA CHAVEZ
RODRIGO ROCHA GOMES E SOUZA
GUSTAVO HENRIQUE LIMA PINTO
MARCELO BEZERRA D'AMORIM
Data: 4 mai 2021

Afficher le Résumé
Software evolution is inevitable if the systems are planned to survive in the long-term. Equally, well-understood is the necessity of having a good test suite available to ensure the quality of the current state of the software system and to ease future changes. This is especially true in the context of reusable systems since they are planned to attend for a long time a specific market niche and need to support a large number of configuration options. However, developing and maintaining a test suite is time-intensive and costly. This situation is challenging for the projects: on the one hand, tests are essential for the success of software; on the other hand, tests become a severe burden during maintenance. Even though a substantial body of literature has studied testing in reusable environments, test evolution analysis has not been addressed. In general, researchers have looked into analyzing test strategies, dynamic test selection techniques, and co-evolution of tests along with other systems artifacts. This thesis intends to improve the test evolution body of knowledge in reusable systems, investigating characteristics that indicate the effort to develop and maintain the test suite and unveiling how the reusable aspects affect the tests. The set of evidence can help researchers and practitioners to better planning the test development and evolution. This way, we employed a multi-method approach to develop the understanding of test evolution in configurable systems and unveil evidence on the topic from various sources. In the first phase of the research program, we provided an overview of the existing research related to this thesis’s subjects and presented related work to our investigation. The second phase was composed of four empirical studies. First, we performed a case study to analyze the test evolution of a large configurable system. Next, we performed a comparative study to evaluate the test evolution in 18 open-source projects from various sizes and domains in configurable systems and their similarities and differences to 18 Single Systems (SS) projects. Third, we conducted an extended study to analyze the test suite evolution in another category of reusable systems to verify whether some observations are recurring and gather new data that support the findings. Finally, we surveyed test contributors to investigate the test evolution from the development point of view and improve the findings in previous stages. This work collected a set of findings of test evolution, and these findings were strengthened by using different research methods. Our work provided a better understanding of test evolution in configurable systems by documenting evidence observed in open-source projects and test contributors. Moreover, in this Thesis, we synthesized the gathered evidence and identified open issues in this research topic. These findings are an important step to establish guidelines for addressing test evolution in configurable systems.

FLAVIA MARISTELA SANTOS NASCIMENTO
A Flexible Framework to Schedule Hard and Soft Tasks in Multiprocessor Real-Time Systems
Leader : GEORGE MARCONI DE ARAUJO LIMA
MEMBRES DE LA BANQUE :
GEORGE MARCONI DE ARAUJO LIMA
VINICIUS TAVARES PETRUCCI
DANIEL MOSSÉ
KONSTANTINOS BLETSAS
LILIANA CUCU-GROSJEAN
Data: 28 juin 2021

Afficher le Résumé
The increasing number of modern applications, that rely on the processor capacity of multicore CPUs, has been pushing the area of real-time systems towards multiprocessor architecture. In fact, such applications are present in a wide variety of areas, such as automotive, avionics and personal electronics industries, most of them being composed of both time critical (hard) and non-critical (soft) services.
Indeed, for multiprocessor real-time systems that deal with both hard and soft tasks, one is interested in minimizing the response time of the soft tasks, while not jeopardizing timeliness of hard tasks. Several scheduling approaches have been proposed to deal with hard and soft tasks on a single processor, while for a multiprocessor environment such approaches are less common and are typically extensions of the mechanisms described for uniprocessor systems.

This work describes a flexible scheduling approach designed for multiprocessor real-time systems composed of both hard and soft tasks. According to the proposed approach, hard tasks are assigned to processors in an off-line manner and do not migrate during their execution. As for soft tasks, two main configurations are possible. They may be placed either in each processor's local queue, for which migration is not allowed, or in a global queue, being accessed by all processors. The scheduling approach described in this thesis is strictly linked to the concept of servers, which are scheduling proxies responsible for scheduling the tasks (or other servers) assigned to them. Two special servers are described, whose definition, rules and correctness proof are also presented in this thesis. Together, these servers are able to (a) encapsulate soft tasks so that their execution does not put at risk the timing requirements of hard tasks and to (b) reclaim system unused slack to favor the execution of soft tasks. By doing so, the average response time of soft tasks are improved, without jeopardizing the timeliness of hard tasks.

It is shown that by using the scheduling approach described in this thesis, in both configurations, improves the QoS-related metrics for soft tasks, such as deadline miss ratio and average response time, without compromising timeliness guarantees for hard tasks. The experiments has shown a reduction of up to $27\%$ in the deadline miss ratio and a reduction of up to $14\%$ in the average response time of the soft tasks.Such results indicate that the available processing capacity left unused by hard tasks can be effectively used for executing soft tasks.Additionally, the proposed approach has shown to be flexible enough to be adapted to other EDF-based scheduling policies at low cost..

RAFAEL LIMA COSTA
Tactful Networking as a cornerstone for Opportunistic Human-Aware D2D Communication
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
ALINE CARNEIRO VIANA
EDUARDO COELHO CERQUEIRA
FABIOLA GONCALVES PEREIRA GREVE
LEOBINO NASCIMENTO SAMPAIO
RICARDO ARAUJO RIOS
Data: 3 août 2021

Afficher le Résumé
Among the mobile network challenges, there is data traffic growth, an increasing number of connected devices, spectrum limitations, and costly infrastructure updates. In this context, opportunistic D2D communication appears as a solution to offload data, increase capillarity, deliver content in specific scenarios (e.g., emergency) and foster innovative applications and services. Previous standards like LTE discussed D2D broadly. Still, recent research remarks that 5G/6G will be D2D's real enablers, given the possibility of using human-behavior big data from users' devices. Previous opportunistic D2D forwarding algorithms extracted mobility characteristics for improving content delivery cost-effectiveness. Nevertheless, most initiatives dealt with traditional metrics due to constraints, such as limited availability of real traces or the lack of a human-centered networking vision. Next-generation solutions need a more in-depth vision of human aspects hidden into datasets. Moreover, scenarios practical to mobile carriers lack evaluation. To this end, this thesis~\textit{guides the reader through the whole process for building a novel Tactful Opportunistic COmmunicaTion Strategy (TOOTS)}. TOOTS leverages wireless encounter patterns, temporal, spatial, geographic, and direction awareness to improve cost-effectiveness content delivery in a mobile networks scenario. The proposal consists of: learning human-aspect state-of-art best practices; achieving insights to improve strategy's performance; using, proposing, and analyzing human-aware metrics; combining metrics and insights into the strategy targeting an improved performance in a more realistic mobile scenario; and finally, evaluating the strategy through its strengths and shortcomings. This thesis shows that TOOTS improved the performance of an opportunistic content delivery scenario in terms of overhead, delivery rate, and latency by following this process. First, to achieve the results, we survey the human aspect in networking solutions for over a decade. We found that there is an evolution in how the human links to networking challenges. This broader human-aware vision culminates into the Tactful Networking perspective, which we introduce to follow, including discussions concerning application examples and insights. We complete the literature review by discussing state-of-art opportunistic forwarding strategies, including contributions, gaps, and open issues. Second, we discuss a framework for enhancing raw human data hidden into datasets. A case study with the MACACO dataset validates the framework through trace characterization results and analysis. Following, we characterize MACACO, NCCU, and GRM datasets to bring insights and to validate TOOTS. The presented and analyzed strategy's metrics come from such datasets' characterization results and insights. Third, we introduce TOOTS, targeting cost-effective content delivery in a mobile scenario. Afterward, we present the formal definition for choosing the disseminator nodes problem (TOOTS' 1st phase). As motivation, we analyze the overhead and latency of the Epidemic forwarding. Subsequently, we introduce TOOTS' two phases: a Tactful Dissemination Policy and a forwarding algorithm. Finally, we compare TOOTS' effectiveness on its phases with enhanced Store-wait-forward, Epidemic, and Bubble Rap algorithms through real and synthetic traces. Among the main outcomes, in a scenario with a 30~m range, TOOTS reached a 100\% delivery ratio with 28\% and 73\% reduced delivery latency, and with 16\% and 27\% reduced overhead, respectively, in the real and synthetic datasets. With a restricted (10~m) range, TOOTS was the fastest strategy and the only one that delivered 100\% of the contents within the real trace. Consequently, this thesis contributes to addressing the opportunistic content delivery problem in cellular networks by providing a cost-effective human-aware opportunistic solution. The strategy can be applied in different scenarios to assist the operator in delivering content without not necessarily using their legacy network, e.g., by exploring the capillarity of their mobile users, for data offloading and other applications.

MARCOS BARBOSA DÓSEA
Design-Sensitive Metric Thresholds based on Design Roles
Leader : CLAUDIO NOGUEIRA SANT ANNA
MEMBRES DE LA BANQUE :
CLAUDIO NOGUEIRA SANT ANNA
CHRISTINA VON FLACH GARCIA CHAVEZ
MANOEL GOMES DE MENDONCA NETO
MAURÍCIO FINAVARO ANICHE
MARCELO DE ALMEIDA MAIA
Data: 16 août 2021

Afficher le Résumé
State-of-the-art techniques and Automated Static Analysis Tools (ASATs) for identifying code smells rely on metric-based assessment. However, most of these techniques have low accuracy. One possible reason is that source code elements, such as methods implemented according to different design decisions, are assessed through the same generic threshold for each metric. Other possible reason is that these metric thresholds are usually derived from classes driven by different design decisions. Using generic metric thresholds that do not consider the design context of each evaluated class can generate many false positives and false negatives for software developers. Our goal is to propose design-sensitive techniques to derive contextual metric thresholds. Our primary hypothesis is that using the design role played by each system class to define this context may point out more relevant code smells to software developers. We conducted some empirical studies to define the proposed techniques. Firstly, we performed a large-scale survey that showed that practitioners recognize difficulties in fitting ASATs into the software development process. They also claim that there is no routine for application. One possible reason practitioners recognize that most of these tools use a single metric threshold, which might not be adequate to evaluate all system classes. Secondly, we conducted an empirical study to investigate whether fine-grained design decisions also influence the distribution of software metrics and, therefore, should be considered to derive metric thresholds. Our findings show that the distribution of metrics is sensitive to the following design decisions: (i) design role of the class (ii) used libraries, (iii) coding style, (iv) exception handling, and (v) logging and debugging code mechanisms. We used these findings to propose two new techniques to derive design-sensitive metric thresholds using the class design role as context. Then, we carried out two large-scale empirical studies to evaluate them. The first study showed that our proposed techniques improved precision according to developers' perceptions. Since it is impossible and tiring to perform a complete source code quality assessment with developers, we conducted a second study mining the evolution of software projects from popular architectural domains. We found that our techniques improved recall to point out methods effectively refactored during software evolution.

ERIC BERNARDES CHAGAS BARROS
Jemadar-AI: A Fog-Based Framework for Microgrid Power Management
Leader : MAYCON LEONE MACIEL PEIXOTO
MEMBRES DE LA BANQUE :
MAYCON LEONE MACIEL PEIXOTO
GUSTAVO BITTENCOURT FIGUEIREDO
LEOBINO NASCIMENTO SAMPAIO
LUIZ FERNANDO BITTENCOURT
EDWARD DAVID MORENO ORDONEZ
Data: 23 sept. 2021

Afficher le Résumé
The constant increase in demand for electricity and the reduction of conventional means of production point to a future energy crisis. One of the factors that impact the growing demand is uncontrolled consumption, generating an increasing energy dispatch to meet the consumer's needs. Thus, production needs to use a greater amount of energy sources, especially during peak consumption hours. To control demand, demand management techniques are being used by consumers to reduce peak periods and energy bills. Likewise, energy production can combine the use of renewable and non-renewable energy sources with a focus on reducing non-renewable energy production. In this scenario, intelligent energy production requires automated control to avoid energy losses. Currently, several researchers propose the use of optimization algorithms for electrical equipment scheduling as a demand management solution, so the use of equipment be displaced to avoid peak energy periods. Research has also been proposed on the use of the Proportional-Integral-Derivative (PID) control to balance renewable and non-renewable production. However, few researchers consider fog computing with the help of the cloud to perform the processing of these algorithms. Thus, this work proposes the JEMADAR-AI framework to deal with the data traffic of electrical networks. JEMADAR-AI trains two neural networks based on reinforcement learning: (i) the first one to performs equipment scheduling; (ii) the second adjusts the PID controller dynamically. For this, JEMADAR-AI uses an architecture that combines fog computing with cloud computing to reduce request response times for equipment scheduling and power adjustment requests. The results obtained through the performance evaluation showed that the use of JEMADAR-AI reduces up to 19% the electrical power at peak hours and 21.6% the values of the energy bills when compared with scenarios that do not perform any optimization.

LAECIO ARAUJO COSTA
SapeS: An architecture for evaluating academic performance in distance education based on Learning Analytics and Ontologies
Leader : MARLO VIEIRA DOS SANTOS E SOUZA
MEMBRES DE LA BANQUE :
MARLO VIEIRA DOS SANTOS E SOUZA
ECIVALDO DE SOUZA MATOS
FREDERICO LUIZ GONÇALVES DE FREITAS
LEÔNIDAS DE OLIVEIRA BRANDÃO
PAULEANY SIMOES DE MORAIS
Data: 28 sept. 2021

Afficher le Résumé
Distance Learning, along with the evolution of Information and Communication Technology (ICT), has higher enrollment rates than traditional education. With this growth, educators started to manage classes with a more significant number of geographically distributed students. Challenges arise for teachers regarding how to monitor academic progress based on the fulfillment of planned educational objectives and assessing students' academic performance, supervising the acquisition of knowledge. Tracking students' learning experiences in Learning Management Systems (LMS) to assess academic performance and student progress is one of the most intense and exhausting challenges for educators due to the amount of educational data produced and the various learning indicators available on the LMS. Thus, this research aims to investigate how an approach to monitoring and evaluating the student's academic trajectory in the LMS, called SapeS, can positively impact the teaching and learning process in Distance Learning. The proposed solution aims to process educational data from the LMS, extract useful knowledge, and make information available to educators and students with techniques of Learning Analytics and Computational Ontologies, guided by Taxonomy of Educational Objectives. The solution was developed based on the Design Science Research (DSR) methodology and applied in a real context for evaluation with the population (n=24). Through the analysis carried out it was verified that there was a strong adherence to the use of the proposed tool and a significant difference between the data collected at different moments of the investigation. There was a greater engagement, motivation, and participation in the academic activities of the investigated course and greater attention from the teachers regarding the evaluation of the student's trajectory, in addition to checking and adapting the pedagogical planning in relation to the student's situation, and the planned educational objectives.

JEAN CLEMISSON SANTOS ROSA
The coordination of collaboration in SPIDe: mechanisms and practices to support interaction co-design
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
ECIVALDO DE SOUZA MATOS
DUARTE NUNO JARDIM NUNES
RODRIGO FREESE GONZATTO
LUCIANA APARECIDA MARTINEZ ZAINA
TAYANA UCHÔA CONTE
Data: 25 nov. 2021

Afficher le Résumé
The creation of interactive computational artifacts should reflect the needs, interests, restrictions, ways of working, and cultural aspects of their potential users whenever possible. SPIDe (Semio-Participatory Interaction Design) was conceived to reflect these aspects in the developed computational product through collaborative design practices (codesign). In this sense, the literature indicates that codesign practices favor the product's identity concerning its potential users. Codesign expresses a collaborative perspective for design, in the sense that people influenced by the creation and use of technologies must collaborate in its creation and development. Collaboration has three well-defined dimensions in the literature, according to the 3C Model: cooperation, communication, and coordination. The cooperation concerns the joint operation of users in a shared space making use of communication; in turn, communication consists of the exchange of messages for mutual and shared construction based on the management carried out by the coordination; coordination defines the activities and the set of objects that will be manipulated, in addition to defining the media and the mode of transmission of messages. Although SPIDe is characterized as an interaction codesign process, several aspects are to be investigated and improved. One of them concerns coordination. Therefore, in this research, we seek to identify how to promote the coordination of collaboration in SPIDe. To this, we investigated mechanisms and practices to support and promote the coordination of collaboration in SPIDe. Methodologically, we conducted multiple case studies with different data collection approaches, such as systematic review and narrative of literature, semi-structured interviews, logbooks, conversation, and Technology Acceptance Model questionnaire and Design Science Research. The thematic analysis based on the data produced supported the improvements made in SPIDe and reported in this thesis report, such as the guidelines for conducting the braindraw technique, the construction of the epistemic tool SPIDe\_Canvas and the construction of a platform for conducting online and distributed from SPIDe. In addition to improving SPIDe, the results of this research can help professionals and researchers build interactive computational artifacts in collaboration with potential users in a fun, dynamic, and effective way.

JEFFERSON FONTINELE DA SILVA
Paying attention to the boundaries in semantic image segmentation
Leader : LUCIANO REBOUCAS DE OLIVEIRA
MEMBRES DE LA BANQUE :
LUCIANO REBOUCAS DE OLIVEIRA
MICHELE FÚLVIA ANGELO
ALEXANDRE DA COSTA E SILVA FRANCO
FLAVIO DE BARROS VIDAL
RICARDO DA SILVA TORRES
Data: 20 déc. 2021

Afficher le Résumé
Image segmentation consists of assigning a label to each pixel in the image in such a way that pixels belonging to the same objects in the image must have the same labels. The segmented area of an object must span all pixels up to the limits (boundaries) with the other objects. The boundary region can provide helpful information for the segmentation process, as it marks a discontinuity that can define a segment limit. However, segmentation methods commonly suffer to explore boundary information and consequently to segment this region. This is so mainly due to the proximity between image regions containing different labels. In view of that, we propose to investigate how to take into account the boundary information when semantically segmenting an image object. Our first contribution is a graph-based image segmenter, called interactive dynamic programming (IDP)-expansion. This is a weakly-supervised method that requires a seed into each object targeted to be segmented in the image, subsequently minimizing an energy function to obtain the image labels. IDP-expansion explores dynamic programming to initialize an alpha expansion algorithm over superpixels to improve boundary information in a segmentation process. Over the Berkeley segmentation data set, our experiments showed that IDP-expansion is 51.2% faster than a traditional alpha-expansion based segmentation. Although IDP-expansion has shown to be faster, it suffers from two matters: A mandatory seed initialization and the lack of semantic information. This further led us to develop a supervised convolutional neural network architecture to semantically explore boundary information. Our novel method, called DS-FNet, uses two streams integrated in an end-to-end convolutional network to combine segmentation and boundary information based on an attention-aware mechanism. To evaluate DS-FNet, we initially conducted experiments on general-purpose (Pascal Context) and traffic (Cityscapes, CamVid, and Mapillary Vistas) image data sets, having the mean intersection over union (mIoU) as the reference metric. DS-FNet outperformed ten segmentation networks in the Pascal Context, Cityscapes, and CamVid data sets. In the Mapillary Vistas data set, DS-FNet achieved second place when compared to five other methods. A second round of experiments was performed to evaluate the generalization of our proposed method on challenging medical imaging data sets, containing several kidney biopsy whole slide images (WSIs). The data sets used to evaluate the second version of our network were HubMAP, WSI Fiocruz, and a subset of Neptune data set, all considering glomerulus segmentation. After training DS-FNet only over HubMAP data set, containing periodic acid-Schiff (PAS)-stained WSIs with only non-injured glomeruli, we found that our network was capable to segment glomeruli on WSIs stained by other methods (hematoxylin-eosin (HE), periodic acid-methenamine silver (PAMS), trichrome (TRI), and silver (SIL)). The results of these latter experiments show that our model is more robust than other models based on U-Net architecture. All the experiments and analyses presented in this work demonstrated that the explicit and adequate consideration of boundary information improves the results over non-boundary segmentation methods.

Unit testing is a specific type of test that deals with the smallest units in the system. It represents the first test after the implementation of a component. When the implementation of unit tests does not follow good practices, anti-standards are likely to be introduced in the code. Anti-patterns in tests, also known as test smells, are poor decisions for designing and implementing test code. Test smells impair the quality of the test code and reduce the ability of developers to interact with the test code, which makes it difficult to understand, read and, consequently, maintainability and evolution of the system. One strategy for removing test smells is to refactor the test code. Few studies in the literature offer automated support for the detection and refactoring of test smells. Thus, the present study provides a semi-automatic approach to identify test smells and suggest refactorings for the test code. Our approach is supported by the RAIDE tool (RefActorIng test Design Errors). We developed the RAIDE integrated with a development IDE to help test engineers refactor test code. RAIDE supports two test smells, Assertion Roulette and Duplicate Assert. Although there are other tools capable of identifying test smells, as far as we know, there is still no evidence of tools that provide automated support for refactoring test smells. In addition, no tool provides a user-friendly interface integrated with an IDE for the identification of test smells. To assess how our approach can help improve the quality of the test code, we also conducted an experimental study. We compared our tool with the state of the art and found that participants who used RAIDE were able to identify test smells more easily and quickly. In addition, RAIDE also proved to be efficient in refactoring test methods. Since RAIDE is a tool integrated with the Eclipse IDE, the identification and refactoring processes are faster and more intuitive compared to the state of the art and manual refactoring.

Data modeling in the temporal domain has been applied to different systems such as meteorology, engineering, medicine and economics. In this sense, the Fuzzy Time Series area has stood out due to its ability to approximate mathematical functions and linguistic variables to create rules, which are easier to be interpreted by specialists. As a consequence of this capacity, more accurate models can be obtained to understand the behavior of systems and, for example, to make predictions of future observations. Since the appearance of fuzzy time series models, researchers have been proposing improvements aiming, for example, to reduce errors in the forecasting task. In general, such tasks are composed of three stages: i) fuzzification; ii) fuzzy logical relationship; and iii) defuzzification. According to the literature, one of the points most studied by researchers is the fuzzification stage, with a focus on partitioning the universe of discourse for modeling fuzzy sets. However, few studies in the literature consider the separation of influences from the stochastic and deterministic components present in order to assist in the modeling process of fuzzy sets. In order to overcome this limitation, the present work presents two approaches that make improvements in the modeling of fuzzy sets in the fuzzification stage, considering influences of stochastic and deterministic components present in time series. In the first approach, the series was initially decomposed, separating its components into different monocomponents. Then, single components with high frequency were removed, resulting in a new smoothed series whose combination with the original series allows to generate a new scatter plot. The second approach was developed to model series with chaotic behavior.

In this sense, instead of modeling the series as a scatter plot, tools from the field of Dynamic Systems and Chaos Theory were used to reconstruct the series in phase space. Although using different methods, the two approaches remove the temporal dependence between the observations to assist in the modeling process of the fuzzy modeling sets, which is conducted considering the Fuzzy C-Means clustering method and cluster validation indexes, to estimate the quantity and function of fuzzy sets. In the experiments, time series with influences of stochastic and deterministic components were used to validate and test the generalization of the approach. The results achieved were promising and superior compared to the state of the art.

	Thèses
1	DIEGO ZABOT (SPIDe Kids): adapting an interaction codesign process for deaf or hard of hearing children participation Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : ECIVALDO DE SOUZA MATOS LYNN ROSALINA GAMA ALVES SORAIA SILVA PRIETCH Data: 22 mars 2019 Afficher le Résumé In 2006 the Brazilian Computer Society (Sociedade Brasileira de Computação - SBC) proposed five major challenges in computer research in Brazil. The fourth challenge, "Participatory and universal access of the Brazilian citizen to knowledge", involved Human-Computer Interaction (HCI). The objective of this challenge was to overcome the barriers that impeded access and interaction by designing systems, tools, models, methods, procedures and theories capable of giving Brazilian citizens access to knowledge. In addition, interactive technologies are penetrating into every aspect of daily life. In this context, interaction design raises questions about how to design systems that are suitable for all users. This research aimed to understand how to adapt an interaction codesign process for people who are deaf or hard of hearing (D/HH) participate. The starting point was the analysis of a participatory codesign process, SPIDe, with the objective of identifying, adapting and evaluating design solutions for the participation of D/HH in the entire process. A case study was conducted at a state college for deaf people with a group of six year old students to test the adapted process. In this case study, it was created an educational game by codesign with the children. As result, the children produced two video games, created by e for D/HH childrens, whose goals were to help deaf children’s alphabetization, teaching numbers systems and the alphabet. In addition, the children recognized themselves as producers and the games are imbued with elements of their culture. From this results, guidelines and indications were defined on how to conduct a codesign process with D/HH children and also about possible extensions of this process.
2	PAULO ROBERTO DE SOUZA RecTwitter: A Semantic Recommendation System for Twitter Users Leader : FREDERICO ARAUJO DURAO MEMBRES DE LA BANQUE : DANILO BARBOSA COIMBRA FREDERICO ARAUJO DURAO JOÃO BATISTA DA ROCHA JÚNIOR Data: 25 mars 2019 Afficher le Résumé Twitter is one of the most popular microblogging services today, which allows users to share images, links, texts, etc., as well as follow or not to follow preferred accounts, which are also managed by other users. Public opinion has to do with the differences in volume and volume of publication. In view of this, it is necessary to adopt intelligent schemas to identify and filter accounts that publish content similar to the interests of the target user. This dissertation offers a comment system for a subject with rules of conversation, which leads us to analyze the accounts of a user, and recommends those that are discontinued and new ones that are followed in a row. As rules work as recommendation engines, people are shaped through an interaction between Twitter users as they are modeled through a domain ontology. To evaluate the proposed model, experiments were carried out with real users and comparisons with works related to the state of the art. The results of the online experiments indicate that 76% of users were approved as a reference. In relation to the results obtained in the off-line experiments, one of the best models of the state of the art.
3	EUDES DIÔNATAS SILVA SOUZA An Architecture for Data Authoring and Web Publishing Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : CASSIO VINICIUS SERAFIM PRAZERES ARTUR HENRIQUE KRONBAUER RENATO DE FREITAS BULCÃO NETO Data: 27 mars 2019 Afficher le Résumé On the one hand, the simplicity of the HTML document standard was an initial design requirement, with the goal of rapidly expanding the Web by creating various HTML-authoring tools. On the other hand, this same simplicity also generates limitations for the use of the Web by machines due to the lack of structuring and semantics of HTML documents. In this context, the Semantic Web aims at extending the current Web, or the Web of Documents, such that the Web has become susceptible to interpretation and use by machines (e.g. software agents). To that end, the Web of Data attempts to describe the current Web’s data through structured documents that contain metadata and explicit semantics. In addition, rules for authoring and publication of structured and explicitly semantic data called Linked Data have been proposed for the Web of Data. However, the proposed document patterns in the Linked Data seems to undermine the initial goal of the HTML standard, i.e. simplicity. Thus, the aforementioned authoring process must now take into account that such simplicity is no longer a valid prerequisite for Web documents. Therefore, to generate and publish data for the Wed of Data just as it was for the Web of Documents, some studies have proposed methods and tools for the following: automatic generation of Linked Data from databases; semiautomatic extraction of Linked Data from HTML pages; and authored Linked Data for ordinary users. However, in relation to authoring and publication by ordinary users, these methods and tools are still being used in an ad hoc manner. That is, the previously mentioned studies present solutions that are domain specific such as social networks, content managers, wikis, blogs, etc. In this context, this work proposes a framework for data authoring and publication in the Web of Data according to the rules of the Linked Data standard while still focusing on ordinary users.
4	JÚLIA MADALENA MIRANDA CAMPOS A Restricted Logic Programming Approach to the Nursing Scheduling and Routing Problem. Leader : TIAGO DE OLIVEIRA JANUARIO MEMBRES DE LA BANQUE : TIAGO DE OLIVEIRA JANUARIO MARLO VIEIRA DOS SANTOS E SOUZA MAYRON CESAR DE OLIVEIRA MOREIRA Data: 26 avr. 2019 Afficher le Résumé The Home Care Service is characterized as a health care modality composed of a set of actions for prevention, rehabilitation and treatment of diseases, provided at home. This service has become increasingly present as a complementary health action, replacing hospitalization, as it offers a new modality of care to people with stable clinical conditions that require medical care. The Nursing Scheduling and Routing Problem is to determine in an integrated way the assignment of health professionals to patients, along with the routes of the vehicles that will transport the professionals. This problem is often resolved manually, making the process inefficient and often yielding unsatisfactory results. This dissertation presents a case study carried out with the Home Care Service team of the General Hospital of the State of Salvador, in order to propose a Constrained Logic Programming Model for the Nursing Scheduling and Routing Problem in order to maximize the number of patients attended and minimize the total distance traveled by health professionals.
5	BRAIAN VARJÃO GAMA BISPO Xangô: a framework for robust attribute selection to the problem of unbalance between classes in text classification tasks Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : TATIANE NOGUEIRA RIOS Ricardo Cerri RICARDO MARCONDES MARCACINI Data: 3 mai 2019 Afficher le Résumé Attribute selection is a widely used dimensionality reduction technique to deal with the difficulties associated with the "dimensionality curse" in text classification tasks. The most common attribute selection approach for textual databases is to weigh the relevance of each attribute to the learning process and to select the best-valued N's, where N is generally an empirically defined number. Although this strategy is widely applied, it can lead to the partial or complete exclusion of attributes essential for learning In this sense, the research presented in this paper aims to foster the generation of more reliable text classifiers through the improved Xangô, a framework for selecting attributes in tasks of classifying texts, whose selection process seeks to construct a reduced dimensional space where all classes are represented in a balanced way by the its most discriminatory terms. Experimental results indicate that Xangô is a framework that is adaptable to different state-of-art attribute selection methods, surpassing its individual performances in learning tasks performed under varied conditions, including multi-class problems, imbalance between classes, different classification algorithms and drastic dimensionality reductions.
6	Emilayne Feitosa Corlett The construction of the algorithm concept in Computing textbooks through the External Didactic Transposition Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : AYLA DÉBORA DANTAS DE SOUZA REBOUÇAS ECIVALDO DE SOUZA MATOS MARIA CAROLINA DE SOUZA SAMPAIO Data: 21 mai 2019 Afficher le Résumé In Brazilian Basic Education, the textbook is most often the main pedagogical resource used by the teacher in stimulating the construction of the knowledge by the students. This is due to the fact that the textbook is characterized as an object that directs the work of the teacher and important reference of studies for the student, besides being an object that does not require specific physical infrastructure to be used, reusable and free distribution in public education networks. In addition, the didactic book is elaborated with the intention of being a didatized version of the scientific knowledge for educational purposes, destined to the school public. This process of didatization is called External Teaching Transposition - an approach proposed by Yves Chevallard for the transformation of academic / scientific knowledge into teaching objects appropriate to the school context in which it is inserted. Considering the national movement for the insertion of the teaching of computation in the Brazilian school, the shortage of specific didactic books for the teaching of computation, as well as the scarcity of studies on the processes of didatization of the scientific knowledge of computation, this research had as objective to investigate the construction of the algorithm concept in computer books (used in school) through the External Didactic Transposition. This research followed a qualitative approach, in the documentary perspective, through the technique of content analysis carried out in three works: two textbooks and an academic book used as conceptual reference. Demonstrations of external didactic transposition of the algorithm concept were verified in the two plans of analysis made possible in the research, both in the field of "knowing wise" and in "knowing how to teach". The study showed that the transposition of scientific knowledge into "knowing to teach" required special care so that this knowledge did not become just a summary of the original concept, because changes and simplifications of scientific knowledge were found when transposed to computer textbooks.
7	ALBERTO PIETRO SIRONI "VISUAL INTEGRATION AND MINING PUBLIC HEALTH DEDICES: DECASION STUDY IN MALARIA" Leader : MARCOS ENNES BARRETO MEMBRES DE LA BANQUE : DANILO BARBOSA COIMBRA MARCOS ENNES BARRETO VANDERSON DE SOUZA SAMPAIO Data: 29 mai 2019 Afficher le Résumé In the last decades, the need to store and process large volumes of data, generated daily under the most varied sources and commonly referred to as big data, has grown significantly. The importance of these data increases as they are integrated to enable the execution of studies and analyzes, to support decision-making processes and to generate comprehensive information about a particular domain. The integration and processing of large volumes of data impose considerable computational complexity. The analysis of these data, for example, requires machine learning and visualization techniques to allow the correct identification of the information present in these data, not always easily perceived. In public health, specifically in the context of malaria, there is a worldwide effort to eradicate the disease, since it still has high incidence rates. This research is part of a project that deals with integration techniques and visual data mining applied in a malaria surveillance ecosystem in the construction of an integrated database comprising notifications, deaths, vector control and climatic data. This database is accessed through Malaria-VisAnalytics, a mining platform for descriptive, predictive and visual analysis, assisting public policy decision-making by government and health stakeholders. Our experimental and validation results favor interaction and visual exploration allowing effective surveillance. Finally, our study can be easily extended with new features and data sources to accommodate more complex scenarios.
8	NILTON FLÁVIO SOUSA SEIXAS PRIMO: An ICN Model for Preventing Denial of Service Attacks by Flooding of False Interests Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : LEOBINO NASCIMENTO SAMPAIO MAYCON LEONE MACIEL PEIXOTO ANTONIO AUGUSTO DE ARAGÃO ROCHA Data: 6 juin 2019 Afficher le Résumé The current Internet architecture is designed to share resources available to communities. However, the way the Internet is used today has made the architecture incompatible with the explosive demand for distribution and data collection. Network users are interested in the content and not in their host. The Named Data Networking (NDN) is an Internet architecture that adapts to this new demand, reducing network traffic caused by the need to obtain the same content, and changes the orientation of data security , which currently secures the connections by which they will be transported, into the data itself, making the architecture resilient to many recurring problems and difficult to treat in the current architecture, such as the denial of service attack. However, these attacks have been adapted to this architecture and it is necessary to mitigate them so that the architecture is considered safe. In this context, PRIMO is proposed, an NDN-based model that forges a router-producer collaboration to detect, mitigate and prevent Denial of Service attacks by Flood of Interests (IFA). The components of the PRIMO model cooperate to: (i) detect an ongoing IFA attack, (ii) mitigate the attack by distinguishing between legitimate and false interests, (iii) prevent attack from reaching the core of the network after mitigation, and the network of new instances of attacks with prefixes already used. The experimental results show that PRIMO is effective in: detecting the attack, mitigating it by distinguishing between legitimate and false interests, preserving network performance and preventing further occurrences of an ongoing attack. The contributions of this dissertation are: i) a model with collaborative mechanisms for the detection, mitigation and prevention of IFA and ii) a mechanism for identifying false interests.
9	ANTONIO BATISTA DE OLIVEIRA NETO Prototyping and validating the CORA ontology and its reference architecture Leader : MARCOS ENNES BARRETO MEMBRES DE LA BANQUE : MARCOS ENNES BARRETO LAIS DO NASCIMENTO SALVADOR FLAVIO MORAIS DE ASSIS SILVA Data: 12 juin 2019 Afficher le Résumé Ontologies play an important role across several domains as they represent and define categories, properties and relationships among concepts, data and entities existing in those domains. Specially in Robotics, ontologies can be used as a standard way to represent and share knowledge and reasoning among autonomous agents. CORA (core ontology for robotics and automation) is a standard ontology developed by the IEEE ORA working group mapping main concepts and axioms from the robotics and automation (R&A) domain. After CORA has been approved as an official IEEE standard, ORA was split in different sub-groups addressing, among other topics, task representation (RTR) and autonomous robots architectures (AuR). This work presents a simulation study on a multi-robot reconnaissance application intended to serve as proof-of-concept on the feasibility of the CORA ontology and its related architecture. We show how robots and tasks are represented and dynamically mapped into the ontology, as well discuss some technical aspects of our simulation when dealing with the proposed architecture. Our results show the proposed ontology (CORA) and its associated reference architecture (ROA) are able to be correctly mapped during application runtime.
10	ANTONIO MATEUS DE SOUSA MOBILITY MANAGEMENT IN VEHICLE DATA NETWORKS NAMES BASED ON STABILITY FROM THE LINK Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : ANTONIO ALFREDO FERREIRA LOUREIRO LEOBINO NASCIMENTO SAMPAIO MAYCON LEONE MACIEL PEIXOTO Data: 19 juin 2019 Afficher le Résumé Vehicular Ad Hoc Networks are cooperative networks that focus on provide to drivers, passengers and pedestrians more security and comfort at the same time. On the other hand, the mobility scenario of such networks are still complex to the current protocols, which difficult the communication service providing. Vehicular Named Data Networking (VNDN) emerges as an alternative to the conventional Vehicular Ad hoc Networks (VANETs) which persist in using the IP-based addressing scheme to locate and consume services. Several studies have already demonstrated the inefficiency of the TCP/IP architecture in mobility scenarios such as VANETs, characterized by unpredictability, dynamicity, and intermittent communication. In this way, VNDNs aim to overcome the limitations related to TCP/IP architecture by introducing a new demand and consumption model that is centralized in content rather than in the producer of it. Such features are inherent in the Named Data Networking (NDN) architecture, which present a native support for mobility. However, NDNs do not cope with the temporal scope related to VANETs, where both the contents and connections are of limited duration as well as do not handle the broadcast storm of packets. Therefore, this work proposes two new interest forwarding strategies for VNDN that make use of the native characteristic of VANETs, estimated link lifetime, to mitigate the broadcast storm of interest packets as well as its side effects. In our strategies, the routing of interests is carried out in a controlled and selective manner, which only those who are classified as able to be forwarders will forward the packets. This behavior is determined by the use of link lifetime in the forwarding decision-making process. Based on this, we can summarize the contributions of this work as follows: i) propose a new efficient routing strategies for VNDN; ii) mitigation of broadcast storm of interest/data packets; iii) experimental evaluation on the effectiveness of link life in routing. The results show that the proposed strategies reduces significantly the number of interest retransmissions, which also alleviates the number of data packets when compared to the other strategies. Additionally, both levels of interests satisfaction and delays remained stable.
11	MARIA CLARA PESTANA SARTORI Using Context to Support Task Distribution in Mobile Crowdsourcing. Leader : VANINHA VIEIRA DOS SANTOS MEMBRES DE LA BANQUE : ARTUR HENRIQUE KRONBAUER FREDERICO ARAUJO DURAO VANINHA VIEIRA DOS SANTOS Data: 27 juin 2019 Afficher le Résumé In Mobile Crowdsourcing, tasks are distributed to workers according to sensor data, aiming to solve problems using collective intelligence. Requesters create tasks and distribute to the crowd workers in order to solve collectively a certain issue. Matching a task to the appropriate worker is an open issue in the area. This task distribution influences the quality of the workers’ response or the task acceptance rate. In this research we investigate the usage of context on mobile crowdsourcing, and propose ConTask, a context-aware approach to support task distribution. ConTask comprises a context model and a mobile app, which extends the task model proposed by Mrazovic and Matskin (2015) implementing the five context dimensions proposed by Zimmermann et al (2007): Individuality, Time, Relation, Activity, and Location. Additionally, ConTask presents a task distribution architecture, instantiated in the ConTask app prototype, which aims to support task matching to crowd workers according to the task's contextual requirements as defined by the requesters. The ConTask App was implemented according to the proposed model and achitecture, using popular sensors for each context dimension. Experiments were performed with different users in a University, and provided some insights regarding sensors usage; the accuracy of the implemented task distribution was 63%, with 73% precision and 63% revocation. To evaluate the ConTask model, we instantiated an performed an investigation with users in the University Administration domain. This experiment aimed to get insights about how the context dimensions influence tasks' acceptance from the crowd workers point of view. The results illustrated some tendencies in the influence of context usage in the worker willingness to perform the task; such as the higher the distance between the worker and the task location the lower the willingness of the worker on performing the task; the activity the worker is performing when receive the task request influences its acceptance, such as the activities of "waiting" or "talking" in contrast to "on cellphone" and "working"; the time of the day the worker received the task, such as morning versus night; the worker expertise regarding the task requirements; and the social relation between the worker and the task requester.
12	KARLA MALTA AMORIM DA SILVA The Lehman's Laws in Highly-Configurable Systems: An Empirical Study on the Linux Kernel Variable Features” Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : EDUARDO SANTANA DE ALMEIDA LEOPOLDO MOTTA TEIXEIRA RODRIGO ROCHA GOMES E SOUZA Data: 30 juil. 2019 Afficher le Résumé Background: Highly-configurable systems (HCS) provide a set of features which can be combined in order to generate a family of different products. These features can be either common to all products or variable features - present only in some of the generated products. When evolving this kind of system, we have to deal with the modification of both common and variable features, which may be different regarding their evolutionary behavior. Thus, the Lehman's Laws of software evolution may not hold in all cases. Objective: In this study, our objective is to evaluate the validity of Lehman's Laws in the context of variable features of an open source HCS - the Linux kernel. Methods: Our approach consists of a replication of an empirical study, where we reused the original design but modified the type of subject in order to observe if the results hold in a different context. We analyzed bug reports, commits, and source code of 47 releases of the Linux kernel launched over a period of 23 years of evolution. Results: Results show that half of the laws evaluated were supported, namely, Continuing Change, Continuing Growth and Conservation of Familiarity laws. Conversely, the laws Increasing Complexity, Declining Quality and Conservation of Organizational Stability were not supported for the variable features of the Linux kernel. Conclusions: This study brings together the available evidence of the validity of the Lehman's Laws in an HCS and discusses how its open source nature and domain of application influence the results. We reflect upon the implications of our results and propose a set of recommendations to the HCS community in order to help improving the evolution of variable features.
13	ELTON FIGUEIREDO DA SILVA INTEGRATION OF DDM PRACTICES IN THE AGILE PROCESS: ASSESSING ASPECTS OF SOFTWARE EVOLUTION. Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : ANA PATRICIA FONTES MAGALHÃES MASCARENHAS LEONARDO GUERREIRO AZEVEDO RODRIGO ROCHA GOMES E SOUZA Data: 6 août 2019 Afficher le Résumé In software development that uses some agile method, the source code, rather than documentation, is the main and most up-to-date artifact. In contrast to this reality, the Model Driven Development (DDM) approach has the model as the main artifact of the development process. Intending to get th advantages of both approaches, (SALES, 2017) developed the ScrumDDM meta-process (agile MDD) that added modeling and model transformation practices coming from model driven development to the Scrum framework. Unlike the work of (SALES, 2017), this paper will investigate whether the ScrumDDM metaprocess preserves the characteristic agility of SCRUM, supports software evolution through user stories and documentation provided by DDM using for that aexisting software and developed by ScrumDDM, as well as whether this metaprocess is effective in creating new processes that integrate SCRUM and DDM. To evaluate the metaprocess regarding its ability to evolve user stories and the agility in software development, a controlled experiment was developed. To evaluate the generalization of metaprocess, in turn, a new software process from the academic literature was instantiated from this metaprocess. Through the controlled experiment developed in this work, it was possible to demonstrate that the evaluated metaprocess supported the software evolution through DDM documentation. It has also been shown that development agility has been enhanced through the transformations and models of the DDM approach. In addition to the results cited, at the end of the development process, it was observed that, both the project code and documentation, were up to date.
14	RAFAEL HENRIQUE TIBÃES RECOGNITION OF DIGITAL PRINTING UNDER SCALE VARIATIONS FOR IDENTIFICATION OF DISAPPEARED CHILDREN Leader : MAURICIO PAMPLONA SEGUNDO MEMBRES DE LA BANQUE : LEONARDO GOMES MAURICIO PAMPLONA SEGUNDO RODRIGO MINETTO Data: 20 août 2019 Afficher le Résumé Since the fingerprint of a newborn individual grows to adulthood, the fingerprint of this individual will present different scales when captured over time. The literature comparison methods do not They comprise a robust range of variation of the image, preventing automatic comparison of fingerprints of a same individual collected when newborn and after its growth. For this reason, this paper presents a solution to calculate the thickness of the ridges and valleys of the fingerprint, and thus calculate their average frequency. The ratio scale between two fingerprint images is estimated based on the average frequency for each image, allowing resizing these images to a common frequency, which enables the use of comparison algorithms of literature to perform recognition over time. In experiments using a synthetic database, our approach EER rate obtained for a 0.09 ratio scale 4 times, while the same comparison algorithm without resizing the images you get the EER rate of 0.5, i.e. similar to a random hit.
15	MICHELL FELIPPE FERNANDES MACEDO QUEIROZ Matheuristics for the minimum weighted feedback vertex set and $b$-coloring problems Leader : RAFAEL AUGUSTO DE MELO MEMBRES DE LA BANQUE : CARLOS EDUARDO DE ANDRADE CELSO DA CRUZ CARNEIRO RIBEIRO MARCIO COSTA SANTOS RAFAEL AUGUSTO DE MELO TIAGO DE OLIVEIRA JANUARIO Data: 24 sept. 2019 Afficher le Résumé In this thesis, we propose new matheuristics for two NP-hard graph optimization problems. The first studied problem is denoted the minimum weighted feedback vertex set problem (MWFVS), for which given a weighted graph $G=(V,E)$ it consists in obtaining a minimum weight subset $F\subseteq V$ of the vertex set whose removal makes the graph acyclic. Differently from other approaches in the literature, we tackle this problem via the maximum weighted induced forest problem (MWIF). First, we propose two new compact mixed integer programming (MIP) formulations, using a polynomial number of variables and constraints. Next, we develop a matheuristic that hybridizes a multi-start iterated local search metaheuristic with a MIP-based local search procedure. This thesis also studies the $b$-coloring problem, for which given a graph $G=(V,E)$ it consists in attributing a color to every vertex in $V$ such that adjacent vertices receive different colors, every color has a $b$-vertex, and the number of colors is maximized. A $b$-vertex is a vertex adjacent to vertices colored with all used colors but his own. The optimal solution of the $b$-coloring problem determines the $b$-chromatic number of $G$, denoted $\rchi_b$. We present an integer programming formulation and a very effective multi-greedy randomized heuristic which can be used in a multi-start fashion. Furthermore, a matheuristic is proposed combining the heuristic with the integer programming formulation. Extensive computational experiments show that the proposed techniques for both problems outperform the state-of-art metaheuristics for the majority of tested instances.
16	DIEGO BARBOSA ARIZE SANTOS Towards an accurate energy forecast given uncertainties in Smart Grids Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : MARCOS ENNES BARRETO MATHEUS GIOVANNI PIRES TATIANE NOGUEIRA RIOS Data: 14 oct. 2019 Afficher le Résumé Managing energy generation and its consumption is usually a difficult task for electric power grids systems, given the great volume of data. Moreover, the data obtained from these systems is tied to uncertainty occasioned by seasonality and natural environment disturbances. Therefore, efforts have been made on the construction of Smart Grids, i.e. intelligent energy networks, which combine Computational Intelligence with the electricity power grids, in order to improve the balance between energy generation and its consumption. Smart Grids require powerful controllers to keep the balance of energy generation and its demand. Those controllers have to be aware of future loads, and the prediction of this data must be very accurate to provide efficient decision support. Since Smart Grid's energy generation and consumption data varies over time, following a time series distribution, time series forecasting methods can yield predictions to support those controllers on the decision-making process. Nevertheless, forecasting over data that tied to uncertainty may have some disturbances. In order to overcome those issues, on this work, we investigate time series forecasting methods, mapped in a systematic literature review, aiming to deliver accurate forecasts even for uncertain data. Towards finding a method for that, we present a comparison study over different time series forecasting methods to evaluate which one would achieve better accuracy in energy distribution. The compared methods were the Adaptive Fuzzy Neural Network (ANFIS), Recurrent Neural Networks (RNN), Support Vector Regression (SVR), Random Forest and SARIMAX. The ANFIS algorithm had outperformed the other approaches, delivering more accurate results. From that comparison, we have proposed a framework that combines the ANFIS' fuzzification step with well-known forecasting methods to improve their performances on forecasting under uncertain energy data.
17	JOÃO ALBERTO CASTELO BRANCO OLIVEIRA Fuzzy Software Analyzer (FSA): A new approach to interpret and analyze software repositories Leader : RICARDO ARAUJO RIOS MEMBRES DE LA BANQUE : RICARDO ARAUJO RIOS CLAUDIO NOGUEIRA SANT ANNA TATIANE NOGUEIRA RIOS CRISTIANO HORA DE OLIVEIRA FONTES Data: 31 oct. 2019 Afficher le Résumé Software metrics have been widely adopted during the process of developing source code, allowing to manage projects with higher quality by quantifying different characteristics that may impact, for example, the project costs and the planned activities. In general, such metrics can be divided into direct and indirect. The direct ones are commonly used in software development and maintenance. Those measures are directly extracted from source codes such as, for instance, the number of written lines and the total number of errors collected in a given time instant. The indirect measures, on the other hand, are obtained after performing some interpretation of the project. They may also include direct metrics and were developed to be generally applied to different software projects. However, the interpretation of these metrics depends on the experience of the project manager (domain specialist) and must consider specific characteristics related to the team of developers. In this project, we present a new approach, referred to as Fuzzy Software Analyzer -- FSA, designed to automatically extract characteristics and patterns in software repositories, aiming at: i) assisting the expert during the task of interpreting metrics, especially by working on large volumes of source codes; and ii) monitoring the evolution of the software as new releases are submitted to the repositories. The proposed approach was assessed by analyzing the Linux Test Project repository, demonstrating its usefulness and applicability.
18	EDISON DE JESUS SANTOS Process Smell : Um Catálogo de Bad Smells para Processos de Software. Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : ANA PATRICIA FONTES MAGALHÃES MASCARENHAS IVAN DO CARMO MACHADO RITA SUZANA PITANGUEIRA MACIEL Data: 28 nov. 2019 Afficher le Résumé The systematic use of the software process favors the quality of the generated product and guides the steps for the construction of the adherent software to the expected quality. Processes are commonly specified and represented by software process modeling languages known as Software Process Modeling Language (SPML). Among these languages, Software & Systems Process Engineering Metamodel (SPEM) stands out for being a Unified Modeling Language (UML) profile for modeling software processes and systems. Software processes evolve together with the needs of the institution and professionals who use it, needing to be monitored and evaluated constantly to maintain its qualities. Currently, four forms of process assessments are best known among industry practices and the software engineering literature, namely: (i) product assessment against the process, (ii) checks associated with assessment models, (iii) validation through formal specifications and (vi) systemic view of human-centered software engineering. Although it is possible to apply any of the four forms of assessments mentioned above to guarantee the evolution and quality of the software process, they have some characteristics that can be inconvenient. Firstly, these forms of evaluation do not accurately detect the elements of the process that potentially have the most negative impact on their quality. According to these evaluations, data obtained after the execution of the process are used, so it is only possible to verify problems in the process after the execution of the same, which may already cause some possible losses. More specifically, when these forms of evaluation are carried out using simulation, they require specific knowledge for the execution of the simulation procedures and the use of symbolic data that simulate the process data after its execution. Although SPML like SPEM are used to improve the understanding of a process, the specification of a process can be done inappropriately hurting factors of quality of the process. This phenomenon can be compared to the concept of bad smells, which are design problems in software code and in this work it will be presented as a process smell for observing problems in the process design. Thus, a process smell in the specification of a process can negatively impact the quality of the process and consequently affect the quality of the software product.
19	LUIS EMANUEL NEVES DE JESUS FEATURE INTERACTION IN ENVIRONMENTS SMART: AN APPROACH BASED ON DATA ANALYSI Leader : DANIELA BARREIRO CLARO MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO IVAN DO CARMO MACHADO CELIA GHEDINI RALHA Data: 16 déc. 2019 Afficher le Résumé The growth of the Internet of Things has allowed many devices to be daily connected. Each device has its own action rules and its peculiarities that can be monitored by a domain expert. Many of these devices are also configured to work in conjunction with other devices in controlled environments. With the advancement of the Internet of Things, it becomes more difficult to manage when devices are connected and/or disconnected from the environment. Moreover, with the presence of a new device, such as restricted usage rules for a controlled environment, may be violated and cause unexpected behavior in the environment. These cases are related to feature interaction. Feature Interaction occur when two or more devices/services generate instability in the environment due to input or output of a device. In dynamic environments such as IoT, device inclusion and exclusion makes it impossible for an expert to maintain these rules. According to the systematic mapping review performed, most works use the rules applied by the expert, requiring a method that allows to detect these feature interactions automatically. In this sense, the main objective of the present work is to detect features interacionista based on data analysis. Feature interaction data sets were annotated to obtain a model for automatically identifying rules. Experiments were performed and the results showed evidence of automatic detection of resource interactions in similar or complementary domains.
20	RUIVALDO AZEVEDO LOBÃO NETO Using Markov Chains and Radial Base Function Networks for Online Concept Change Detection in Streaming Data Leader : RICARDO ARAUJO RIOS MEMBRES DE LA BANQUE : GLORIA MILLARAY JULIA CURILEM SALDIAS MARCELO MAGALHAES TADDEO RICARDO ARAUJO RIOS Data: 16 déc. 2019 Afficher le Résumé The amount of information produced by computer systems has grown dramatically in recent decades. A significant portion of this information is produced as continuous streams, which are constant and potentially infinite sequences of data. These flows are mostly non-stationary and may change in the distribution of data or in the context of the generating process. These changes are called concept changes and can negatively impact the performance of applied learning models. To mitigate this problem, researchers have been developing specialized methods for detecting concept changes. However, the proposed methods have limitations when applied in some continuous flow scenarios, such as the need for expert labeling and the inability to meet the processing time and computational resource constraints of these scenarios. Aiming to overcome these limitations, this paper presents a new concept change detection method called RBFChain, based on Radial Base Function Networks (RBF) and Markov Chains. Briefly, RBF networks perform, in their middle layer, an activation process that implicitly produces groups from the observations received over time. In addition, Markov chains allow us to model the center transitions that occur in such observation groups. Concept changes are then detected when the active center of the cluster changes and the probability of transition in the Markov model exceeds a threshold. The presented method differs from existing works by detecting changes in runtime, computationally efficient and independent of labels. To evaluate the RBFChain method as a viable concept change detector, a sensitivity, precision and noise tolerance analysis was performed using synthetic data sets, and their results were compared with the main algorithms available in the literature. In addition, the technique was applied to a real fixation and saccade classification problem in eye tracking activity. With this application, it was possible to investigate and propose a solution to a relevant problem involving the area of Neuroscience and Computation. The results obtained with the synthetic data sets suggest that the RBFChain is statistically better or equivalent to the main detectors present in the literature. Moreover, the developed technique presented good results when applied to the ocular monitoring problem, being able to classify fixations and balconies in real time and with precision equivalent to the state of the art.
21	WITÃ DOS SANTOS ROCHA SEMANTIC SIMILARITY OF ATTRIBUTES TO CLOUD DATA: A CASE STUDY IN MIDAS Leader : DANIELA BARREIRO CLARO MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO FRANK AUGUSTO SIQUEIRA LAIS DO NASCIMENTO SALVADOR Data: 16 déc. 2019 Afficher le Résumé Large amounts of heterogeneous data produced by social networks, Internet-connected devices, and web applications has been stored and managed in different formats at distinct service levels Data-as-a-Service (DaaS) and Database-as-a-Service (DBaaS). This solutions provides on demand services via Application Programming Interface (API). To access these distributed cloud services, users in most cases face data ambiguity issues. To solve these problems and to avoid performing complex and laborious tasks to access this data, an automated solution that serves as an intermediate layer for establishing communication between SaaS and DaaS / DBaaS service levels is required. This solution allows SaaS service level consumers to access their data stored at different DaaS service levels through a single query. The middleware layer called Middleware for Interoperability Between SaaS and DaaS (MIDAS) provides this solution to users in a transparent way. Over time, DaaS service level parameter updates can semantically affect query terms (that are automatically or not automatically) configured for applications at SaaS service levels. To ensure that cloud consumers continue to access DaaS data, an approach that ensures semantic similarity between DaaS parameters to maintain the reliability of the original request is recommended. Our work proposes a method (SM) with two similarity ways: (i) edge counting (Cosine and Jaccard) to measure the similarity between two attributes and the Information Content(IC) to measure similarity based on knowledge through a WordNet corpus. The IC method is used in the model to fill the limitations of the edge counting method. To choose the edge count methods (Cosine and Jaccard), an environment with eight distance measurement algorithms, twenty-two authentic parameters from eleven DaaS providers, and five possible parameter change situations was simulated. As a proof of concept, our model is implemented in MIDAS to evaluate three criteria: overload, performance and correctness. The results of our experiments showed that we are in the first direction to provide semantic interoperability between SaaS and DaaS in MIDAS.
22	ROSELANE SILVA FARIAS DESIGNING SMART CITY MOBILE APPLICATIONS: A GROUNDED THEORY Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ EDUARDO SANTANA DE ALMEIDA ELISA YUMI NAKAGAWA Data: 20 déc. 2019 Afficher le Résumé The software architecture community has played a crucial role in the development of mobile software. Many of the ideas used in the design of these systems came from traditional software architecture and those ideas have contributed to mobile computing becoming ubiquitous. Mobile applications in the context of smart cities are very challenging since they need to operate within the power, processor, and capacity limitations of mobile devices, the exacting demands of life critical smart city requirements, and the constantly changing and exposed environment which may not always be trusted. Since there are no widely accepted design models for this type of software, developers must resort to primitive design decisions to meet all the needs of these applications, which takes additional time and expertise. For this reason, this study aims to investigate the design process for mobile applications in the context of smart cities. In order to address the lack of verified information about designing mobile apps, we conducted a multi-case study with 9 applications from 4 different development groups to build a grounded theory. The applications were reverse engineered to expose the architecture of each application. Based on all the data, an emergent grounded theory was constructed to explain how the selected design process produces an app with the desired characteristics. The grounded theory developed through this research, and the process by which the theory was developed, were subjected to an evaluation process developed from the grounded theory literature. That evaluation validated the experimental process and verified that the experimental results followed from the correct use of the experimental process. The evaluation also addressed some of the threats to validity such as investigator influence. To further ensure validity this process included gathering data from additional projects using the experimental process. The resulting theory offers explanations for how software engineering teams design mobile apps for smart cities. This knowledge will serve as a basis to further understand the phenomena and advances towards more effective design and development process definitions.
23	JURACY BERTOLDO SANTOS JÚNIOR DEVELOPMENT AND VALIDATION OF PREDICTIVE MODELS FOR THE MALARIA EPIDEMIC. Leader : MARCOS ENNES BARRETO MEMBRES DE LA BANQUE : MARCOS ENNES BARRETO TATIANE NOGUEIRA RIOS VANDERSON DE SOUZA SAMPAIO Data: 20 déc. 2019 Afficher le Résumé Data mining techniques allow extracting relationships and features present in data that are not easily noticeable, especially in scenarios involving large volumes of data. These techniques allow grouping and classify data, identify patterns and exceptions, as well as establish association relationships between different data. THE Predictive analytics is a sub-area of data mining that uses machine learning techniques to implement of predictive analytical models. Such models are based on prior knowledge of the data (descriptive analysis) to design possible future events about the context being analyzed. This work is an integral part of a research project which aims to integrate malaria reporting data from different sources of information and develop models of predictive analysis on these data. Specifically, this work involves the study of different prediction techniques based on in models and data and, based on this study, the development of predictive models for the epidemiological scenario of malaria in Brazil. Four case studies involving socioeconomic and epidemiological data are discussed as evidence of concept of the proposed models. The results show that the models developed have an accuracy variable between 60% and 85% for the tested scenarios, with prediction windows of up to 3 weeks and considering the large variation observed in the malaria cases in the last four years
24	RICARDO BRASIL TEIXEIRA SHARED RESOURCES IN MULTIPROCESSOR REAL-TIME SYSTEMS SCHEDULED BY RUN Leader : GEORGE MARCONI DE ARAUJO LIMA MEMBRES DE LA BANQUE : ALIRIO SANTOS DE SA ERNESTO DE SOUZA MASSA NETO GEORGE MARCONI DE ARAUJO LIMA PAUL DENIS ETIENNE REGNIER Data: 20 déc. 2019 Afficher le Résumé A hard real-time system can be de􏰌ned as the one for which all its tasks must meet their deadlines whenever the system runs. This requirement makes the scheduling algorithm a key element for system correctness. Ideally, the scheduling algorithm employed must both exhibit low overhead (e􏰎ciency) and ensure that no task deadline is missed whenever this can be ensured by some scheduling algorithm (optimality). RUN (Reduction to Unipro- cessor) is an algorithm capable of e􏰎ciently and optimally scheduling a set of strictly periodic tasks on a multiprocessor platform when tasks do not share any resources but processors. Although it has already been shown that RUN is compatible with resource sharing, the only existing solution prevents preemptive access to shared resources. Unlike this approach, which can be considered too restrictive due to its poor schedulability, we used MrsP (Multiprocessor resource sharing Protocol) as a more 􏰍exible resource sharing mechanism. Making the rules of both RUN and MrsP compatible to each other was thus our main goal. The derived solution was implemented on Linux Textbed for Multi- processor Scheduling in Real-Time systems (LitmusRT), namely a Linux-based real-time operating system. We proposed a new task packaging heuristic and performed experi- mental evaluations comparing our solution with the existing one. The results showed that the proposed solution presented better results in terms of schedulability and number of migrations and preemptions.
	Thèses
1	CRESCENCIO RODRIGUES LIMA NETO An Approach for Recovering Architectural Variability from Source Code Leader : CHRISTINA VON FLACH GARCIA CHAVEZ MEMBRES DE LA BANQUE : ALESSANDRO FABRICIO GARCIA CLAUDIO NOGUEIRA SANT ANNA IVAN DO CARMO MACHADO MANOEL GOMES DE MENDONCA NETO THELMA ELITA COLANZI Data: 19 févr. 2019 Afficher le Résumé Software Product Line Engineering (SPLE) has been widely adopted for applying systematic reuse in families of systems. Given the high upfront investment required for SPLE adoption, organizations commonly start with more opportunistic reuse approaches (e.g., a single system that they clone and modify). However, maintenance problems appear when managing a large number of similar systems where each of them implements and evolves particular characteristics. One viable solution to solve this issue is to migrate to SPLs using an extractive approach. This initiative, in its early phases, includes the definition of a Product Line Architecture (PLA) to support the derivation of product variants and also to allow customization according to customers' needs. In this way, the systematic use of Software Architecture Recovery (SAR) techniques enables PLA recovery and keeps the PLA aligned with development. Our objective is to provide an automatic approach to recover PLAs and guidelines to support the PLA recovery. We gathered knowledge by means of literature reviews and exploratory studies to characterize the state-of-the-art and identify research gaps on SAR techniques and tools that support the recovery of architectural variability information from source code for a family of products. The use of SAR techniques and tools to recover a PLA that documents variability information at the architecure level may address issues related to SPL adoption, design and evolution. Unfortunately, few studies investigate PLA recovery and also provide empirical evaluation. One of the main issues in the extractive approach is the explosion of the variability in the PLA representation. Our approach is based on identifying variability on architectural level by extracting information from variants' source code. To evaluate our approach, we performed a set of empirical studies.
2	LARISSA ROCHA SOARES Feature Interactions in Highly Configurable Systems: A Dynamic Analysis Approach with Varxplorer Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : IVAN DO CARMO MACHADO RODRIGO ROCHA GOMES E SOUZA EDUARDO SANTANA DE ALMEIDA CHRISTINA VON FLACH GARCIA CHAVEZ MARCIO DE MEDEIROS RIBEIRO EDUARDO MAGNO LAGES FIGUEIREDO Data: 21 févr. 2019 Afficher le Résumé Highly configurable systems (as known as software product lines) provide significant reuse opportunities by tailoring system variants based on a set of features. Those features can interact in undesired ways which may result in faults. However, most interactions are not easily detectable as specifications of feature interactions are usually missing. The feature interaction problem has been a challenging subject for years. Despite the existence of studies to map out available evidence on feature interaction for single systems development, there is a lack of understanding on common strategies, activities, artifacts and research gaps for interactions in configurable systems. Thus, this thesis initially gathered systematic mapping study evidence by analyzing 40 feature interaction primary studies, which were classified according to development lifecycle stages and the feature interaction solution presented, either detection, resolution or general analysis. Recent analyses focused on detecting feature interaction bugs from global specifications, i.e., specifications that all configurations of a configurable system need to fulfill, such as requiring that each configuration does not crash. However, specifications at the feature level are usually missing and, then, many approaches may not detect all incorrect system behavior, specially bugs not covered by global specifications and bugs that do not result in a crash or other easily observable behavior. Instead of starting from a set of specifications like most approaches, we propose to inspect feature interactions as they are detected and incrementally classify them as benign or problematic. We aim to provide an inspection process that helps developers to distinguish intended interactions from interactions that may lead to bugs. We use variational execution to observe internal interactions on control and data flow of highly configurable systems. To help developers understand these interactions, we propose feature-interaction graphs as a concise representation of all pairwise interactions. We provide two analyses that provide additional details about interactions, namely suppress and require interactions. Our approach and tool, VarXplorer, provide an iterative analysis of feature interactions allowing developers to focus on suspicious cases. Finally, we perform two empirical studies to evaluate the inspection process and how feature interaction graphs can help users identify suspicious interactions. The first study is a controlled experiment to investigate and compare the ability of users when identifying suspicious interactions with and without VarXplorer, in a setting composed of different systems, performing different tasks. The second study focuses on the iterative process of test cases execution and how it can be used for a faster and more objective feature interaction analysis. Sistemas altamente configuráveis (também conhecidos como linhas de produtos de software) fornecem oportunidades significativas de reuso, uma vez que eles adaptam variantes do sistema com base em um conjunto de features. Essas features podem interagir de formas indesejadas, resultando em falhas. Além disso, a maioria das interações não é facilmente detectável, já que especificações de interações entre features geralmente não são definidas, especificadas e documentadas em um projeto de software.O problema da interação entre features tem sido um assunto desafiador por anos. Apesar da existência de estudos que mapeiam essas interações, ainda não há muitos trabalhos sobre a compreensão de estratégias, atividades, artefatos e lacunas de pesquisa para interações em sistemas configuráveis. Desta forma, esta tese provê inicialmente um mapeamento sistemático de estudos por meio da análise de 40 trabalhos, os quais foram classificados de acordo com os estágios do ciclo de vida de desenvolvimento e a solução de interação apresentada (detecção ou resolução de interações). Análises recentes têm focado na detecção de erros de interação de features a partir de especificações globais, ou seja, especificações que todas as configurações de um sistema configurável precisam cumprir. No entanto, especificações no nível de features ou interações são geralmente negligenciadas e raramente documentadas. Neste cenário, muitas abordagens não conseguem detectar todos os problemas de comportamento do sistema, especialmente erros não cobertos por especificações globais e erros que não resultam em uma falha ou outro comportamento facilmente observável.Ao invés de partir de um conjunto de especificações como a maioria das abordagens, propomos inspecionar as interações de features `a medida que são detectadas e classificá-las gradativamente como benignas ou problemáticas. Nossa abordagem e ferramenta, VarXplorer, fornece um processo de inspeção que ajuda os desenvolvedores a distinguir as interações intencionais das interações que podem levar a bugs. Usamos a execução variacional para observar interações internas ao fluxo de controle e fluxo de dados de sistemas altamente configuráveis e propomos gráficos de interação de features como uma representação concisa de todas as interações entre pares de features. Por fim, realizamos dois estudos empíricos para avaliar como o processo de inspeção e os gráficos de interação de features podem ajudar os desenvolvedores a identificar e entender interações suspeitas. O primeiro é um experimento controlado que investiga e compara a capacidade dos desenvolvedores ao identificar interações suspeitas com e sem o VarXplorer. O segundo foca no processo iterativo de execução de casos de teste e como ele proporciona uma análise de interações mais rápida e objetiva.
3	ROBESPIERRE DANTAS DA ROCHA PITA CLUSTERING CATEGORICAL DATA USING THE FREQUENCY FACTOR Leader : MARCOS ENNES BARRETO MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO MARCOS ENNES BARRETO MAYCON LEONE MACIEL PEIXOTO PAULO HENRIQUE FERREIRA DA SILVA SPIROS DENAXAS Data: 11 déc. 2019 Afficher le Résumé Data clustering refers to an unsupervised machine learning method, often used when no labeled data is available. This class of machine learning models is used to retrieve information by splitting the data into coherent and contrasted groups. The most popular partitional and hierarchical clustering algorithms are K-means and AGNES, respectively. Nevertheless, to make either K-means and AGNES suitable to categorical data, some extensions are required in terms of similarity measures to be used and central position measures to update centroids. Categorical variables bring together several scales with a finite and moderate set of values, such as nominal variables, ordinal variables, discrete interval variables, and continuous variables grouped into few categories. As an extension of K-means, K-modes aims to cluster categorical data by using the mode to update clusters’ centroids, as well as an overlap measure to define the distance between objects and centers. The extension of AGNES can follow the same logic. This work proposes the Frequency Factor, a new probability-based measure, specifically forged to update the centroids on clustering algorithms. The use of Frequency Factor to edit both K-means and AGNES yielded two proposed alternatives, K-fact and MARI, respectively. In our validation scheme, we used the Rand index from external validity criteria to measure the clustering performance and compare both algorithms. Those algorithms extended by Frequency Factor outperform or tie with K-modes, AGNES, and ROCK and can be used as a competitive alternative to cluster nominal-categorical variables.

Thèses

DIEGO ZABOT
(SPIDe Kids): adapting an interaction codesign process for deaf or hard of hearing children participation
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
ECIVALDO DE SOUZA MATOS
LYNN ROSALINA GAMA ALVES
SORAIA SILVA PRIETCH
Data: 22 mars 2019

Afficher le Résumé
In 2006 the Brazilian Computer Society (Sociedade Brasileira de Computação - SBC) proposed five major challenges in computer research in Brazil. The fourth challenge, "Participatory and universal access of the Brazilian citizen to knowledge", involved Human-Computer Interaction (HCI). The objective of this challenge was to overcome the barriers that impeded access and interaction by designing systems, tools, models, methods, procedures and theories capable of giving Brazilian citizens access to knowledge. In addition, interactive technologies are penetrating into every aspect of daily life. In this context, interaction design raises questions about how to design systems that are suitable for all users. This research aimed to understand how to adapt an interaction codesign process for people who are deaf or hard of hearing (D/HH) participate. The starting point was the analysis of a participatory codesign process, SPIDe, with the objective of identifying, adapting and evaluating design solutions for the participation of D/HH in the entire process. A case study was conducted at a state college for deaf people with a group of six year old students to test the adapted process. In this case study, it was created an educational game by codesign with the children. As result, the children produced two video games, created by e for D/HH childrens, whose goals were to help deaf children’s alphabetization, teaching numbers systems and the alphabet. In addition, the children recognized themselves as producers and the games are imbued with elements of their culture. From this results, guidelines and indications were defined on how to conduct a codesign process with D/HH children and also about possible extensions of this process.

PAULO ROBERTO DE SOUZA

RecTwitter: A Semantic Recommendation System for Twitter Users

Leader : FREDERICO ARAUJO DURAO
MEMBRES DE LA BANQUE :
DANILO BARBOSA COIMBRA
FREDERICO ARAUJO DURAO
JOÃO BATISTA DA ROCHA JÚNIOR
Data: 25 mars 2019

Afficher le Résumé
Twitter is one of the most popular microblogging services today, which allows users to share images, links, texts, etc., as well as follow or not to follow preferred accounts, which are also managed by other users. Public opinion has to do with the differences in volume and volume of publication. In view of this, it is necessary to adopt intelligent schemas to identify and filter accounts that publish content similar to the interests of the target user. This dissertation offers a comment system for a subject with rules of conversation, which leads us to analyze the accounts of a user, and recommends those that are discontinued and new ones that are followed in a row. As rules work as recommendation engines, people are shaped through an interaction between Twitter users as they are modeled through a domain ontology. To evaluate the proposed model, experiments were carried out with real users and comparisons with works related to the state of the art. The results of the online experiments indicate that 76% of users were approved as a reference. In relation to the results obtained in the off-line experiments, one of the best models of the state of the art.

EUDES DIÔNATAS SILVA SOUZA
An Architecture for Data Authoring and Web Publishing
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
CASSIO VINICIUS SERAFIM PRAZERES
ARTUR HENRIQUE KRONBAUER
RENATO DE FREITAS BULCÃO NETO
Data: 27 mars 2019

Afficher le Résumé
On the one hand, the simplicity of the HTML document standard was an initial design requirement, with the goal of rapidly expanding the Web by creating various HTML-authoring tools. On the other hand, this same simplicity also generates limitations for the use of the Web by machines due to the lack of structuring and semantics of HTML documents. In this context, the Semantic Web aims at extending the current Web, or the Web of Documents, such that the Web has become susceptible to interpretation and use by machines (e.g. software agents). To that end, the Web of Data attempts to describe the current Web’s data through structured documents that contain metadata and explicit semantics. In addition, rules for authoring and publication of structured and explicitly semantic data called Linked Data have been proposed for the Web of Data. However, the proposed document patterns in the Linked Data seems to undermine the initial goal of the HTML standard, i.e. simplicity. Thus, the aforementioned authoring process must now take into account that such simplicity is no longer a valid prerequisite for Web documents. Therefore, to generate and publish data for the Wed of Data just as it was for the Web of Documents, some studies have proposed methods and tools for the following: automatic generation of Linked Data from databases; semiautomatic extraction of Linked Data from HTML pages; and authored Linked Data for ordinary users. However, in relation to authoring and publication by ordinary users, these methods and tools are still being used in an ad hoc manner. That is, the previously mentioned studies present solutions that are domain specific such as social networks, content managers, wikis, blogs, etc. In this context, this work proposes a framework for data authoring and publication in the Web of Data according to the rules of the Linked Data standard while still focusing on ordinary users.

JÚLIA MADALENA MIRANDA CAMPOS
A Restricted Logic Programming Approach to the Nursing Scheduling and Routing Problem.
Leader : TIAGO DE OLIVEIRA JANUARIO
MEMBRES DE LA BANQUE :
TIAGO DE OLIVEIRA JANUARIO
MARLO VIEIRA DOS SANTOS E SOUZA
MAYRON CESAR DE OLIVEIRA MOREIRA
Data: 26 avr. 2019

Afficher le Résumé
The Home Care Service is characterized as a health care modality composed of a set of actions for prevention, rehabilitation and treatment of diseases, provided at home. This service has become increasingly present as a complementary health action, replacing hospitalization, as it offers a new modality of care to people with stable clinical conditions that require medical care. The Nursing Scheduling and Routing Problem is to determine in an integrated way the assignment of health professionals to patients, along with the routes of the vehicles that will transport the professionals. This problem is often resolved manually, making the process inefficient and often yielding unsatisfactory results. This dissertation presents a case study carried out with the Home Care Service team of the General Hospital of the State of Salvador, in order to propose a Constrained Logic Programming Model for the Nursing Scheduling and Routing Problem in order to maximize the number of patients attended and minimize the total distance traveled by health professionals.

BRAIAN VARJÃO GAMA BISPO
Xangô: a framework for robust attribute selection to the problem of unbalance between classes in text classification tasks
Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
TATIANE NOGUEIRA RIOS
Ricardo Cerri
RICARDO MARCONDES MARCACINI
Data: 3 mai 2019

Afficher le Résumé
Attribute selection is a widely used dimensionality reduction technique to deal with the difficulties associated with the "dimensionality curse" in text classification tasks. The most common attribute selection approach for textual databases is to weigh the relevance of each attribute to the learning process and to select the best-valued N's, where N is generally an empirically defined number. Although this strategy is widely applied, it can lead to the partial or complete exclusion of attributes essential for learning In this sense, the research presented in this paper aims to foster the generation of more reliable text classifiers through the improved Xangô, a framework for selecting attributes in tasks of classifying texts, whose selection process seeks to construct a reduced dimensional space where all classes are represented in a balanced way by the its most discriminatory terms. Experimental results indicate that Xangô is a framework that is adaptable to different state-of-art attribute selection methods, surpassing its individual performances in learning tasks performed under varied conditions, including multi-class problems, imbalance between classes, different classification algorithms and drastic dimensionality reductions.

Emilayne Feitosa Corlett

The construction of the algorithm concept in Computing textbooks through the External Didactic Transposition

Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
AYLA DÉBORA DANTAS DE SOUZA REBOUÇAS
ECIVALDO DE SOUZA MATOS
MARIA CAROLINA DE SOUZA SAMPAIO
Data: 21 mai 2019

Afficher le Résumé
In Brazilian Basic Education, the textbook is most often the main pedagogical resource used by the teacher in stimulating the construction of the knowledge by the students. This is due to the fact that the textbook is characterized as an object that directs the work of the teacher and important reference of studies for the student, besides being an object that does not require specific physical infrastructure to be used, reusable and free distribution in public education networks. In addition, the didactic book is elaborated with the intention of being a didatized version of the scientific knowledge for educational purposes, destined to the school public. This process of didatization is called External Teaching Transposition - an approach proposed by Yves Chevallard for the transformation of academic / scientific knowledge into teaching objects appropriate to the school context in which it is inserted. Considering the national movement for the insertion of the teaching of computation in the Brazilian school, the shortage of specific didactic books for the teaching of computation, as well as the scarcity of studies on the processes of didatization of the scientific knowledge of computation, this research had as objective to investigate the construction of the algorithm concept in computer books (used in school) through the External Didactic Transposition. This research followed a qualitative approach, in the documentary perspective, through the technique of content analysis carried out in three works: two textbooks and an academic book used as conceptual reference. Demonstrations of external didactic transposition of the algorithm concept were verified in the two plans of analysis made possible in the research, both in the field of "knowing wise" and in "knowing how to teach". The study showed that the transposition of scientific knowledge into "knowing to teach" required special care so that this knowledge did not become just a summary of the original concept, because changes and simplifications of scientific knowledge were found when transposed to computer textbooks.

ALBERTO PIETRO SIRONI
"VISUAL INTEGRATION AND MINING PUBLIC HEALTH DEDICES: DECASION STUDY IN MALARIA"
Leader : MARCOS ENNES BARRETO
MEMBRES DE LA BANQUE :
DANILO BARBOSA COIMBRA
MARCOS ENNES BARRETO
VANDERSON DE SOUZA SAMPAIO
Data: 29 mai 2019

Afficher le Résumé
In the last decades, the need to store and process large volumes of data, generated daily under the most varied sources and commonly referred to as big data, has grown significantly. The importance of these data increases as they are integrated to enable the execution of studies and analyzes, to support decision-making processes and to generate comprehensive information about a particular domain. The integration and processing of large volumes of data impose considerable computational complexity. The analysis of these data, for example, requires machine learning and visualization techniques to allow the correct identification of the information present in these data, not always easily perceived. In public health, specifically in the context of malaria, there is a worldwide effort to eradicate the disease, since it still has high incidence rates. This research is part of a project that deals with integration techniques and visual data mining applied in a malaria surveillance ecosystem in the construction of an integrated database comprising notifications, deaths, vector control and climatic data. This database is accessed through Malaria-VisAnalytics, a mining platform for descriptive, predictive and visual analysis, assisting public policy decision-making by government and health stakeholders. Our experimental and validation results favor interaction and visual exploration allowing effective surveillance. Finally, our study can be easily extended with new features and data sources to accommodate more complex scenarios.

NILTON FLÁVIO SOUSA SEIXAS
PRIMO: An ICN Model for Preventing Denial of Service Attacks by Flooding of False Interests
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
LEOBINO NASCIMENTO SAMPAIO
MAYCON LEONE MACIEL PEIXOTO
ANTONIO AUGUSTO DE ARAGÃO ROCHA
Data: 6 juin 2019

Afficher le Résumé
The current Internet architecture is designed to share resources available to communities. However, the way the Internet is used today has made the architecture incompatible with the explosive demand for distribution and data collection. Network users are interested in the content and not in their host. The Named Data Networking (NDN) is an Internet architecture that adapts to this new demand, reducing network traffic caused by the need to obtain the same content, and changes the orientation of data security , which currently secures the connections by which they will be transported, into the data itself, making the architecture resilient to many recurring problems and difficult to treat in the current architecture, such as the denial of service attack. However, these attacks have been adapted to this architecture and it is necessary to mitigate them so that the architecture is considered safe. In this context, PRIMO is proposed, an NDN-based model that forges a router-producer collaboration to detect, mitigate and prevent Denial of Service attacks by Flood of Interests (IFA). The components of the PRIMO model cooperate to: (i) detect an ongoing IFA attack, (ii) mitigate the attack by distinguishing between legitimate and false interests, (iii) prevent attack from reaching the core of the network after mitigation, and the network of new instances of attacks with prefixes already used. The experimental results show that PRIMO is effective in: detecting the attack, mitigating it by distinguishing between legitimate and false interests, preserving network performance and preventing further occurrences of an ongoing attack. The contributions of this dissertation are: i) a model with collaborative mechanisms for the detection, mitigation and prevention of IFA and ii) a mechanism for identifying false interests.

ANTONIO BATISTA DE OLIVEIRA NETO
Prototyping and validating the CORA ontology and its reference architecture
Leader : MARCOS ENNES BARRETO
MEMBRES DE LA BANQUE :
MARCOS ENNES BARRETO
LAIS DO NASCIMENTO SALVADOR
FLAVIO MORAIS DE ASSIS SILVA
Data: 12 juin 2019

Afficher le Résumé
Ontologies play an important role across several domains as they represent and define categories, properties and relationships among concepts, data and entities existing in those domains. Specially in Robotics, ontologies can be used as a standard way to represent and share knowledge and reasoning among autonomous agents. CORA (core ontology for robotics and automation) is a standard ontology developed by the IEEE ORA working group mapping main concepts and axioms from the robotics and automation (R&A) domain. After CORA has been approved as an official IEEE standard, ORA was split in different sub-groups addressing, among other topics, task representation (RTR) and autonomous robots architectures (AuR). This work presents a simulation study on a multi-robot reconnaissance application intended to serve as proof-of-concept on the feasibility of the CORA ontology and its related architecture. We show how robots and tasks are represented and dynamically mapped into the ontology, as well discuss some technical aspects of our simulation when dealing with the proposed architecture. Our results show the proposed ontology (CORA) and its associated reference architecture (ROA) are able to be correctly mapped during application runtime.

ANTONIO MATEUS DE SOUSA
MOBILITY MANAGEMENT IN VEHICLE DATA NETWORKS NAMES BASED ON STABILITY FROM THE LINK
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
ANTONIO ALFREDO FERREIRA LOUREIRO
LEOBINO NASCIMENTO SAMPAIO
MAYCON LEONE MACIEL PEIXOTO
Data: 19 juin 2019

Afficher le Résumé
Vehicular Ad Hoc Networks are cooperative networks that focus on provide to drivers,
passengers and pedestrians more security and comfort at the same time. On the other
hand, the mobility scenario of such networks are still complex to the current protocols,
which difficult the communication service providing. Vehicular Named Data Networking (VNDN) emerges as an alternative to the conventional Vehicular Ad hoc Networks
(VANETs) which persist in using the IP-based addressing scheme to locate and consume services. Several studies have already demonstrated the inefficiency of the TCP/IP
architecture in mobility scenarios such as VANETs, characterized by unpredictability,
dynamicity, and intermittent communication. In this way, VNDNs aim to overcome the
limitations related to TCP/IP architecture by introducing a new demand and consumption model that is centralized in content rather than in the producer of it. Such features
are inherent in the Named Data Networking (NDN) architecture, which present a native
support for mobility. However, NDNs do not cope with the temporal scope related to VANETs, where both the contents and connections are of limited duration as well as do not
handle the broadcast storm of packets. Therefore, this work proposes two new interest
forwarding strategies for VNDN that make use of the native characteristic of VANETs,
estimated link lifetime, to mitigate the broadcast storm of interest packets as well as its
side effects. In our strategies, the routing of interests is carried out in a controlled and
selective manner, which only those who are classified as able to be forwarders will forward
the packets. This behavior is determined by the use of link lifetime in the forwarding
decision-making process. Based on this, we can summarize the contributions of this work
as follows: i) propose a new efficient routing strategies for VNDN; ii) mitigation of broadcast storm of interest/data packets; iii) experimental evaluation on the effectiveness of
link life in routing. The results show that the proposed strategies reduces significantly
the number of interest retransmissions, which also alleviates the number of data packets
when compared to the other strategies. Additionally, both levels of interests satisfaction
and delays remained stable.

MARIA CLARA PESTANA SARTORI
Using Context to Support Task Distribution in Mobile Crowdsourcing.
Leader : VANINHA VIEIRA DOS SANTOS
MEMBRES DE LA BANQUE :
ARTUR HENRIQUE KRONBAUER
FREDERICO ARAUJO DURAO
VANINHA VIEIRA DOS SANTOS
Data: 27 juin 2019

Afficher le Résumé
In Mobile Crowdsourcing, tasks are distributed to workers according to sensor data, aiming to solve problems using collective intelligence. Requesters create tasks and distribute to the crowd workers in order to solve collectively a certain issue. Matching a task to the appropriate worker is an open issue in the area. This task distribution influences the quality of the workers’ response or the task acceptance rate. In this research we investigate the usage of context on mobile crowdsourcing, and propose ConTask, a context-aware approach to support task distribution. ConTask comprises a context model and a mobile app, which extends the task model proposed by Mrazovic and Matskin (2015) implementing the five context dimensions proposed by Zimmermann et al (2007): Individuality, Time, Relation, Activity, and Location. Additionally, ConTask presents a task distribution architecture, instantiated in the ConTask app prototype, which aims to support task matching to crowd workers according to the task's contextual requirements as defined by the requesters. The ConTask App was implemented according to the proposed model and achitecture, using popular sensors for each context dimension. Experiments were performed with different users in a University, and provided some insights regarding sensors usage; the accuracy of the implemented task distribution was 63%, with 73% precision and 63% revocation. To evaluate the ConTask model, we instantiated an performed an investigation with users in the University Administration domain. This experiment aimed to get insights about how the context dimensions influence tasks' acceptance from the crowd workers point of view. The results illustrated some tendencies in the influence of context usage in the worker willingness to perform the task; such as the higher the distance between the worker and the task location the lower the willingness of the worker on performing the task; the activity the worker is performing when receive the task request influences its acceptance, such as the activities of "waiting" or "talking" in contrast to "on cellphone" and "working"; the time of the day the worker received the task, such as morning versus night; the worker expertise regarding the task requirements; and the social relation between the worker and the task requester.

KARLA MALTA AMORIM DA SILVA
The Lehman's Laws in Highly-Configurable Systems: An Empirical Study on the Linux Kernel Variable Features”
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
EDUARDO SANTANA DE ALMEIDA
LEOPOLDO MOTTA TEIXEIRA
RODRIGO ROCHA GOMES E SOUZA
Data: 30 juil. 2019

Afficher le Résumé
Background: Highly-configurable systems (HCS) provide a set of features which can be combined in order to generate a family of different products. These features can be either common to all products or variable features - present only in some of the generated products. When evolving this kind of system, we have to deal with the modification of both common and variable features, which may be different regarding their evolutionary behavior. Thus, the Lehman's Laws of software evolution may not hold in all cases.

Objective: In this study, our objective is to evaluate the validity of Lehman's Laws in the context of variable features of an open source HCS - the Linux kernel.

Methods: Our approach consists of a replication of an empirical study, where we reused the original design but modified the type of subject in order to observe if the results hold in a different context. We analyzed bug reports, commits, and source code of 47 releases of the Linux kernel launched over a period of 23 years of evolution.

Results: Results show that half of the laws evaluated were supported, namely, Continuing Change, Continuing Growth and Conservation of Familiarity laws. Conversely, the laws Increasing Complexity, Declining Quality and Conservation of Organizational Stability were not supported for the variable features of the Linux kernel.

Conclusions: This study brings together the available evidence of the validity of the Lehman's Laws in an HCS and discusses how its open source nature and domain of application influence the results. We reflect upon the implications of our results and propose a set of recommendations to the HCS community in order to help improving the evolution of variable features.

ELTON FIGUEIREDO DA SILVA

INTEGRATION OF DDM PRACTICES IN THE AGILE PROCESS: ASSESSING ASPECTS OF SOFTWARE EVOLUTION.

Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
ANA PATRICIA FONTES MAGALHÃES MASCARENHAS
LEONARDO GUERREIRO AZEVEDO
RODRIGO ROCHA GOMES E SOUZA
Data: 6 août 2019

Afficher le Résumé
In software development that uses some agile method, the source code, rather than documentation, is the main and most up-to-date artifact. In contrast to this reality, the Model Driven Development (DDM) approach has the model as the main artifact of the development process. Intending to get th advantages of both approaches, (SALES, 2017) developed the ScrumDDM meta-process (agile MDD) that added modeling and model transformation practices coming from model driven development to the Scrum framework. Unlike the work of (SALES, 2017), this paper will investigate whether the ScrumDDM metaprocess preserves the characteristic agility of SCRUM, supports software evolution through user stories and documentation provided by DDM using for that aexisting software and developed by ScrumDDM, as well as whether this metaprocess is effective in creating new processes that integrate SCRUM and DDM. To evaluate the metaprocess regarding its ability to evolve user stories and the agility in software development, a controlled experiment was developed. To evaluate the generalization of metaprocess, in turn, a new software process from the academic literature was instantiated from this metaprocess. Through the controlled experiment developed in this work, it was possible to demonstrate that the evaluated metaprocess supported the software evolution through DDM documentation. It has also been shown that development agility has been enhanced through the transformations and models of the DDM approach. In addition to the results cited, at the end of the development process, it was observed that, both the project code and documentation, were up to date.

RAFAEL HENRIQUE TIBÃES

RECOGNITION OF DIGITAL PRINTING UNDER SCALE VARIATIONS FOR IDENTIFICATION OF DISAPPEARED CHILDREN

Leader : MAURICIO PAMPLONA SEGUNDO
MEMBRES DE LA BANQUE :
LEONARDO GOMES
MAURICIO PAMPLONA SEGUNDO
RODRIGO MINETTO
Data: 20 août 2019

Afficher le Résumé
Since the fingerprint of a newborn individual grows to adulthood, the fingerprint of this

individual will present different scales when captured over time. The literature comparison methods do not

They comprise a robust range of variation of the image, preventing automatic comparison of fingerprints of a

same individual collected when newborn and after its growth. For this reason, this paper presents a

solution to calculate the thickness of the ridges and valleys of the fingerprint, and thus calculate their average frequency.

The ratio scale between two fingerprint images is estimated based on the average frequency for each image,

allowing resizing these images to a common frequency, which enables the use of comparison algorithms

of literature to perform recognition over time. In experiments using a synthetic database, our

approach EER rate obtained for a 0.09 ratio scale 4 times, while the same comparison algorithm

without resizing the images you get the EER rate of 0.5, i.e. similar to a random hit.

MICHELL FELIPPE FERNANDES MACEDO QUEIROZ
Matheuristics for the minimum weighted feedback vertex set and $b$-coloring problems
Leader : RAFAEL AUGUSTO DE MELO
MEMBRES DE LA BANQUE :
CARLOS EDUARDO DE ANDRADE
CELSO DA CRUZ CARNEIRO RIBEIRO
MARCIO COSTA SANTOS
RAFAEL AUGUSTO DE MELO
TIAGO DE OLIVEIRA JANUARIO
Data: 24 sept. 2019

Afficher le Résumé
In this thesis, we propose new matheuristics for two NP-hard graph optimization problems. The first studied problem is denoted the minimum weighted feedback vertex set problem (MWFVS), for which given a weighted graph $G=(V,E)$ it consists in obtaining a minimum weight subset $F\subseteq V$ of the vertex set whose removal makes the graph acyclic. Differently from other approaches in the literature, we tackle this problem via the maximum weighted induced forest problem (MWIF). First, we propose two new compact mixed integer programming (MIP) formulations, using a polynomial number of variables and constraints. Next, we develop a matheuristic that hybridizes a multi-start iterated local search metaheuristic with a MIP-based local search procedure. This thesis also studies the $b$-coloring problem, for which given a graph $G=(V,E)$ it consists in attributing a color to every vertex in $V$ such that adjacent vertices receive different colors, every color has a $b$-vertex, and the number of colors is maximized. A $b$-vertex is a vertex adjacent to vertices colored with all used colors but his own. The optimal solution of the $b$-coloring problem determines the $b$-chromatic number of $G$, denoted $\rchi_b$. We present an integer programming formulation and a very effective multi-greedy randomized heuristic which can be used in a multi-start fashion. Furthermore, a matheuristic is proposed combining the heuristic with the integer programming formulation. Extensive computational experiments show that the proposed techniques for both problems outperform the state-of-art metaheuristics for the majority of tested instances.

DIEGO BARBOSA ARIZE SANTOS
Towards an accurate energy forecast given uncertainties in Smart Grids
Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
MARCOS ENNES BARRETO
MATHEUS GIOVANNI PIRES
TATIANE NOGUEIRA RIOS
Data: 14 oct. 2019

Afficher le Résumé
Managing energy generation and its consumption is usually a difficult task for electric power grids systems, given the great volume of data. Moreover, the data obtained from these systems is tied to uncertainty occasioned by seasonality and natural environment disturbances. Therefore, efforts have been made on the construction of Smart Grids, i.e. intelligent energy networks, which combine Computational Intelligence with the electricity power grids, in order to improve the balance between energy generation and its consumption. Smart Grids require powerful controllers to keep the balance of energy generation and its demand. Those controllers have to be aware of future loads, and the prediction of this data must be very accurate to provide efficient decision support. Since Smart Grid's energy generation and consumption data varies over time, following a time series distribution, time series forecasting methods can yield predictions to support those controllers on the decision-making process. Nevertheless, forecasting over data that tied to uncertainty may have some disturbances. In order to overcome those issues, on this work, we investigate time series forecasting methods, mapped in a systematic literature review, aiming to deliver accurate forecasts even for uncertain data. Towards finding a method for that, we present a comparison study over different time series forecasting methods to evaluate which one would achieve better accuracy in energy distribution. The compared methods were the Adaptive Fuzzy Neural Network (ANFIS), Recurrent Neural Networks (RNN), Support Vector Regression (SVR), Random Forest and SARIMAX. The ANFIS algorithm had outperformed the other approaches, delivering more accurate results. From that comparison, we have proposed a framework that combines the ANFIS' fuzzification step with well-known forecasting methods to improve their performances on forecasting under uncertain energy data.

JOÃO ALBERTO CASTELO BRANCO OLIVEIRA
Fuzzy Software Analyzer (FSA): A new approach to interpret and analyze software repositories
Leader : RICARDO ARAUJO RIOS
MEMBRES DE LA BANQUE :
RICARDO ARAUJO RIOS
CLAUDIO NOGUEIRA SANT ANNA
TATIANE NOGUEIRA RIOS
CRISTIANO HORA DE OLIVEIRA FONTES
Data: 31 oct. 2019

Afficher le Résumé
Software metrics have been widely adopted during the process of developing source code, allowing to manage projects with higher quality by quantifying different characteristics that may impact, for example, the project costs and the planned activities. In general, such metrics can be divided into direct and indirect. The direct ones are commonly used in software development and maintenance. Those measures are directly extracted from source codes such as, for instance, the number of written lines and the total number of errors collected in a given time instant. The indirect measures, on the other hand, are obtained after performing some interpretation of the project. They may also include direct metrics and were developed to be generally applied to different software projects. However, the interpretation of these metrics depends on the experience of the project manager (domain specialist) and must consider specific characteristics related to the team of developers. In this project, we present a new approach, referred to as Fuzzy Software Analyzer -- FSA, designed to automatically extract characteristics and patterns in software repositories, aiming at: i) assisting the expert during the task of interpreting metrics, especially by working on large volumes of source codes; and ii) monitoring the evolution of the software as new releases are submitted to the repositories. The proposed approach was assessed by analyzing the Linux Test Project repository, demonstrating its usefulness and applicability.

EDISON DE JESUS SANTOS
Process Smell : Um Catálogo de Bad Smells para Processos de Software.
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
ANA PATRICIA FONTES MAGALHÃES MASCARENHAS
IVAN DO CARMO MACHADO
RITA SUZANA PITANGUEIRA MACIEL
Data: 28 nov. 2019

Afficher le Résumé
The systematic use of the software process favors the quality of the generated product and guides the steps for the construction of the adherent software to the expected quality. Processes are commonly specified and represented by software process modeling languages known as Software Process Modeling Language (SPML). Among these languages, Software & Systems Process Engineering Metamodel (SPEM) stands out for being a Unified Modeling Language (UML) profile for modeling software processes and systems. Software processes evolve together with the needs of the institution and professionals who use it, needing to be monitored and evaluated constantly to maintain its qualities. Currently, four forms of process assessments are best known among industry practices and the software engineering literature, namely: (i) product assessment against the process, (ii) checks associated with assessment models, (iii) validation through formal specifications and (vi) systemic view of human-centered software engineering. Although it is possible to apply any of the four forms of assessments mentioned above to guarantee the evolution and quality of the software process, they have some characteristics that can be inconvenient. Firstly, these forms of evaluation do not accurately detect the elements of the process that potentially have the most negative impact on their quality. According to these evaluations, data obtained after the execution of the process are used, so it is only possible to verify problems in the process after the execution of the same, which may already cause some possible losses. More specifically, when these forms of evaluation are carried out using simulation, they require specific knowledge for the execution of the simulation procedures and the use of symbolic data that simulate the process data after its execution. Although SPML like SPEM are used to improve the understanding of a process, the specification of a process can be done inappropriately hurting factors of quality of the process. This phenomenon can be compared to the concept of bad smells, which are design problems in software code and in this work it will be presented as a process smell for observing problems in the process design. Thus, a process smell in the specification of a process can negatively impact the quality of the process and consequently affect the quality of the software product.

LUIS EMANUEL NEVES DE JESUS
FEATURE INTERACTION IN ENVIRONMENTS SMART: AN APPROACH BASED ON DATA ANALYSI
Leader : DANIELA BARREIRO CLARO
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
IVAN DO CARMO MACHADO
CELIA GHEDINI RALHA
Data: 16 déc. 2019

Afficher le Résumé
The growth of the Internet of Things has allowed many devices to be daily connected. Each device has its own action rules and its peculiarities that can be monitored by a domain expert. Many of these devices are also configured to work in conjunction with other devices in controlled environments. With the advancement of the Internet of Things, it becomes more difficult to manage when devices are connected and/or disconnected from the environment. Moreover, with the presence of a new device, such as restricted usage rules for a controlled environment, may be violated and cause unexpected behavior in the environment. These cases are related to feature interaction. Feature Interaction occur when two or more devices/services generate instability in the environment due to input or output of a device. In dynamic environments such as IoT, device inclusion and exclusion makes it impossible for an expert to maintain these rules. According to the systematic mapping review performed, most works use the rules applied by the expert, requiring a method that allows to detect these feature interactions automatically. In this sense, the main objective of the present work is to detect features interacionista based on data analysis. Feature interaction data sets were annotated to obtain a model for automatically identifying rules. Experiments were performed and the results showed evidence of automatic detection of resource interactions in similar or complementary domains.

RUIVALDO AZEVEDO LOBÃO NETO
Using Markov Chains and Radial Base Function Networks for Online Concept Change Detection in Streaming Data
Leader : RICARDO ARAUJO RIOS
MEMBRES DE LA BANQUE :
GLORIA MILLARAY JULIA CURILEM SALDIAS
MARCELO MAGALHAES TADDEO
RICARDO ARAUJO RIOS
Data: 16 déc. 2019

Afficher le Résumé
The amount of information produced by computer systems has grown dramatically in recent decades. A significant portion of this information is produced as continuous streams, which are constant and potentially infinite sequences of data.
These flows are mostly non-stationary and may change in the distribution of data or in the context of the generating process. These changes are called concept changes and can negatively impact the performance of applied learning models. To mitigate this problem, researchers have been developing specialized methods for detecting concept changes. However, the proposed methods have limitations when applied in some continuous flow scenarios, such as the need for expert labeling and the inability to meet the processing time and computational resource constraints of these scenarios. Aiming to overcome these limitations, this paper presents a new concept change detection method called RBFChain, based on Radial Base Function Networks (RBF) and Markov Chains. Briefly, RBF networks perform, in their middle layer, an activation process that implicitly produces groups from the observations received over time.

In addition, Markov chains allow us to model the center transitions that occur in such observation groups. Concept changes are then detected when the active center of the cluster changes and the probability of transition in the Markov model exceeds a threshold. The presented method differs from existing works by detecting changes in runtime, computationally efficient and independent of labels. To evaluate the RBFChain method as a viable concept change detector, a sensitivity, precision and noise tolerance analysis was performed using synthetic data sets, and their results were compared with the main algorithms available in the literature. In addition, the technique was applied to a real fixation and saccade classification problem in eye tracking activity. With this application, it was possible to investigate and propose a solution to a relevant problem involving the area of Neuroscience and Computation. The results obtained with the synthetic data sets suggest that the RBFChain is statistically better or equivalent to the main detectors present in the literature. Moreover, the developed technique presented good results when applied to the ocular monitoring problem, being able to classify fixations and balconies in real time and with precision equivalent to the state of the art.

WITÃ DOS SANTOS ROCHA

SEMANTIC SIMILARITY OF ATTRIBUTES TO CLOUD DATA: A CASE STUDY IN MIDAS

Leader : DANIELA BARREIRO CLARO
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
FRANK AUGUSTO SIQUEIRA
LAIS DO NASCIMENTO SALVADOR
Data: 16 déc. 2019

Afficher le Résumé
Large amounts of heterogeneous data produced by social networks, Internet-connected devices, and web applications has been stored and managed in different formats at distinct service levels Data-as-a-Service (DaaS) and Database-as-a-Service (DBaaS). This solutions provides on demand services via Application Programming Interface (API). To access these distributed cloud services, users in most cases face data ambiguity issues. To solve these problems and to avoid performing complex and laborious tasks to access this data, an automated solution that serves as an intermediate layer for establishing communication between SaaS and DaaS / DBaaS service levels is required. This solution allows SaaS service level consumers to access their data stored at different DaaS service levels through a single query. The middleware layer called Middleware for Interoperability Between SaaS and DaaS (MIDAS) provides this solution to users in a transparent way. Over time, DaaS service level parameter updates can semantically affect query terms (that are automatically or not automatically) configured for applications at SaaS service levels. To ensure that cloud consumers continue to access DaaS data, an approach that ensures semantic similarity between DaaS parameters to maintain the reliability of the original request is recommended. Our work proposes a method (SM) with two similarity ways: (i) edge counting (Cosine and Jaccard) to measure the similarity between two attributes and the Information Content(IC) to measure similarity based on knowledge through a WordNet corpus. The IC method is used in the model to fill the limitations of the edge counting method. To choose the edge count methods (Cosine and Jaccard), an environment with eight distance measurement algorithms, twenty-two authentic parameters from eleven DaaS providers, and five possible parameter change situations was simulated. As a proof of concept, our model is implemented in MIDAS to evaluate three criteria: overload, performance and correctness. The results of our experiments showed that we are in the first direction to provide semantic interoperability between SaaS and DaaS in MIDAS.

ROSELANE SILVA FARIAS
DESIGNING SMART CITY MOBILE APPLICATIONS: A GROUNDED THEORY
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
EDUARDO SANTANA DE ALMEIDA
ELISA YUMI NAKAGAWA
Data: 20 déc. 2019

Afficher le Résumé
The software architecture community has played a crucial role in the development of mobile software. Many of the ideas used in the design of these systems came from traditional software architecture and those ideas have contributed to mobile computing becoming ubiquitous. Mobile applications in the context of smart cities are very challenging since they need to operate within the power, processor, and capacity limitations of mobile devices, the exacting demands of life critical smart city requirements, and the constantly changing and exposed environment which may not always be trusted. Since there are no widely accepted design models for this type of software, developers must resort to primitive design decisions to meet all the needs of these applications, which takes additional time and expertise. For this reason, this study aims to investigate the design process for mobile applications in the context of smart cities. In order to address the lack of verified information about designing mobile apps, we conducted a multi-case study with 9 applications from 4 different development groups to build a grounded theory. The applications were reverse engineered to expose the architecture of each application. Based on all the data, an emergent grounded theory was constructed to explain how the selected design process produces an app with the desired characteristics. The grounded theory developed through this research, and the process by which the theory was developed, were subjected to an evaluation process developed from the grounded theory literature. That evaluation validated the experimental process and verified that the experimental results followed from the correct use of the experimental process. The evaluation also addressed some of the threats to validity such as investigator influence. To further ensure validity this process included gathering data from additional projects using the experimental process. The resulting theory offers explanations for how software engineering teams design mobile apps for smart cities. This knowledge will serve as a basis to further understand the phenomena and advances towards more effective design and development process definitions.

JURACY BERTOLDO SANTOS JÚNIOR

DEVELOPMENT AND VALIDATION OF PREDICTIVE MODELS FOR THE MALARIA EPIDEMIC.

Leader : MARCOS ENNES BARRETO
MEMBRES DE LA BANQUE :
MARCOS ENNES BARRETO
TATIANE NOGUEIRA RIOS
VANDERSON DE SOUZA SAMPAIO
Data: 20 déc. 2019

Afficher le Résumé
Data mining techniques allow extracting relationships and features present in data that are not easily noticeable, especially in scenarios involving large volumes of data. These techniques allow grouping and classify data, identify patterns and exceptions, as well as establish association relationships between different data. THE Predictive analytics is a sub-area of data mining that uses machine learning techniques to implement of predictive analytical models. Such models are based on prior knowledge of the data (descriptive analysis) to design possible future events about the context being analyzed. This work is an integral part of a research project which aims to integrate malaria reporting data from different sources of information and develop models of predictive analysis on these data. Specifically, this work involves the study of different prediction techniques based on in models and data and, based on this study, the development of predictive models for the epidemiological scenario of malaria in Brazil. Four case studies involving socioeconomic and epidemiological data are discussed as evidence of concept of the proposed models. The results show that the models developed have an accuracy variable between 60% and 85% for the tested scenarios, with prediction windows of up to 3 weeks and considering the large variation observed in the malaria cases in the last four years

RICARDO BRASIL TEIXEIRA
SHARED RESOURCES IN MULTIPROCESSOR REAL-TIME SYSTEMS SCHEDULED BY RUN
Leader : GEORGE MARCONI DE ARAUJO LIMA
MEMBRES DE LA BANQUE :
ALIRIO SANTOS DE SA
ERNESTO DE SOUZA MASSA NETO
GEORGE MARCONI DE ARAUJO LIMA
PAUL DENIS ETIENNE REGNIER
Data: 20 déc. 2019

Afficher le Résumé
A hard real-time system can be de􏰌ned as the one for which all its tasks must meet their deadlines whenever the system runs. This requirement makes the scheduling algorithm a key element for system correctness. Ideally, the scheduling algorithm employed must both exhibit low overhead (e􏰎ciency) and ensure that no task deadline is missed whenever this can be ensured by some scheduling algorithm (optimality). RUN (Reduction to Unipro- cessor) is an algorithm capable of e􏰎ciently and optimally scheduling a set of strictly periodic tasks on a multiprocessor platform when tasks do not share any resources but processors. Although it has already been shown that RUN is compatible with resource sharing, the only existing solution prevents preemptive access to shared resources. Unlike this approach, which can be considered too restrictive due to its poor schedulability, we used MrsP (Multiprocessor resource sharing Protocol) as a more 􏰍exible resource sharing mechanism. Making the rules of both RUN and MrsP compatible to each other was thus our main goal. The derived solution was implemented on Linux Textbed for Multi- processor Scheduling in Real-Time systems (LitmusRT), namely a Linux-based real-time operating system. We proposed a new task packaging heuristic and performed experi- mental evaluations comparing our solution with the existing one. The results showed that the proposed solution presented better results in terms of schedulability and number of migrations and preemptions.

Thèses

CRESCENCIO RODRIGUES LIMA NETO
An Approach for Recovering Architectural Variability from Source Code
Leader : CHRISTINA VON FLACH GARCIA CHAVEZ
MEMBRES DE LA BANQUE :
ALESSANDRO FABRICIO GARCIA
CLAUDIO NOGUEIRA SANT ANNA
IVAN DO CARMO MACHADO
MANOEL GOMES DE MENDONCA NETO
THELMA ELITA COLANZI
Data: 19 févr. 2019

Afficher le Résumé
Software Product Line Engineering (SPLE) has been widely adopted for applying systematic reuse in families of systems. Given the high upfront investment required for SPLE adoption, organizations commonly start with more opportunistic reuse approaches (e.g., a single system that they clone and modify). However, maintenance problems appear when managing a large number of similar systems where each of them implements and evolves particular characteristics. One viable solution to solve this issue is to migrate to SPLs using an extractive approach. This initiative, in its early phases, includes the definition of a Product Line Architecture (PLA) to support the derivation of product variants and also to allow customization according to customers' needs. In this way, the systematic use of Software Architecture Recovery (SAR) techniques enables PLA recovery and keeps the PLA aligned with development. Our objective is to provide an automatic approach to recover PLAs and guidelines to support the PLA recovery. We gathered knowledge by means of literature reviews and exploratory studies to characterize the state-of-the-art and identify research gaps on SAR techniques and tools that support the recovery of architectural variability information from source code for a family of products. The use of SAR techniques and tools to recover a PLA that documents variability information at the architecure level may address issues related to SPL adoption, design and evolution. Unfortunately, few studies investigate PLA recovery and also provide empirical evaluation. One of the main issues in the extractive approach is the explosion of the variability in the PLA representation. Our approach is based on identifying variability on architectural level by extracting information from variants' source code. To evaluate our approach, we performed a set of empirical studies.

LARISSA ROCHA SOARES
Feature Interactions in Highly Configurable Systems: A Dynamic Analysis Approach with Varxplorer
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
IVAN DO CARMO MACHADO
RODRIGO ROCHA GOMES E SOUZA
EDUARDO SANTANA DE ALMEIDA
CHRISTINA VON FLACH GARCIA CHAVEZ
MARCIO DE MEDEIROS RIBEIRO
EDUARDO MAGNO LAGES FIGUEIREDO
Data: 21 févr. 2019

Afficher le Résumé
Highly configurable systems (as known as software product lines) provide significant reuse opportunities by tailoring system variants based on a set of features. Those features can interact in undesired ways which may result in faults. However, most interactions are not easily detectable as specifications of feature interactions are usually missing. The feature interaction problem has been a challenging subject for years. Despite the existence of studies to map out available evidence on feature interaction for single systems development, there is a lack of understanding on common strategies, activities, artifacts and research gaps for interactions in configurable systems. Thus, this thesis initially gathered systematic mapping study evidence by analyzing 40 feature interaction primary studies, which were classified according to development lifecycle stages and the feature interaction solution presented, either detection, resolution or general analysis. Recent analyses focused on detecting feature interaction bugs from global specifications, i.e., specifications that all configurations of a configurable system need to fulfill, such as requiring that each configuration does not crash. However, specifications at the feature level are usually missing and, then, many approaches may not detect all incorrect system behavior, specially bugs not covered by global specifications and bugs that do not result in a crash or other easily observable behavior. Instead of starting from a set of specifications like most approaches, we propose to inspect feature interactions as they are detected and incrementally classify them as benign or problematic. We aim to provide an inspection process that helps developers to distinguish intended interactions from interactions that may lead to bugs. We use variational execution to observe internal interactions on control and data flow of highly configurable systems. To help developers understand these interactions, we propose feature-interaction graphs as a concise representation of all pairwise interactions. We provide two analyses that provide additional details about interactions, namely suppress and require interactions. Our approach and tool, VarXplorer, provide an iterative analysis of feature interactions allowing developers to focus on suspicious cases. Finally, we perform two empirical studies to evaluate the inspection process and how feature interaction graphs can help users identify suspicious interactions. The first study is a controlled experiment to investigate and compare the ability of users when identifying suspicious interactions with and without VarXplorer, in a setting composed of different systems, performing different tasks. The second study focuses on the iterative process of test cases execution and how it can be used for a faster and more objective feature interaction analysis.

Sistemas altamente configuráveis (também conhecidos como linhas de produtos de software) fornecem oportunidades significativas de reuso, uma vez que eles adaptam variantes do sistema com base em um conjunto de features. Essas features podem interagir de formas indesejadas, resultando em falhas. Além disso, a maioria das interações não é facilmente detectável, já que especificações de interações entre features geralmente não são definidas, especificadas e documentadas em um projeto de software.O problema da interação entre features tem sido um assunto desafiador por anos. Apesar da existência de estudos que mapeiam essas interações, ainda não há muitos trabalhos sobre a compreensão de estratégias, atividades, artefatos e lacunas de pesquisa para interações em sistemas configuráveis. Desta forma, esta tese provê inicialmente um mapeamento sistemático de estudos por meio da análise de 40 trabalhos, os quais foram classificados de acordo com os estágios do ciclo de vida de desenvolvimento e a solução de interação apresentada (detecção ou resolução de interações). Análises recentes têm focado na detecção de erros de interação de features a partir de especificações globais, ou seja, especificações que todas as configurações de um sistema configurável precisam cumprir. No entanto, especificações no nível de features ou interações são geralmente negligenciadas e raramente documentadas. Neste cenário, muitas abordagens não conseguem detectar todos os problemas de comportamento do sistema, especialmente erros não cobertos por especificações globais e erros que não resultam em uma falha ou outro comportamento facilmente observável.Ao invés de partir de um conjunto de especificações como a maioria das abordagens, propomos inspecionar as interações de features `a medida que são detectadas e classificá-las gradativamente como benignas ou problemáticas. Nossa abordagem e ferramenta, VarXplorer, fornece um processo de inspeção que ajuda os desenvolvedores a distinguir as interações intencionais das interações que podem levar a bugs. Usamos a execução variacional para observar interações internas ao fluxo de controle e fluxo de dados de sistemas altamente configuráveis e propomos gráficos de interação de features como uma representação concisa de todas as interações entre pares de features. Por fim, realizamos dois estudos empíricos para avaliar como o processo de inspeção e os gráficos de interação de features podem ajudar os desenvolvedores a identificar e entender interações suspeitas. O primeiro é um experimento controlado que investiga e compara a capacidade dos desenvolvedores ao identificar interações suspeitas com e sem o VarXplorer. O segundo foca no processo iterativo de execução de casos de teste e como ele proporciona uma análise de interações mais rápida e objetiva.

ROBESPIERRE DANTAS DA ROCHA PITA
CLUSTERING CATEGORICAL DATA USING THE FREQUENCY FACTOR
Leader : MARCOS ENNES BARRETO
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
MARCOS ENNES BARRETO
MAYCON LEONE MACIEL PEIXOTO
PAULO HENRIQUE FERREIRA DA SILVA
SPIROS DENAXAS
Data: 11 déc. 2019

Afficher le Résumé
Data clustering refers to an unsupervised machine learning method, often used when no labeled data is available. This class of machine learning models is used to retrieve information by splitting the data into coherent and contrasted groups. The most popular partitional and hierarchical clustering algorithms are K-means and AGNES, respectively. Nevertheless, to make either K-means and AGNES suitable to categorical data, some extensions are required in terms of similarity measures to be used and central position measures to update centroids. Categorical variables bring together several scales with a finite and moderate set of values, such as nominal variables, ordinal variables, discrete interval variables, and continuous variables grouped into few categories. As an extension of K-means, K-modes aims to cluster categorical data by using the mode to update clusters’ centroids, as well as an overlap measure to define the distance between objects and centers. The extension of AGNES can follow the same logic. This work proposes the Frequency Factor, a new probability-based measure, specifically forged to update the centroids on clustering algorithms. The use of Frequency Factor to edit both K-means and AGNES yielded two proposed alternatives, K-fact and MARI, respectively. In our validation scheme, we used the Rand index from external validity criteria to measure the clustering performance and compare both algorithms. Those algorithms extended by Frequency Factor outperform or tie with K-modes, AGNES, and ROCK and can be used as a competitive alternative to cluster nominal-categorical variables.

	Thèses
1	NAILTON VIEIRA DE ANDRADE JUNIOR A Platform for Discovery, Deployment and Self-Configuring Services on the Web of Things Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : CASSIO VINICIUS SERAFIM PRAZERES FREDERICO ARAUJO DURAO ARTUR HENRIQUE KRONBAUER Data: 12 janv. 2018 Afficher le Résumé The evolution of embedded computing has enabled the emergence of devices that connect to the Internet, fostering the Internet of Things. In the same direction, the Web of Things proposes to make these devices available as resources directed to the development of applications, using the protocols and the Web standards. The variety of devices that can connect to the Web of Things demands both the effort of implementation of specific services for access to each device, as methods that allow its use by humans and machines. In this context, this work proposes the automatic configuration and deployment of devices that appear as resources in the Web of Things, through models that map their functionalities. Existing solutions expose RESTFul Web Services built into physical devices, making the strategy non-agnostic over the application protocol and compromising service maintainability. The solution proposed in this work relates the existing Web standards and technologies for the development of a free and integrated platform, with the following functionalities: the dynamic discovery of the devices when connecting to a local network, using discovery protocol of network service; the automatic generation of RESTFul Web Services for deployment of devices in the Web of Things; and an approach to practical implementation of semantic description in physical devices, in order to provide the publication of semantically enriched devices in the context of the Web of Things. Finally, network-level assessments are carried out, from application service of the developed platform.
2	GEORGE CAIQUE GOUVEIA BARBOSA Using Generic Linguistic Features for Classification of Relational Triples in Portuguese, English and Spanish Leader : DANIELA BARREIRO CLARO MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO MARLO VIEIRA DOS SANTOS E SOUZA VLÁDIA CÉLIA MONTEIRO PINHEIRO Data: 6 févr. 2018 Afficher le Résumé The increase in the amount of text generated daily on the web has made it more difficult the analysis and extra information of these data. In addition, the number of forms with that people can express themselves using natural language increases the complexity the development of computational tools to extract useful information from texts. Extra Information (EI) has the role of automating the transformation of corpus textual information in structured information. Extra informations of texts are structured in different ways. The relational triple of the type (subject-predicate-object) and structure most used in IE. The EI task can be classified as traditional and open. THE The traditional one makes extra cations from a limited set of predicates. Haha In addition to Open Information (EIA), the relationships are not limited by a set of therefore, one of the objectives of this task and the independence of the domain. The EIA can be divided into two stages: (i) extra and (ii) classi cation. M etodos of EIA present in the literature use spe- cific language functions of the target language in their stages of classi - cation. Language resources for English are more the other languages, thereby increasing the OIE's languages. Most of these methods are based on English. Therefore, the work, and in the classi cation stage, use a set of multi-language features to classi cation of relational triples, contributing to language independence in EIA. Experiments were performed to compare the multi-language features with the proposals by ReVerb. The results obtained showed that, at the 5% level of significance, the multi-language features presented superior performance for English (p <0.01). After the to translate the part of the ReVerb features, both sets achieved similar in Spanish (p = 0.76) and Portuguese (p = 0.07)..
3	MADSON RODRIGUES DE ARAUJO Predictive-based DBA algorithms to enable LR-PONs to support fronthaul traffic in C-RAN deployments Leader : GUSTAVO BITTENCOURT FIGUEIREDO MEMBRES DE LA BANQUE : GUSTAVO BITTENCOURT FIGUEIREDO DIVANILSON RODRIGO DE SOUZA CAMPELO ANDRE COSTA DRUMMOND Data: 20 avr. 2018 Afficher le Résumé Mobile devices data traffic has increased continuously in the last years and the growth prediction is about 50% per year until 2021. This growth in data demand has created many problems of cost and revenue to network operators to maintain and enlarge these networks. Literature has approached the Cloud Radio Access Networks (C-RAN) as an architecture of great potential to network operators being able to offer high data rates while reducing capital expenditure (CAPEX) and operational expenditures (OPEX). C- RAN main idea is to centralize the baseband processing of radio base stations in the cloud. This way C-RAN can generate CAPEX and OPEX economies, by deploying simpler base stations, making devices upgrade easier because most of the equipments are in one central place, generating better traffic balancing and reducing idleness costs by turning equipment on/off. However, its necessary to forward the base station traffic, which will be processed in the cloud, by a fronthaul network of low latency and jitter and high capacity. These requirements are due to the restrict performance requirements of CPRI and LTE-A protocols. This work focuses on the challenges related to the fronthaul deployment in a C-RAN based LTE-A architecture, more specifically in utilize a long reach network as fronthaul in this kind of C-RAN. We propose the Long Reach Passive Optical Networks (LR-PON) as the solution to fronthaul because of its capacity to offer high data rates (40 Gbps) and a lower CAPEX and OPEX compared to other optical network technologies. Furthermore, we introduce two prediction-based dynamic bandwidth allocation (DBA), namely Predictive-DBA (PD-DBA) and Mobile Predictive-DBA (MPD-DBA), which are able to achieve lower delay compared to their similar algorithms in literature. Moreover, The MPD-DBA, provides average delays under the fronthaul delay limit in 100 km reach and CPRI 1 up to 4 scenarios.
4	ALAN DOS SANTOS SOARES The analysis of classifiers for 3D dynamic hand gesture recognition based on geometric descriptors Leader : ANTONIO LOPES APOLINARIO JUNIOR MEMBRES DE LA BANQUE : ANTONIO LOPES APOLINARIO JUNIOR KARL PHILIPS APAZA AGUERO THALES MIRANDA DE ALMEIDA VIEIRA Data: 20 avr. 2018 Afficher le Résumé Many approaches have emerged in the last few years to address the challenges of dynamic hand gesture recognition systems. These systems use mostly computer vision techniques based on RGB images, subject to limitations such as variation of light and color. The emergence of RGB-D sensors allowed to incorporate geometric information into the image, making recognition systems more robust. However, there are still dierent challenges related to the problem of the classication of gestures as invariance to physical aspects and location, choice of characteristics and robust classiers that allow classication in real time. This work proposes an analysis of classiers for dynamic 3D hand gestures applied to geometric descriptors that represent the 3D curve generated by the hand of the user over time. Two approaches were analyzed, Dynamic Time Warping (DTW) and Decision Tree (DT). Two datasets were used to evaluate the results: MSR Action3D and GRUFBA 3D, which was built by the author. The classiers were evaluated to identify the importance of the curve tting steps, and the extraction and selection of the curve characteristics, analyzing their impact on the gesture recognition rate. The obtained results show that this approach is competitive with the other techniques of the state of the art for similar approaches, being invariant to physical aspects and lighting with an average rate of 97:2% and 97:6% with the textit DTW and textit DT, respectively, in one of the analyzed datasets.
5	LUIZ OTÁVIO RAMOS GAVAZA Study on motivation and learning of computer theory students in an approach approached in problems. Leader : LAIS DO NASCIMENTO SALVADOR MEMBRES DE LA BANQUE : LAIS DO NASCIMENTO SALVADOR MARLO VIEIRA DOS SANTOS E SOUZA ROBERTO ALMEIDA BITTENCOURT Data: 25 avr. 2018 Afficher le Résumé The discipline of Computer Theory is one of the bases of the area of computing, for this reason, of fundamental importance in the formation of the students of the area. Its content has a high level of abstraction and requires a great effort from students and educators in the teaching and learning process. Students' perception that they are not understanding the concepts and their lack of motivation are the main challenges in this process. In turn, the use of active learning methodologies such as the Problem Based Learning (PBL) approach has been presented as an alternative to reduce problems related to student motivation and learning. In this sense, the purpose of this dissertation is to describe the implementation of PBL in Computer Theory classes in the Information Systems course at UFBA. To evaluate the results and validate the hypotheses, this work conducted a descriptive survey in the form of an opinion survey, where students were asked to answer questionnaires with profile characterization, Likert scale and open space statements related to the adopted approach. The results of this study suggest that the use of PBL is promising as a methodological alternative to theoretical disciplines of Computing.
6	JOÃO PEDRO DANTAS BITTENCURT DE QUEIROZ MODELING SERVICES FROM BUSINESS PROCESSES: AN APPROACH TO MODELS. Leader : RITA SUZANA PITANGUEIRA MACIEL MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO RITA SUZANA PITANGUEIRA MACIEL LEONARDO GUERREIRO AZEVEDO Data: 15 mai 2018 Afficher le Résumé Organizations are increasingly aware of the importance of mapping their business processes. Business process modeling helps to identify and understand the activities performed by participants of an organization, as well as providing support for specifying system requirements and defining services that support them. In this sense, it is important to define methodologies that facilitate the business process models reuse to identify relevant services for the development of service-oriented applications. Several proposals in the literature follow in this direction, however, there is still no consensus on which it is the best way to use these models to identify the services, because the transference of information between models of different levels of abstraction is an arduous task. One of the problems found in these proposals is that the derivation occurs straight from business level to service level, without considering, for instance, system requirements that support them. Therefore, it generates basic services that requires more information to reach an architectural level that can properly generate code. To find a solution to this problem, this dissertation proposes DERIVA (Service Derivation on Business Process), which is a process based on Model-Driven Development approach, and considers not only business processes, but also system requirements that support them to achieve service derivation. The DERIVA process performs service derivation through a semiautomatic transformations’ chain, where process models are transformed into less abstract models until code generation is accomplished. The process transformation’s chain is heuristic based, which served as source for the mapping. This process was evaluated through a case study and a controlled experiment by developers of different knowledge in Service Oriented Architecture and Model-Driven Development. The results of this study demonstrated that these professionals, from a process model and using DERIVA process, were able to execute the entire process, produce a service architecture and its Java’s code, with parameters and data types, highlighting the proposal viability and efficiency. Service derivation from the business process can be facilitated through model-driven development. In addition, the inclusion of system requirements modeling, as part of the derivation process, results in a more detailed service architecture.
7	FILIPE ADEODATO GARRIDO - Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : BRUNO SANTANA DA SILVA ECIVALDO DE SOUZA MATOS RITA SUZANA PITANGUEIRA MACIEL Data: 24 juin 2018 Afficher le Résumé -
8	ASSIS SANTOS VIEIRA CONCEPT LOCATION BASED ON MICRODOMINES Leader : CLAUDIO NOGUEIRA SANT ANNA MEMBRES DE LA BANQUE : CHRISTINA VON FLACH GARCIA CHAVEZ CLAUDIO NOGUEIRA SANT ANNA ROBERTO ALMEIDA BITTENCOURT Data: 4 juil. 2018 Afficher le Résumé Concept Location is the step of software modification where the developer search for the chunks of code that must be modified. In industrial environments, this step consumes most of the time and effort. In essence, the concept location is a problem of software comprehension. In reading the source code, cognitive models point out that the developers refine their knowledge by elaborating and verifying hypotheses from the tips suggested by the identifiers. Problem - In the literature, Dependence Search (DepS) is one of the most recognized techniques for concept location. After selecting an initial code snippet called seed, DepS suggests inspecting static dependencies, providing only the immediate dependencies of the seed. However, we did a study of the dependency graph of five softwares and found that in some cases the nearest dependencies use terms similar or equal to the terms of the seed. For those who do not know the terms of the seed, the terms similar to or equal to those used by the seed will hardly refine the hypotheses about the concepts implemented. Proposal - In order to refine the DepS, we developed a new technique called Microdomain-based Concept Location (MCL). The objective of the technique is to provide methods with terms different from those used by the seed and that they were at the same time statically related to each other. To achieve this goal, we developed two concepts: Microdomain and Seed Context. A microdomain is a cluster of statically related methods, built on the basis of dominance analysis. When selecting a seed, the MCL presents graphically the seed context, constituted by the microdomain of the seed and the microdomains neighboring the microdomain of the seed. In order to support the use of MCL, we developed a tool called the Context Viewer. Evaluation - In order to collect evidence of the applicability of the technique, we assigned a task of concept location to a developer, who used the Context Viewer. At the end of the evaluation, we conducted an interview. Conclusion - The tips provided by seed context, built by Context Viewer, helped the developer to realize the relevance of a seed, completing the task with the desired code snippets. Despite the evidence collected in our study, further research needs to be conducted in order to measure the similarity between the terms of one microdomain and the difference between the terms of microdomain neighbors.
9	IGOR DAVID BRITO CALDEIRA -- Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : LEOBINO NASCIMENTO SAMPAIO ALIRIO SANTOS DE SA STÊNIO FLÁVIO DE LACERDA FERNANDES Data: 5 juil. 2018 Afficher le Résumé --
10	MIRLEI MOURA DA SILVA GROUPING OF TEMPORARY SERIES USING DECOMPOSITION OF STOCKS AND DETERMINISTICS Leader : RICARDO ARAUJO RIOS MEMBRES DE LA BANQUE : RICARDO ARAUJO RIOS GECYNALDA SOARES DA SILVA GOMES RODRIGO FERNANDES DE MELLO Data: 13 juil. 2018 Afficher le Résumé -As part of the unsupervised machine learning area, time series clustering aims at designing methods to extract patterns from temporal data in order to organize dierent series according to their similarities. According to the literature, most of researches either perform a preprocessing step to convert time series into an attribute-value matrix to be later analyzed by traditional clustering methods, or apply measures specically designed to compute the similarity among time series. Based on such studies, we have noticed two main issues: i) clustering methods do not take into account the stochastic and the deterministic in uences inherent of time series from real-world scenarios; and ii) similarity measures tend to look for recurrent patterns, which may not be available in stochastic time series. In order to overcome such drawbacks, we present a new clustering approach that considers both in uences and a new similarity measure to deal with purely stochastic time series. Experiments provided outstanding results, emphasizing time series are better clustered when their stochastic and deterministic in uences are properly analyzed.
11	ELISEU SILVA TORRES BAMSDN: An SDN / Openflow Framework for Dynamic Resource Allocation Based on Band Allocation Model Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : LEOBINO NASCIMENTO SAMPAIO JOBERTO SERGIO BARBOSA MARTINS KELVIN LOPES DIAS Data: 16 juil. 2018 Afficher le Résumé Present and emerging communication network scenarios such as 5G, cloud, optical networks, MPLS networks and Internet of Things (IoT) are marked by the wide variety and distribution of users, applications and services with communication and quality requirements (SLA , QoE, QoS) heterogeneous. Therefore, it is necessary to find new ways to deal with the dynamics of network traffic, one of the alternatives adopted is the use of band-allocation models (BAM) that have the attributes for distribution and control of network resources through classification types of applications. It has become common to use BAMs together with the SDN paradigm, since this paradigm allows researchers and network administrators to explore the perspective of resource allocation in scenarios where the configuration of equipment owned by different manufacturers is a hindrance, since each device has its own configuration syntax. Thus, the exploitation of the BAM / SDN set can be an economical alternative to the high costs of OPEX and CAPEX of purely MPLS networks. Thus, the present work presents the BAMSDN framework (Allocation Models through Software-Defined Networks) for the dynamic and flexible exploitation of resources of MPLS-based networks through the joint use of BAMs and SDN. The framework explores the programmability and global vision of the control plan network to perform traffic engineering throughout the topology and seeks to match the availability of network resources to the demand of its users, as well as deploy the management plan to the SDN controller to also allow the reconfiguration and bandwidth management of each output port of a switch.
12	FRANCISCO RENATO CAVALCANTE ARAÚJO Collaborative and Distributed Cache as Support for Producer Mobility in Named Data Networks. Leader : LEOBINO NASCIMENTO SAMPAIO MEMBRES DE LA BANQUE : ANTONIO MARINHO PILLA BARCELLOS FABIOLA GONCALVES PEREIRA GREVE LEOBINO NASCIMENTO SAMPAIO Data: 18 juil. 2018 Afficher le Résumé The use of the Internet has undergone major transformations, the number of devices and connections is with the pace of accelerated growth, unplanned facts in the network architecture project. As a result, the popularization of mobile devices is leading to increased demand for mobile Internet and an overload in access networks. Faced with the challenges posed by emerging communications, Named Data Networking (NDN) were proposed to solve the problems faced in the current network, through a content-centric communication model, in which consumers request interests in the network in search of data that are generated by the producers. NDN supports the mobility of consumers through the architecture itself, however, support for the mobility of producers still poses a challenge, due to damage to the network, especially during the period of unavailability of the producer. In this way, this work proposes the SCaN-Mob, a new strategy to support the mobility of producers in NDN. SCaN-Mob uses caching on local network devices to help producers disseminate their data between neighboring nodes, to maintain in the device cache the data from the producers most likely to perform handoff . Thus, a greater diversity of contents in the network is obtained, which increases the chances of responding to the interests that arrive in the absence period of the producer. The main contributions of this work are: i) a cache replacement policy that can act with different data prioritization criteria; and ii) a packet forwarding strategy to control the effect of packet flooding on the network. The proposal was implemented in the ndnSIM simulator and was submitted to a detailed analysis, comparing it with the standard NDN, under different metrics. The results obtained show gains of up to 35.32% in the interest satisfaction rate, even with the producer's unavailability of the producer, and with a reduction of 56.23% in the time to obtain data.
13	MARCO ANTONIO RUIZ RUEDA A Tool for building multi-purpose and multi-pose synthetic data sets Leader : LUCIANO REBOUCAS DE OLIVEIRA MEMBRES DE LA BANQUE : LUCIANO REBOUCAS DE OLIVEIRA ANGELO AMANCIO DUARTE RODRIGO TRIPODI CALUMBY Data: 25 juil. 2018 Afficher le Résumé A tool to generate multi-purpose, synthetic data sets from multiple camera viewpoints and environmental conditions is proposed here. To generate any data set, three steps are required: locate 3D models in the scene, set the discretization parameters, and run the generator. The set of rendered images provide data that can be used for geometric computer vision problems, such as: Depth estimation, camera pose estimation, 3D box estimation, 3D reconstruction, camera calibration, and also pixel-perfect ground truth for scene understanding problems, such as: Semantic segmentation, instance segmentation, object detection, just to cite a few. In this paper, we also survey the most well-known synthetic data sets used in computer vision tasks, pointing out the importance to use rendered images. When compared to similar tools, our generator contains a wide set of features easy to extend, besides allowing for building sets of images in the well-known MSCOCO format, so ready for deep learning works. To the best of our knowledge, the proposed tool is the first one to generate large-scale, multi-pose, synthetic data sets automatically, allowing for training and evaluation of supervised methods for all of the covered features.
14	ALEX SILVA SANTOS Service Degradation with Proportional QoS in Elastic Optical Networks Leader : GUSTAVO BITTENCOURT FIGUEIREDO MEMBRES DE LA BANQUE : GUSTAVO BITTENCOURT FIGUEIREDO ANDRE COSTA DRUMMOND ANDRE CASTELO BRANCO SOARES Data: 27 juil. 2018 Afficher le Résumé Data collected in recent years show that the amount of information traveling in the network core has been increasing at an exponential rate, evidencing the inability of Wavelength-division Multiplexing networks to meet such demand. Elastic Optical Networks represent a new approach to dealing with high data traffic, offering bandwidth closer to that requested by users, providing greater use of available optical resources. In scenarios with high traffic volume, requests can be blocked due to the inability of the network to allocate new resources. One of the strategies used to avoid blocking new requests is service degradation, where certain requests may receive less than requested bandwidth so that they can operate without loss. In the current literature it is possible to perceive that most of the studies approach the degradation of service using a static scenario where all the requests are known in advance, resulting in the absence of online strategies that present greater conformity with real scenarios. This paper proposes an online strategy for service degradation using the proportional QoS model. This strategy aims to reduce the likelihood of blockage caused by lack of optical resources and offer better treatment for different classes of service. In addition, pairs of source-destination nodes were modeled as a queuing system operating on the Generalized Processor Sharing policy with Leaky Bucket admission control in order to quantify the impact of degradation on lightpaths. The results show that the proposed strategy can reduce the probability of blocking and provide network operators with greater control among different classes of degraded services.
15	IVE ANDRESSON DOS SANTOS TOURINHO Flexible Grouping for Points of Interest Recommendation Leader : TATIANE NOGUEIRA RIOS MEMBRES DE LA BANQUE : FREDERICO ARAUJO DURAO HELOISA DE ARRUDA CAMARGO TATIANE NOGUEIRA RIOS Data: 16 août 2018 Afficher le Résumé Given the current scenario of massive production of geo-referenced data produced by users of social networks based on location, this work aims to identify patterns of human mobility and model the phenomena related to human behavior in the geographic space. Based on the premise that the preference profile of a user can be characterized by their mobility behavior inferred through their check-ins history, two techniques of recommendation based on fuzzy grouping are proposed, in order to attest the usefulness of the geographical aspects identified in predictive tasks to recommend content
16	FELIPE GUSTAVO DE SOUZA GOMES A Computational Infrastructure for Technical Debt Identification and Monitoring Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : GLAUCO DE FIGUEIREDO CARNEIRO IVAN DO CARMO MACHADO MANOEL GOMES DE MENDONCA NETO Data: 27 août 2018 Afficher le Résumé In software engineering, the concept of Technical Debt (DT) contextualizes the problem of technical tasks not performed in the present, which may incur costs and adversities in the future. Exposure to debt can bring short-term benefits, such as increasing current productivity and reducing the time to launch a product, but this benefit is obtained at the cost of extra effort in the future to resolve the debts incurred. In this context, it is necessary to develop mechanisms to manage technical debts. Identifying and monitoring DT is an important and necessary step in the debt management process. Unfortunately these tasks are complex because they require the analysis of large amounts of information. Different works have been carried out in the development of approaches to identify DT either automatically or semi-automatically. Most of these works are based on code metrics analysis. However, studies have already pointed out that the use of metrics only for the identification of DT items limits the scope of this activity. It is also necessary to consider the developer's point of view. In this direction, the combination of an approach that simultaneously uses metrics, static code analysis, comment analysis, and information visualization, has been shown to be promising for DT identification and monitoring. This work presents the development and evolution of two tools to materialize this approach. The first, called RepositoryMiner, mines software repositories, calculates metrics, detects code smells, analyzes comments and source code, and combines these different means as indicators of technical debts of software modules. The second, called VisminerTD, uses visual techniques to, based on the calculated indicators, support software engineers in the identification and monitoring of technical debts throughout the life cycle of a software project. To show the usability and effectiveness of the approach, two feasibility studies were conducted showing that the tools can support the management of DT in medium-sized software projects.
17	FELIPPE VIEIRA ZACARIAS Intelligent colocation of HPC workloads for enhancing server efficiency Leader : VINICIUS TAVARES PETRUCCI MEMBRES DE LA BANQUE : VINICIUS TAVARES PETRUCCI MARCOS ENNES BARRETO Rajiv Nishtala Data: 30 août 2018 Afficher le Résumé Resource efficiency is crucial for achieving Exascale performance in high performance computing (HPC) systems. Many HPC applications achieve only a fraction of their theoretical peak performance because it is very hard for developers and runtime systems to extract perfectly the best performance from parallel applications. From this perspective, colocating applications is an attractive technique to increase HPC system utilization. However, an application may adversely affect the behavior of another application when sharing critical resources, such as caches and memory bandwidth, in the same server. We argue that server efficiency can be improved by predicting the expected performance slowdown of colocated applications from measured hardware performance counters. This work makes the following contributions: (1) a machine learning model to predict the performance slowdown of co-located applications and (2) an intelligent scheduling scheme deployed in the Slurm manager to allow sharing applications in a server node with minimal degradation for improved performance.
18	BEATRIZ BRITO DO REGO The influence of metacommunication on permanence or abandonment in MOOC: an investigation under a semiotic perspective Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : ECIVALDO DE SOUZA MATOS ARTUR HENRIQUE KRONBAUER ANDRÉIA LIBÓRIO SAMPAIO Data: 27 sept. 2018 Afficher le Résumé MOOC is an acronym for Massive Open Online Course, an online education modality oering video courses, with dierent ages, backgrounds and nationalities. The MOOCs have emerged as technological innovations that idealize a democratization of education, through a course of free courses. This modality of teaching has as characteristic a population of dierent ages, formations and nationalities. However, MOOCs are subject to a high rate of their students, corresponding to their main criticisms. You can participate at any time, a completion rate for most courses of 13%. Some of the conventions students attend are those related to the technology platform. In this sense, the research had to investigate the quality of semiotic metacommunication inuencing the permanence or the abandonment of education in MOOC courses. The research was carried out in four stages. The rst one consisted in the execution of a literature review, in order to identify the elements of the nature of the genetics that infuenced the permanence and the abandonment in MOOC. With the review vote, twenty-four general causes of abandonment in MOOC were found. The second stage was held a study of documentary events moments: development of a platform of MOOC; evaluation of the quality of the metacommunication of the interaction design of a course in this platform; course implementation; and, nally, to identify and evaluate the ruptures of communicability existing in the case studied. In the design-time evaluation, a MOOCLICAR technique was applied, which was mitigated. The data sheet with the data of the course of analysis and analysis of the course was not carried out. Finally, the Semiotic Inspection (SIM) and Communality Assessment (CEM) methods were used to identify communication disruptions in the MOOC platform. The third phase of the session had an analysis of the collected data, relating the evaluation of the interaction design, the questionnaires, the analysis of forums and the assessment of communicability. The results can be useful for the development of a set of interaction design and evaluation guidelines for the MOOC from a communication point of view.
19	JORGE FERNANDO SILVA PEREIRA FILHO Variability Based Approach to Generate and Disseminate Emergency Public Communications. Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : MANOEL GOMES DE MENDONCA NETO CLAUDIO GUIMARAES CARDOSO KARINA BARRETO VILLELA Data: 28 sept. 2018 Afficher le Résumé An emergency is unpredictable by nature, but it presents patterns that can help communicators to anticipate problems and give better responses to emergency situations. Current solutions for emergency public communication focus on the dissemination of a single message through different communication channels. This strategy goes against good communication practices because different audiences have different information needs. Inappropriate public communication messages create noise and can amplify the perception of risk and insecurity. Our research proposes a novel approach for public communication in emergencies. It maps and models variability in emergency communication processes to ensure the flexibility and adaptability of the communication for different emergency scenarios. Our approach aims to facilitate the rapid instantiation of customised and consistent emergency communications, over different channels, for different audiences. Our research was conducted as part of a large research project named RESCUER, which uses crowdsourcing information to manage emergency situations. To evaluate the communication model, a proof of concept, named RESCUER News, was built and evaluated in simulations involving real public and operational forces, over three distinct emergency scenarios.
20	LUCAS AMPARO BARBOSA Removing facial expressions in 3D images for recognition purposes Leader : MAURICIO PAMPLONA SEGUNDO MEMBRES DE LA BANQUE : ALEXANDRE DA COSTA E SILVA FRANCO LUCIANO REBOUCAS DE OLIVEIRA MAURICIO PAMPLONA SEGUNDO Data: 17 oct. 2018 Afficher le Résumé The research carried out has a neural coder model encoder-decoder of images removed by facial functions in 3D images. This model receives a 3D image of the face with or without the keys as input and generates a neutral face as an output. The real is not a device that can be developed but rather perfect an accuracy of 3D facial recognition systems. For ready to, the standard of an error based on a system information in the user? Experiments using a Bosporus 3D database had a difference in range between images of the same and different modes of interaction between intraclass and interclass differences. They were also shown in our synthetically generated neutral surveys and the results of our recognition methods, thus reaching the original.
21	MARCO ANTONIO PARANHOS SILVA Variability Implementation: An Empirical Study of Stack Overflow Posts Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : RODRIGO ROCHA GOMES E SOUZA EDUARDO SANTANA DE ALMEIDA RODRIGO OLIVEIRA SPINOLA Data: 12 nov. 2018 Afficher le Résumé In this dissertation, an empirical investigation was carried out aiming at understanding how the theme variability implementation is discussed by the community of software developers who use the Stack Overflow website. Initially, a list of 52 words related to this theme was defined based on the literature and the opinion of experts. They were then used as the initial vocabulary for the algorithm that implements the Latent Dirichlet Allocation (LDA) in order to identify the main topics related to variability implementation, as well as to validate the words in the list. Then, with the most used words in the questions and answers, as validated by LDA, searches were performed on the dataset retrieved from the Stack Overflow platform and several studies were carried out to find the best set to be analyzed. The best result was obtained by combining these words two by two, and thus 1962 questions were retrieved. These 1962 questions were then analyzed with the objective of discover the mechanisms of variability implementation more discussed, less discussed and not discussed; what is the delay in responding to the formal questions on variability implementation and how long the discussions last; and finally, what is the profile of Stack Overflow users who asked and answered the questions retrieved, as well as the degree of confidence one can have about the quality of the questions and answers presented. One of the main results of this research was the definition of a list of words that refer to variability implementation concepts and techniques that are most commonly used by the Stack Overflow community. Other one are the research results from analyzing the questions and answers selected, such as: two groups of words were identified as those most used by the Stack Overflow community: one more conceptual and one more related to variability implementation techniques; the average delay time for the first answer to the questions and the duration of the discussion when there is more than one answer; and finally an study on the degree of confidence in the Stack Overflow user group who asked questions or answered questions based on the reputation indicator of community members.
22	LEANDRO MANUEL PEREIRA SANCHES Semantic Annotation of Learning Objects in Portuguese Leader : LAIS DO NASCIMENTO SALVADOR MEMBRES DE LA BANQUE : CLARISSA CASTELLÃ XAVIER DANIELA BARREIRO CLARO MARLO VIEIRA DOS SANTOS E SOUZA Data: 13 nov. 2018 Afficher le Résumé In Brazil, e-learning is rapidly expanding and Learning Objectives (OA) are one of the forms of development for the teaching modality. One of the main advantages of OA is the possibility of reusing digital didactic content on the internet. In the case of re-use, OA must be available and accessible to teachers and students. The availability of an OA can be influenced by factors such as: difficulty in adding metadata to OA; lack of standardization and structuring of metadata; lack of semantic description of objects; the difficulties of adding semantic metadata to OA and a lack of machine understanding of OA content. Semantic Web technologies aim to structure the data present on the Web in a machine-readable way. In the educational context, these technologies allow to annotate semantic metadata, based on ontologies, in Auxiliary Learning Objects in their recovery and reuse. However, a manual annotation of semantic metadata is the author by authors as an expensive, mentally demanding and time consuming task. Since this is a document that seeks to discover and update the information of a manual, this study believes that it is an automated option and a semantic metadata annotation are important themes for a web-based education. Thus, this work proposes a new model for extracting and annotating semi-automatic semantic metadata from Learning Objects in Portuguese. This model uses the metadata Metadata for Learning Objects (LOM) and technologies for the Semantic Web as a language and a resource structure (RDF). The present study was different from other studies that carried out a semantic annotation of OA in other languages and its main differential is an application of a system of information extraction and disambiguation that aimed at the extraction of semantic relations rather than the identification of only entities. This allows us to note both the relationship and the relationships between the entities in the text. For the evaluation of this model, one of the main and most recent methods of disambiguation of performance descriptors was evaluated and evaluated. The annotation prototype was able to extract the textual OA semantics with a maximum accuracy of 37.5%. The annotation prototype was also able to convert the extra relations into semantic LOM metadata in the RDF language.
23	EDILTON LIMA DOS SANTOS SHE: AN AUTOMATED SUPPORT FOR DYNAMIC SOFTWARE PRODUCT LINES ENGINEERING Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : IVAN DO CARMO MACHADO CASSIO VINICIUS SERAFIM PRAZERES EDSON ALVES DE OLIVEIRA JUNIOR Data: 26 nov. 2018 Afficher le Résumé Dynamic Software Product Lines (DSPL) engineering makes it possible to deliver software products capable of adapting to fluctuations in user needs and evolving resource constraints at runtime. Based on an architecture-centric approach, a DSPL is capable to analyze changes in context, thus responding by performing system adaptations. In this work, we propose an architecture model for DSPL engineering based on the MAPE-K model concept, a feasible strategy to gather and analyze contextual information thus enabling product reconfiguration at runtime. We evaluated the proposed architecture model in a Smart Home System, using OSGi and MQTT. A proof of concept of the proposed model was performed and the results demonstrated capability the DSPL to adapt and reconfigure itself according to the new sensors connected to the system. In addition, the framework built for proof of concept was evaluated by industry professionals and students of masters and doctorate. They consider the architecture to be highly modular and which has a high capacity for reuse and adaptability.
24	ANNA LUÍZA DE OLIVEIRA BARBOSA Domain Specific Languages built from Software Product Lines: An Empirical Study on Visual and Textual Approaches Leader : EDUARDO SANTANA DE ALMEIDA MEMBRES DE LA BANQUE : EDUARDO SANTANA DE ALMEIDA RODRIGO ROCHA GOMES E SOUZA ANDRÉ LUIS MEDEIROS SANTOS Data: 26 nov. 2018 Afficher le Résumé Generic or specific solutions can be applied in all branches of science and engineering. A generic approach provides a general solution for many problems in a certain area, but this solution may be suboptimal. A specific approach provides a much better solution for a smaller set of problems. Domain Specific Languages (DSL) are means, not goals, targeted at the final purpose of simplifying the software development process. They can be classified as internal, written in a host language or external, separate languages which bring their own syntax. An external DSL needs a language, compiler or interpreter, and an editor to support the new language. An DSL can be displayed in a visual and/or textual notation. This study proposes to implement a textual and a visual DSL for the Conference Management (CM) and the Smart Home (SH) Domains, using legacy product lines and integrate the DSL development with Software Product Lines Engineering (SPLE). Additionally, we conducted an exploratory study with the system’s original developers aiming at validating our DSL and characterizing barriers and benefits to software development using DSL as well as an empirical study conducted to compare the textual and the visual DSL regarding readability, effectiveness, efficiency and ease of use. The hypothesis test demonstrated better ease of use and efficiency when reporting bugs using the visual notation while the other metrics did not present statistical difference. However, the data analysis using descriptive statistics showed evidence that the visual notation had better performance when reporting bugs, while the textual notation is faster when changing DSL programs. Regarding the participants’ sense of success in completing the activities, the report of bugs using the textual notation had a better outcome than they expected.
25	DENIVAN DO CARMO CAMPOS DA SILVA Combinatorial Interaction Testing Tools for Software Product Lines Engineering Leader : IVAN DO CARMO MACHADO MEMBRES DE LA BANQUE : RODRIGO ROCHA GOMES E SOUZA IVAN DO CARMO MACHADO WESLEY KLEWERTON GUEZ Data: 28 nov. 2018 Afficher le Résumé Testing a system is a routine activity and plays an important role in the software quality assurance process. However, testing highly-configurable systems, such as Software Product Lines (SPL), is a very complex activity due to the presence of variability in its engineering process, which increases the number of product configurations to test. In case a defect affects one (or a subset) of these functionalities, a range of products (and not just one, such as traditional Software Engineering) may be affected. Such complexity also implies a significant increase in the cost of testing. In this sense, one of the major challenges is to reduce this cost and the use of test automation tools could bring a significant contribution to achieve such a reduction. Among the most effective techniques, combinatorial interaction testing (CIT) stands out, as an affordable strategy seeking to optimize the configuration space to test. This study aimed to analyze the effectiveness of the tools that implement the CIT technique considering a set of metrics applied to the context of variability testing. This dissertation introduces the MERCI, a Method to Evaluate Combinatory Interaction Testing techniques. MERCI is aimed to evaluate the suitability of existing CIT tools, largely employed in single-system testing, for SPL engineering. In this work, we considered three measures: defect detection, test coverage and test execution length. We performed an empirical evaluation to compare four CIT tools: ACTS, CATS, PICTMaster and VPTag. The results show that the method can serve as a good indicator to evaluate how CIT tools could behave in an SPL testing scenario.
26	LEANDRO SOUZA DE OLIVEIRA DptOIE: a method for Open Information Extraction in the Portuguese language based on Dependency Analysis Leader : DANIELA BARREIRO CLARO MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO MARLO VIEIRA DOS SANTOS E SOUZA RENATA VIEIRA Data: 7 déc. 2018 Afficher le Résumé It is estimated that 80% of the information on the Web is stored and are textual that can add value to various businesses. However, most of the index is heterogeneous. In this context, all the information that comes manually daily is a slow and unfeasible task. For this reason, as research in the area of Information Extraction (EI) has advanced in recent years. The EI has a function of generating the data in a process by means of Relation Extraction (ER) techniques. A relationship can connect a couple of entities. The normal ones are the words related to the syntax and can be of the type: "born in", "is a", "ran", and so on. EI's arguments focus on small, homogeneous, and early commands. However, this type of approach is not scalable for larger databases, that is, new rules for hiring new ways to become incremental documents or odomains to change. Thus, a new EI approach emerged, an Open Information Extraction (EIA). It allows you to extract facts, usually in detriplas form, without the need to pre-specify relationships, being more scalable and domain independent. The EIA can be applied in the area of business intelligence, opinion mining, question and answer systems, among others. Initially, most EIA methods were developed for the English language. However, other languages, such as Portuguese, also deserve prominence, since it comprises approximately 2.5% of all content available on websites. The EIA systems that have presented one of the best performances in English have used dependency analysis and are based on rules, which are used to identify a set of information in the sentences to compose the facts, such as: subjects, objects, adverbs, complements , etc. However, in the Portuguese language, the same approach has not shown the same effect, since the established rules are often generic and do not cover specific aspects of the language. For this reason, in this work the DptOIE is proposed, which uses new rules and aims to explore sentences through dependency analysis and an in-depth search. In addition, modules have also been developed to perform particular treatments and derive new facts from sentences that have coordinated conjunctions, subordinate clauses and wording. The DptOIE has been compared to three other EIA methods that have stood out in the Portuguese language, which are: PragmaticOIE, DependentIE and ArgOE. The results confirmed that the DptOIE obtained area under the curve of the greater precision-yield and was superior in the quantity of coherent facts extracted, besides maintaining high precision.
27	NILSON RODRIGUES SOUSA M2-FoT: Monitoring and Managing Mist Platforms of Things Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : CASSIO VINICIUS SERAFIM PRAZERES IVAN DO CARMO MACHADO CELSO ALBERTO SAIBEL SANTOS Data: 10 déc. 2018 Afficher le Résumé The Internet of Things (IoT) is a paradigm in which physical objects embedded with sensors and actuators are connected to the Internet, enabling the creation of solutions for various areas of human life. As the amount of things connected to the Internet tends to increase more and more, new ways of managing and monitoring these things are needed. In this context, this work aims to propose and develop a method of management and monitoring of the elements that make up an IoT environment. The proposed method covers activities of monitoring and managing the configuration and states of the components of the IoT environment. In order to achieve this objective, this work uses the ARM-IoT (Architectural Reference Model for the IoT) reference model, which enables the listing of common elements that are part of an IoT environment and require management and monitoring. The approach developed is based on the Simple Network Management Protocol (SNMP), which uses a manager / agent model, where a manager concentrates the most complex management actions and the agent is responsible for collecting information about the infrastructure. Thus, this master's work proposes M2-FoT: a method, based on ARM-IoT and SNMP, that uses a management approach in which a server (acting as an SNMP manager) concentrates the most robust actions and delegates actions to the infrastructure through agents that are hosted on IoT gateways (acting as SNMP agents). The proposed method was implemented and validated using the SOFT-IoT (Self-Organizing Fot of Things for the IoT) platform, which implements several of the concepts described in ARM-IoT. Given this context, this work presents the M2-FoT that is composed by IoTManagerServer and IoTManagerClient, similar to the manager model and SNMP protocol agent. This allows the management and monitoring of IoT environments. The implementation of M2-FoT was tested within the SOFT-IoT platform and the evaluated aspects were: hardware consumption by the devices; response efficiency of M2-FoT in the recovery of the infrastructure configuration; and the accuracy of the information stored by M2-FoT.
28	MARCELO AIRES VIEIRA Interoperability between heterogeneous clouds across clouds Leader : DANIELA BARREIRO CLARO MEMBRES DE LA BANQUE : DANIELA BARREIRO CLARO VANINHA VIEIRA DOS SANTOS RONALDO DOS SANTOS MELLO Data: 11 déc. 2018 Afficher le Résumé With the media database, many organizations have begun providing and delivering data through services such as Data as a Service (DaaS) and Database as a Service (DBaaS). Different data sources for certain applications, such as Relational, NoSQL and NewSQL, or not yet structured (plain text), semi-structured (XML, JSON, CSV) and structured (RDBMS). Inherent heterogeneity of DaaS and DBaaS makes it difficult to merge services and unified use by an application. Thus, the present work is a method that aims to interoperate the heterogeneous DaAS and DBaaS data through the data junctions. The method called MiDIJo (Middleware for Data Interoperability via Join) allows a unified model to interoperate between different models. MiDIJo recognizes the query, the required join type analysis, and lists the join attributes. MiDIJo was formally discontinued through canonical models and incorporated into MIDAS middleware in order to validate its performance and effect. A use case and three experiments were performed with a MiDIJo validation purpose: (i) the first experiment selects and obtains data in different DaaS and DBaaS; (ii) the second associates data between different DaaS and DBaaS (junction); Finally, (iii) the third considers different types of DBaaS (Relational, NoSQL and NewSQL) with two joins. The use case and the experiments allowed the viability, functionality, interoperability and performance of the MiDIJo method. The results are demonstrated through a methodology that allows selecting and aggregating data in the clouds for the complexity of the interoperability and junction problem.
29	FABIO CORREIA DE REZENDE The Computational Reasoning in the development of English Language Skills: pedagogical practices with BYOD Leader : ECIVALDO DE SOUZA MATOS MEMBRES DE LA BANQUE : DEBORA ABDALLA SANTOS ECIVALDO DE SOUZA MATOS TACIANA PONTUAL DA ROCHA FALCAO Data: 13 déc. 2018 Afficher le Résumé With the aim of contributing to the discussions and advances in the researches on the Brazilian public education, this research had as object of study the computational reasoning. The research sought to understand and discuss whether computational reasoning favors the development of disciplinary competences in English language teaching through pedagogical actions through the Bring Your Own Device (BYOD) approach.
30	CARLOS ALEXANDRE NASCIMENTO Integration between Computer Science and Portuguese Language through Computational Thinking Skills Leader : DEBORA ABDALLA SANTOS MEMBRES DE LA BANQUE : DEBORA ABDALLA SANTOS ECIVALDO DE SOUZA MATOS Ismar Frango Silveira Data: 13 déc. 2018 Afficher le Résumé Discussions about the insertion of Computer Science in basic education and how it can contribute in the processes of teaching learning that involve the use of digital information and communication technologies are in increasing ascension in researches in the area of Informatics and Education. The present work aimed to investigate the possible contributions of Computational Thinking to the teaching and learning of Portuguese Language in Basic Education. For this, interventions were made in the teaching of contents of Portuguese Language through didactic sequences that contemplated the association of the discipline to Computer Science. The data collected, through focus groups, interviews and journals were analyzed qualitatively based on the literature. The results showed that the integration between the Computational Thinking skills and the contents explored during the Portuguese Language classes is feasible and that this integration contributes to the teaching and learning of this discipline, as well as to promote Computer Science in Basic Education.
31	PAULO ROBERTO MAIA SIMÕES JUNIOR USING AGGREGATION AND SUMMARIZATION PROCESSES TO ENHANCE CROWDSOURCE BASED EMERGENCY MANAGEMENT SYSTEMS Leader : MANOEL GOMES DE MENDONCA NETO MEMBRES DE LA BANQUE : MANOEL GOMES DE MENDONCA NETO DANILO BARBOSA COIMBRA FLAVIO EDUARDO AOKI HORITA Data: 14 déc. 2018 Afficher le Résumé Emergencies may bring great loss for human beings and for the environment. Whether they are results of human actions or forces of nature, these situations ask for a swift response to minimize their adverse consequences. People that witness an emergency are a valuable source of information to those responsible for making decision on how to respond to it. Through crowdsourcing, command centers can obtain multiple types of information from witness at the emergency site. By using smartphones, for example, these people can send information directly (e.g. texts, videos, images) or indirectly (e.g. GPS), that may help in the identification of important situations at the emergency site. However, the gathered data must be processed and summarized, as the emergency managers may not be able to analyze all the obtained information, which gives space to loss of important information. Researchers have been studying and proposing information visualization tools that may help the emergency managers to understand the high volume of information gathered through crowdsourcing. Some of those tools had been used, and proven useful, in real life large scale emergencies. However, a recent systematic mapping study shows that most researchers use similar techniques, focused on maps and icons, to visualize emergency information in a high level (e.g. affected area, type of emergency). Emergency managers usually have to filter and search the available data to obtain more information about the situation at hand. This work presents a way to aggregate emergency data, gathered through crowdsourcing, and to summarize it using information visualization focused on presentation of emergency details. The presented visualizations, should support the emergency managers decision making process, through visual paradigms that facilitate a swift understanding of the situation at the emergency site. This research was within the context of a collaboration project between Brazil and European Union and had the support of emergency management specialists from both places.
32	FELIPE OLIVEIRA DOS SANTOS A multiagent epistemic logic that integrates classical knowledge and constructive knowledge. Leader : STEFFEN LEWITZKA MEMBRES DE LA BANQUE : CIRO RUSSO LAIS DO NASCIMENTO SALVADOR STEFFEN LEWITZKA Data: 14 déc. 2018 Afficher le Résumé EpistEpistemic Logic is a branch of logic that deals with notions of knowledge and belief. Through its study, it is possible to formalize and make inferences about the knowledge of one or more agents. Due to the recent applications in Computer Science, the epistemic logic has been gaining new contributions and with them new problems and challenges arise. In this work we will present a new epistemic multiagent logic based on the EL5 logic originally introduced by Lewitzka (2017) as a modal system capable of representing both properties of classical knowledge as well as characteristics of an intuitionist (constructive) knowledge that conforms to the BHK interpretation of intuitionist logic. This connection with constructive reasoning is interesting from the point of view of computer science. The logic presented in this paper generalizes and extends EL5 to a multiagent logic with operators for common knowledge of agent groups. A certain concept of common knowledge is adopted and modeled by an algebraic semantics (based on Heyting algebras) rather than the commonly used Kripke semantics, thus giving a natural and intuitive approach. Finally, we prove the correctness and completeness of this new multiagent logic with respect to the proposed algebraic semantics.emic Logic is a branch of logic that deals with notions of knowledge and belief. Through its study, it is possible to formalize and make inferences about the knowledge of one or more agents. Due to the recent applications in Computer Science, the epistemic logic has been gaining new contributions and with them new problems and challenges arise. In this work we will present a new epistemic multiagent logic based on the EL5 logic originally introduced by Lewitzka (2017) as a modal system capable of representing both properties of classical knowledge as well as characteristics of an intuitionist (constructive) knowledge that conforms to the BHK interpretation of intuitionist logic. This connection with constructive reasoning is interesting from the point of view of computer science. The logic presented in this paper generalizes and extends EL5 to a multiagent logic with operators for common knowledge of agent groups. A certain concept of common knowledge is adopted and modeled by an algebraic semantics (based on Heyting algebras) rather than the commonly used Kripke semantics, thus giving a natural and intuitive approach. Finally, we prove the correctness and completeness of this new multiagent logic with respect to the proposed algebraic semantics.
33	VANA HILMA VELOSO CARVALHO ANALYSIS OF THE ASPECTS OF ACCEPTANCE AND USE OF THE INSTITUTIONAL REPOSITORY OF THE FEDERAL UNIVERSITY OF BAHIA (RI-UFBA) BASED ON THE UTAUT MODEL. Leader : DEBORA ABDALLA SANTOS MEMBRES DE LA BANQUE : DEBORA ABDALLA SANTOS LAIS DO NASCIMENTO SALVADOR FLAVIA GOULART MOTA GARCIA ROSA Data: 18 déc. 2018 Afficher le Résumé Institutional Repositories are increasing references for scientific communication with the purpose of storing, disseminating and preserving the academic and scientific production of an institution. These web information systems are based on the movements of open technologies, for demanding that the files deposited in the repository are provided in open access and free access licenses, with the purpose of disseminating their content to society in a collaborative and democratic manner. This research brings the analysis of the factors that influence the acceptance and use by the academic and scientific community of UFBA to its institutional repository. The Unified Theory of Acceptance and Use of Technology (UTAUT) information model was used to collect data, as well as the Think Aloud method, a technique related to the usability of the system, to observe the user behavior in the use of IR -UFBA. In the analysis of the results obtained through the UTAUT model, the Generalized Linear Model (MLG) was used, statistical technique of data analysis. The research response is presented through the user-identified comments in the Think Aloud method, and especially in the statistical indicators regarding the acceptance and use of the technology that allow to offer subsidies to improve the studied system.
34	DANIEL LUIS MOREIRA TIMPONI Characterization of the development of multiplatform game libraries Leader : RODRIGO ROCHA GOMES E SOUZA MEMBRES DE LA BANQUE : IVAN DO CARMO MACHADO RODRIGO ROCHA GOMES E SOUZA SANDRO SANTOS ANDRADE Data: 18 déc. 2018 Afficher le Résumé "As new operating systems and hardware devices emerge and become popular, game developers are looking for libraries to help them build multiplatform games for the purpose of streamlining the development process and getting a product compatible with the most popular platforms In order to characterize the development of multiplatform game libraries, we have analyzed the log of commits of five libraries of free games written in C / C ++ to extract information about changes in the source code and the contribution of the development team We found that developers rarely change platform-independent and platform-specific code together in the same commit The vast majority of changes occur on a timely basis in the standalone code or in the specific code of each platform. In addition, we found a growth in the ratio of generalist developers in 3 of 5 of the analyzed libraries. Our findings can help developers and project managers improve the development and maintenance of cross-platform systems by providing empirical evidence on source code changes and developer knowledge of platforms over time. "
35	PEDRO OLIVEIRA RAIMUNDO Low-cost 3D Reconstruction of Cultural Heritage Leader : KARL PHILIPS APAZA AGUERO MEMBRES DE LA BANQUE : KARL PHILIPS APAZA AGUERO ANTONIO LOPES APOLINARIO JUNIOR ANTONIO CARLOS DOS SANTOS SOUZA Data: 19 déc. 2018 Afficher le Résumé The 3D reconstruction process follows a well-defined pipeline to generate a model that represents the geometry and appearance of an object captured using 3D scanners or other acquisition devices. The main steps of the commonplace 3D reconstruction pipeline are: a) acquisition of depth and color images of the object, b) alignment of the views acquired from different angles, c) integration of the acquired information into a single model, and d) synthesis and visualization of a textured 3D model. The models generated by following these steps are used in many areas where it is necessary to reconstruct real objects with nontrivial geometry such as engineering, architecture and museology. The present work focuses on the usage of low-cost 3D reconstruction to preserve cultural heritage artifacts, this is important not only for preservation purposes, but also as a means to study and transmit the values and traditions of the communities. In this context, we surveyed the literature for limitations of low-cost 3D reconstruction pipelines, and subsequently proposed pipeline specializations to leverage the characteristics of low-cost 3D acquisition devices and circumvent their limitations. The resulting reconstruction pipeline improved the quality of the data acquired using low-cost 3D scanners, streamlined some of the remaining following phases of the reconstruction process, and employed realism techniques to improve the appearance of the reconstructions. Several heritage artifacts from the Federal University of Bahia Museum of Archaeology and Ethnology were reconstructed using proposed approach, which yielded good results in terms of visual appearance. Thus, we conclude that low-cost 3D preservation of heritage is viable given a proper combination of pipeline specializations and realism techniques.

Thèses

NAILTON VIEIRA DE ANDRADE JUNIOR

A Platform for Discovery, Deployment and Self-Configuring Services on the Web of Things

Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
CASSIO VINICIUS SERAFIM PRAZERES
FREDERICO ARAUJO DURAO
ARTUR HENRIQUE KRONBAUER
Data: 12 janv. 2018

Afficher le Résumé
The evolution of embedded computing has enabled the emergence of devices that connect to the Internet, fostering the Internet of Things. In the same direction, the Web of Things proposes to make these devices available as resources directed to the development of applications, using the protocols and the Web standards. The variety of devices that can connect to the Web of Things demands both the effort of implementation of specific services for access to each device, as methods that allow its use by humans and machines. In this context, this work proposes the automatic configuration and deployment of devices that appear as resources in the Web of Things, through models that map their functionalities. Existing solutions expose RESTFul Web Services built into physical devices, making the strategy non-agnostic over the application protocol and compromising service maintainability. The solution proposed in this work relates the existing Web standards and technologies for the development of a free and integrated platform, with the following functionalities: the dynamic discovery of the devices when connecting to a local network, using discovery protocol of network service; the automatic generation of RESTFul Web Services for deployment of devices in the Web of Things; and an approach to practical implementation of semantic description in physical devices, in order to provide the publication of semantically enriched devices in the context of the Web of Things. Finally, network-level assessments are carried out, from application service of the developed platform.

GEORGE CAIQUE GOUVEIA BARBOSA

Using Generic Linguistic Features for Classification of Relational Triples in Portuguese, English and Spanish

Leader : DANIELA BARREIRO CLARO
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
MARLO VIEIRA DOS SANTOS E SOUZA
VLÁDIA CÉLIA MONTEIRO PINHEIRO
Data: 6 févr. 2018

Afficher le Résumé
The increase in the amount of text generated daily on the web has made it more difficult the analysis and extra information of these data. In addition, the number of forms with that people can express themselves using natural language increases the complexity the development of computational tools to extract useful information from texts. Extra Information (EI) has the role of automating the transformation of corpus textual information in structured information. Extra informations of texts are structured in different ways. The relational triple of the type (subject-predicate-object) and structure most used in IE. The EI task can be classified as traditional and open. THE The traditional one makes extra cations from a limited set of predicates. Haha In addition to Open Information (EIA), the relationships are not limited by a set of therefore, one of the objectives of this task and the independence of the domain. The EIA can be divided into two stages: (i) extra and (ii) classi cation. M etodos of EIA present in the literature use spe- cific language functions of the target language in their stages of classi - cation. Language resources for English are more the other languages, thereby increasing the OIE's languages. Most of these methods are based on English. Therefore, the work, and in the classi cation stage, use a set of multi-language features to classi cation of relational triples, contributing to language independence in EIA. Experiments were performed to compare the multi-language features with the proposals by ReVerb. The results obtained showed that, at the 5% level of significance, the multi-language features presented superior performance for English (p <0.01). After the to translate the part of the ReVerb features, both sets achieved similar in Spanish (p = 0.76) and Portuguese (p = 0.07)..

MADSON RODRIGUES DE ARAUJO
Predictive-based DBA algorithms to enable LR-PONs to support fronthaul traffic in C-RAN deployments
Leader : GUSTAVO BITTENCOURT FIGUEIREDO
MEMBRES DE LA BANQUE :
GUSTAVO BITTENCOURT FIGUEIREDO
DIVANILSON RODRIGO DE SOUZA CAMPELO
ANDRE COSTA DRUMMOND
Data: 20 avr. 2018

Afficher le Résumé
Mobile devices data traffic has increased continuously in the last years and the growth prediction is about 50% per year until 2021. This growth in data demand has created many problems of cost and revenue to network operators to maintain and enlarge these networks. Literature has approached the Cloud Radio Access Networks (C-RAN) as an architecture of great potential to network operators being able to offer high data rates while reducing capital expenditure (CAPEX) and operational expenditures (OPEX). C- RAN main idea is to centralize the baseband processing of radio base stations in the cloud. This way C-RAN can generate CAPEX and OPEX economies, by deploying simpler base stations, making devices upgrade easier because most of the equipments are in one central place, generating better traffic balancing and reducing idleness costs by turning equipment on/off. However, its necessary to forward the base station traffic, which will be processed in the cloud, by a fronthaul network of low latency and jitter and high capacity. These requirements are due to the restrict performance requirements of CPRI and LTE-A protocols. This work focuses on the challenges related to the fronthaul deployment in a C-RAN based LTE-A architecture, more specifically in utilize a long reach network as fronthaul in this kind of C-RAN. We propose the Long Reach Passive Optical Networks (LR-PON) as the solution to fronthaul because of its capacity to offer high data rates (40 Gbps) and a lower CAPEX and OPEX compared to other optical network technologies. Furthermore, we introduce two prediction-based dynamic bandwidth allocation (DBA), namely Predictive-DBA (PD-DBA) and Mobile Predictive-DBA (MPD-DBA), which are able to achieve lower delay compared to their similar algorithms in literature. Moreover, The MPD-DBA, provides average delays under the fronthaul delay limit in 100 km reach and CPRI 1 up to 4 scenarios.

ALAN DOS SANTOS SOARES
The analysis of classifiers for 3D dynamic hand gesture recognition based on geometric descriptors
Leader : ANTONIO LOPES APOLINARIO JUNIOR
MEMBRES DE LA BANQUE :
ANTONIO LOPES APOLINARIO JUNIOR
KARL PHILIPS APAZA AGUERO
THALES MIRANDA DE ALMEIDA VIEIRA
Data: 20 avr. 2018

Afficher le Résumé
Many approaches have emerged in the last few years to address the challenges of dynamic
hand gesture recognition systems. These systems use mostly computer vision techniques
based on RGB images, subject to limitations such as variation of light and color. The
emergence of RGB-D sensors allowed to incorporate geometric information into the image,
making recognition systems more robust. However, there are still dierent challenges
related to the problem of the classication of gestures as invariance to physical aspects
and location, choice of characteristics and robust classiers that allow classication in
real time. This work proposes an analysis of classiers for dynamic 3D hand gestures
applied to geometric descriptors that represent the 3D curve generated by the hand of
the user over time. Two approaches were analyzed, Dynamic Time Warping (DTW) and
Decision Tree (DT). Two datasets were used to evaluate the results: MSR Action3D and
GRUFBA 3D, which was built by the author. The classiers were evaluated to identify
the importance of the curve tting steps, and the extraction and selection of the curve
characteristics, analyzing their impact on the gesture recognition rate. The obtained
results show that this approach is competitive with the other techniques of the state of
the art for similar approaches, being invariant to physical aspects and lighting with an
average rate of 97:2% and 97:6% with the textit DTW and textit DT, respectively, in
one of the analyzed datasets.

LUIZ OTÁVIO RAMOS GAVAZA
Study on motivation and learning of computer theory students in an approach approached in problems.
Leader : LAIS DO NASCIMENTO SALVADOR
MEMBRES DE LA BANQUE :
LAIS DO NASCIMENTO SALVADOR
MARLO VIEIRA DOS SANTOS E SOUZA
ROBERTO ALMEIDA BITTENCOURT
Data: 25 avr. 2018

Afficher le Résumé
The discipline of Computer Theory is one of the bases of the area of computing, for this reason, of fundamental importance in the formation of the students of the area. Its content has a high level of abstraction and requires a great effort from students and educators in the teaching and learning process. Students' perception that they are not understanding the concepts and their lack of motivation are the main challenges in this process. In turn, the use of active learning methodologies such as the Problem Based Learning (PBL) approach has been presented as an alternative to reduce problems related to student motivation and learning. In this sense, the purpose of this dissertation is to describe the implementation of PBL in Computer Theory classes in the Information Systems course at UFBA. To evaluate the results and validate the hypotheses, this work conducted a descriptive survey in the form of an opinion survey, where students were asked to answer questionnaires with profile characterization, Likert scale and open space statements related to the adopted approach. The results of this study suggest that the use of PBL is promising as a methodological alternative to theoretical disciplines of Computing.

JOÃO PEDRO DANTAS BITTENCURT DE QUEIROZ
MODELING SERVICES FROM BUSINESS PROCESSES: AN APPROACH TO MODELS.
Leader : RITA SUZANA PITANGUEIRA MACIEL
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
RITA SUZANA PITANGUEIRA MACIEL
LEONARDO GUERREIRO AZEVEDO
Data: 15 mai 2018

Afficher le Résumé
Organizations are increasingly aware of the importance of mapping their business processes. Business process modeling helps to identify and understand the activities performed by participants of an organization, as well as providing support for specifying system requirements and defining services that support them. In this sense, it is important to define methodologies that facilitate the business process models reuse to identify relevant services for the development of service-oriented applications. Several proposals in the literature follow in this direction, however, there is still no consensus on which it is the best way to use these models to identify the services, because the transference of information between models of different levels of abstraction is an arduous task. One of the problems found in these proposals is that the derivation occurs straight from business level to service level, without considering, for instance, system requirements that support them. Therefore, it generates basic services that requires more information to reach an architectural level that can properly generate code. To find a solution to this problem, this dissertation proposes DERIVA (Service Derivation on Business Process), which is a process based on Model-Driven Development approach, and considers not only business processes, but also system requirements that support them to achieve service derivation. The DERIVA process performs service derivation through a semiautomatic transformations’ chain, where process models are transformed into less abstract models until code generation is accomplished. The process transformation’s chain is heuristic based, which served as source for the mapping. This process was evaluated through a case study and a controlled experiment by developers of different knowledge in Service Oriented Architecture and Model-Driven Development. The results of this study demonstrated that these professionals, from a process model and using DERIVA process, were able to execute the entire process, produce a service architecture and its Java’s code, with parameters and data types, highlighting the proposal viability and efficiency. Service derivation from the business process can be facilitated through model-driven development. In addition, the inclusion of system requirements modeling, as part of the derivation process, results in a more detailed service architecture.

FILIPE ADEODATO GARRIDO
-
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
BRUNO SANTANA DA SILVA
ECIVALDO DE SOUZA MATOS
RITA SUZANA PITANGUEIRA MACIEL
Data: 24 juin 2018

Afficher le Résumé
-

ASSIS SANTOS VIEIRA
CONCEPT LOCATION BASED ON MICRODOMINES
Leader : CLAUDIO NOGUEIRA SANT ANNA
MEMBRES DE LA BANQUE :
CHRISTINA VON FLACH GARCIA CHAVEZ
CLAUDIO NOGUEIRA SANT ANNA
ROBERTO ALMEIDA BITTENCOURT
Data: 4 juil. 2018

Afficher le Résumé
Concept Location is the step of software modification where the developer
search for the chunks of code that must be modified. In industrial environments, this
step consumes most of the time and effort. In essence, the concept location is a problem
of software comprehension. In reading the source code, cognitive models point out that
the developers refine their knowledge by elaborating and verifying hypotheses from the
tips suggested by the identifiers. Problem - In the literature, Dependence Search (DepS)
is one of the most recognized techniques for concept location. After selecting an initial
code snippet called seed, DepS suggests inspecting static dependencies, providing only
the immediate dependencies of the seed. However, we did a study of the dependency
graph of five softwares and found that in some cases the nearest dependencies use terms
similar or equal to the terms of the seed. For those who do not know the terms of the seed,
the terms similar to or equal to those used by the seed will hardly refine the hypotheses
about the concepts implemented. Proposal - In order to refine the DepS, we developed
a new technique called Microdomain-based Concept Location (MCL). The objective of
the technique is to provide methods with terms different from those used by the seed and
that they were at the same time statically related to each other. To achieve this goal,
we developed two concepts: Microdomain and Seed Context. A microdomain is a cluster
of statically related methods, built on the basis of dominance analysis. When selecting
a seed, the MCL presents graphically the seed context, constituted by the microdomain
of the seed and the microdomains neighboring the microdomain of the seed. In order to
support the use of MCL, we developed a tool called the Context Viewer. Evaluation - In
order to collect evidence of the applicability of the technique, we assigned a task of concept
location to a developer, who used the Context Viewer. At the end of the evaluation, we
conducted an interview. Conclusion - The tips provided by seed context, built by
Context Viewer, helped the developer to realize the relevance of a seed, completing the
task with the desired code snippets. Despite the evidence collected in our study, further
research needs to be conducted in order to measure the similarity between the terms of
one microdomain and the difference between the terms of microdomain neighbors.

IGOR DAVID BRITO CALDEIRA
--
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
LEOBINO NASCIMENTO SAMPAIO
ALIRIO SANTOS DE SA
STÊNIO FLÁVIO DE LACERDA FERNANDES
Data: 5 juil. 2018

Afficher le Résumé
--

MIRLEI MOURA DA SILVA
GROUPING OF TEMPORARY SERIES USING DECOMPOSITION OF STOCKS AND DETERMINISTICS
Leader : RICARDO ARAUJO RIOS
MEMBRES DE LA BANQUE :
RICARDO ARAUJO RIOS
GECYNALDA SOARES DA SILVA GOMES
RODRIGO FERNANDES DE MELLO
Data: 13 juil. 2018

Afficher le Résumé
-As part of the unsupervised machine learning area, time series clustering aims at designing methods to extract patterns from temporal data in order to organize dierent series according to their similarities. According to the literature, most of researches either perform a preprocessing step to convert time series into an attribute-value matrix to be later analyzed by traditional clustering methods, or apply measures specically designed to compute the similarity among time series. Based on such studies, we have noticed two main issues: i) clustering methods do not take into account the stochastic and the deterministic in uences inherent of time series from real-world scenarios; and ii) similarity measures tend to look for recurrent patterns, which may not be available in stochastic time series. In order to overcome such drawbacks, we present a new clustering approach that considers both in uences and a new similarity measure to deal with purely stochastic time series. Experiments provided outstanding results, emphasizing time series are better clustered when their stochastic and deterministic in uences are properly analyzed.

ELISEU SILVA TORRES
BAMSDN: An SDN / Openflow Framework for Dynamic Resource Allocation Based on Band Allocation Model
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
LEOBINO NASCIMENTO SAMPAIO
JOBERTO SERGIO BARBOSA MARTINS
KELVIN LOPES DIAS
Data: 16 juil. 2018

Afficher le Résumé
Present and emerging communication network scenarios such as 5G, cloud, optical networks, MPLS networks and Internet of Things (IoT) are marked by the wide variety and distribution of users, applications and services with communication and quality requirements (SLA , QoE, QoS) heterogeneous. Therefore, it is necessary to find new ways to deal with the dynamics of network traffic, one of the alternatives adopted is the use of band-allocation models (BAM) that have the attributes for distribution and control of network resources through classification types of applications. It has become common to use BAMs together with the SDN paradigm, since this paradigm allows researchers and network administrators to explore the perspective of resource allocation in scenarios where the configuration of equipment owned by different manufacturers is a hindrance, since each device has its own configuration syntax. Thus, the exploitation of the BAM / SDN set can be an economical alternative to the high costs of OPEX and CAPEX of purely MPLS networks. Thus, the present work presents the BAMSDN framework (Allocation Models through Software-Defined Networks) for the dynamic and flexible exploitation of resources of MPLS-based networks through the joint use of BAMs and SDN. The framework explores the programmability and global vision of the control plan network to perform traffic engineering throughout the topology and seeks to match the availability of network resources to the demand of its users, as well as deploy the management plan to the SDN controller to also allow the reconfiguration and bandwidth management of each output port of a switch.

FRANCISCO RENATO CAVALCANTE ARAÚJO
Collaborative and Distributed Cache as Support for Producer Mobility in Named Data Networks.
Leader : LEOBINO NASCIMENTO SAMPAIO
MEMBRES DE LA BANQUE :
ANTONIO MARINHO PILLA BARCELLOS
FABIOLA GONCALVES PEREIRA GREVE
LEOBINO NASCIMENTO SAMPAIO
Data: 18 juil. 2018

Afficher le Résumé
The use of the Internet has undergone major transformations, the number of devices and connections is with the pace of accelerated growth, unplanned facts in the network architecture project. As a result, the popularization of mobile devices is leading to increased demand for mobile Internet and an overload in access networks. Faced with the challenges posed by emerging communications, Named Data Networking (NDN) were proposed to solve the problems faced in the current network, through a content-centric communication model, in which consumers request interests in the network in search of data that are generated by the producers. NDN supports the mobility of consumers through the architecture itself, however, support for the mobility of producers still poses a challenge, due to damage to the network, especially during the period of unavailability of the producer. In this way, this work proposes the SCaN-Mob, a new strategy to support the mobility of producers in NDN. SCaN-Mob uses caching on local network devices to help producers disseminate their data between neighboring nodes, to maintain in the device cache the data from the producers most likely to perform handoff . Thus, a greater diversity of contents in the network is obtained, which increases the chances of responding to the interests that arrive in the absence period of the producer. The main contributions of this work are: i) a cache replacement policy that can act with different data prioritization criteria; and ii) a packet forwarding strategy to control the effect of packet flooding on the network. The proposal was implemented in the ndnSIM simulator and was submitted to a detailed analysis, comparing it with the standard NDN, under different metrics. The results obtained show gains of up to 35.32% in the interest satisfaction rate, even with the producer's unavailability of the producer, and with a reduction of 56.23% in the time to obtain data.

MARCO ANTONIO RUIZ RUEDA
A Tool for building multi-purpose and multi-pose synthetic data sets
Leader : LUCIANO REBOUCAS DE OLIVEIRA
MEMBRES DE LA BANQUE :
LUCIANO REBOUCAS DE OLIVEIRA
ANGELO AMANCIO DUARTE
RODRIGO TRIPODI CALUMBY
Data: 25 juil. 2018

Afficher le Résumé
A tool to generate multi-purpose, synthetic data sets from multiple camera viewpoints and environmental conditions is proposed here. To generate any data set, three steps are required: locate 3D models in the scene, set the discretization parameters, and run the generator. The set of rendered images provide data that can be used for geometric computer vision problems, such as: Depth estimation, camera pose estimation, 3D box estimation, 3D reconstruction, camera calibration, and also pixel-perfect ground truth for scene understanding problems, such as: Semantic segmentation, instance segmentation, object detection, just to cite a few. In this paper, we also survey the most well-known synthetic data sets used in computer vision tasks, pointing out the importance to use rendered images. When compared to similar tools, our generator contains a wide set of features easy to extend, besides allowing for building sets of images in the well-known MSCOCO format, so ready for deep learning works. To the best of our knowledge, the proposed tool is the first one to generate large-scale, multi-pose, synthetic data sets automatically, allowing for training and evaluation of supervised methods for all of the covered features.

ALEX SILVA SANTOS

Service Degradation with Proportional QoS in Elastic Optical Networks

Leader : GUSTAVO BITTENCOURT FIGUEIREDO
MEMBRES DE LA BANQUE :
GUSTAVO BITTENCOURT FIGUEIREDO
ANDRE COSTA DRUMMOND
ANDRE CASTELO BRANCO SOARES
Data: 27 juil. 2018

Afficher le Résumé

Data collected in recent years show that the amount of information traveling in the network core has been increasing at an exponential rate, evidencing the inability of Wavelength-division Multiplexing networks to meet such demand. Elastic Optical Networks represent a new approach to dealing with high data traffic, offering bandwidth closer to that requested by users, providing greater use of available optical resources. In scenarios with high traffic volume, requests can be blocked due to the inability of the network to allocate new resources. One of the strategies used to avoid blocking new requests is service degradation, where certain requests may receive less than requested bandwidth so that they can operate without loss. In the current literature it is possible to perceive that most of the studies approach the degradation of service using a static scenario where all the requests are known in advance, resulting in the absence of online strategies that present greater conformity with real scenarios. This paper proposes an online strategy for service degradation using the proportional QoS model. This strategy aims to reduce the likelihood of blockage caused by lack of optical resources and offer better treatment for different classes of service. In addition, pairs of source-destination nodes were modeled as a queuing system operating on the Generalized Processor Sharing policy with Leaky Bucket admission control in order to quantify the impact of degradation on lightpaths. The results show that the proposed strategy can reduce the probability of blocking and provide network operators with greater control among different classes of degraded services.

IVE ANDRESSON DOS SANTOS TOURINHO

Flexible Grouping for Points of Interest Recommendation

Leader : TATIANE NOGUEIRA RIOS
MEMBRES DE LA BANQUE :
FREDERICO ARAUJO DURAO
HELOISA DE ARRUDA CAMARGO
TATIANE NOGUEIRA RIOS
Data: 16 août 2018

Afficher le Résumé
Given the current scenario of massive production of geo-referenced data produced by users of social networks based on location, this work aims to identify patterns of human mobility and model the phenomena related to human behavior in the geographic space. Based on the premise that the preference profile of a user can be characterized by their mobility behavior inferred through their check-ins history, two techniques of recommendation based on fuzzy grouping are proposed, in order to attest the usefulness of the geographical aspects identified in predictive tasks to recommend content

FELIPE GUSTAVO DE SOUZA GOMES
A Computational Infrastructure for Technical Debt Identification and Monitoring
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
GLAUCO DE FIGUEIREDO CARNEIRO
IVAN DO CARMO MACHADO
MANOEL GOMES DE MENDONCA NETO
Data: 27 août 2018

Afficher le Résumé
In software engineering, the concept of Technical Debt (DT) contextualizes the problem of technical tasks not performed in the present, which may incur costs and adversities in the future. Exposure to debt can bring short-term benefits, such as increasing current productivity and reducing the time to launch a product, but this benefit is obtained at the cost of extra effort in the future to resolve the debts incurred. In this context, it is necessary to develop mechanisms to manage technical debts. Identifying and monitoring DT is an important and necessary step in the debt management process. Unfortunately these tasks are complex because they require the analysis of large amounts of information. Different works have been carried out in the development of approaches to identify DT either automatically or semi-automatically. Most of these works are based on code metrics analysis. However, studies have already pointed out that the use of metrics only for the identification of DT items limits the scope of this activity. It is also necessary to consider the developer's point of view. In this direction, the combination of an approach that simultaneously uses metrics, static code analysis, comment analysis, and information visualization, has been shown to be promising for DT identification and monitoring. This work presents the development and evolution of two tools to materialize this approach. The first, called RepositoryMiner, mines software repositories, calculates metrics, detects code smells, analyzes comments and source code, and combines these different means as indicators of technical debts of software modules. The second, called VisminerTD, uses visual techniques to, based on the calculated indicators, support software engineers in the identification and monitoring of technical debts throughout the life cycle of a software project. To show the usability and effectiveness of the approach, two feasibility studies were conducted showing that the tools can support the management of DT in medium-sized software projects.

FELIPPE VIEIRA ZACARIAS
Intelligent colocation of HPC workloads for enhancing server efficiency
Leader : VINICIUS TAVARES PETRUCCI
MEMBRES DE LA BANQUE :
VINICIUS TAVARES PETRUCCI
MARCOS ENNES BARRETO
Rajiv Nishtala
Data: 30 août 2018

Afficher le Résumé
Resource efficiency is crucial for achieving Exascale performance in high performance computing (HPC) systems. Many HPC applications achieve only a fraction of their theoretical peak performance because it is very hard for developers and runtime systems to extract perfectly the best performance from parallel applications. From this perspective, colocating applications is an attractive technique to increase HPC system utilization. However, an application may adversely affect the behavior of another application when sharing critical resources, such as caches and memory bandwidth, in the same server. We argue that server efficiency can be improved by predicting the expected performance slowdown of colocated applications from measured hardware performance counters. This work makes the following contributions: (1) a machine learning model to predict the performance slowdown of co-located applications and (2) an intelligent scheduling scheme deployed in the Slurm manager to allow sharing applications in a server node with minimal degradation for improved performance.

BEATRIZ BRITO DO REGO
The influence of metacommunication on permanence or abandonment in MOOC: an investigation under a semiotic perspective
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
ECIVALDO DE SOUZA MATOS
ARTUR HENRIQUE KRONBAUER
ANDRÉIA LIBÓRIO SAMPAIO
Data: 27 sept. 2018

Afficher le Résumé
MOOC is an acronym for Massive Open Online Course, an online education modality
oering video courses, with dierent ages, backgrounds and nationalities. The MOOCs
have emerged as technological innovations that idealize a democratization of education,
through a course of free courses. This modality of teaching has as characteristic a population
of dierent ages, formations and nationalities. However, MOOCs are subject to a
high rate of their students, corresponding to their main criticisms. You can participate
at any time, a completion rate for most courses of 13%. Some of the conventions students
attend are those related to the technology platform. In this sense, the research had
to investigate the quality of semiotic metacommunication inuencing the permanence or
the abandonment of education in MOOC courses. The research was carried out in four
stages. The rst one consisted in the execution of a literature review, in order to identify
the elements of the nature of the genetics that infuenced the permanence and the abandonment
in MOOC. With the review vote, twenty-four general causes of abandonment in
MOOC were found. The second stage was held a study of documentary events moments:
development of a platform of MOOC; evaluation of the quality of the metacommunication
of the interaction design of a course in this platform; course implementation; and, nally,
to identify and evaluate the ruptures of communicability existing in the case studied. In
the design-time evaluation, a MOOCLICAR technique was applied, which was mitigated.
The data sheet with the data of the course of analysis and analysis of the course was not
carried out. Finally, the Semiotic Inspection (SIM) and Communality Assessment (CEM)
methods were used to identify communication disruptions in the MOOC platform. The
third phase of the session had an analysis of the collected data, relating the evaluation
of the interaction design, the questionnaires, the analysis of forums and the assessment
of communicability. The results can be useful for the development of a set of interaction
design and evaluation guidelines for the MOOC from a communication point of view.

JORGE FERNANDO SILVA PEREIRA FILHO
Variability Based Approach to Generate and Disseminate Emergency Public Communications.
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
MANOEL GOMES DE MENDONCA NETO
CLAUDIO GUIMARAES CARDOSO
KARINA BARRETO VILLELA
Data: 28 sept. 2018

Afficher le Résumé
An emergency is unpredictable by nature, but it presents patterns that can help communicators to anticipate problems and give better responses to emergency situations. Current solutions for emergency public communication focus on the dissemination of a single message through different communication channels. This strategy goes against good communication practices because different audiences have different information needs. Inappropriate public communication messages create noise and can amplify the perception of risk and insecurity. Our research proposes a novel approach for public communication in emergencies. It maps and models variability in emergency communication processes to ensure the flexibility and adaptability of the communication for different emergency scenarios. Our approach aims to facilitate the rapid instantiation of customised and consistent emergency communications, over different channels, for different audiences. Our research was conducted as part of a large research project named RESCUER, which uses crowdsourcing information to manage emergency situations. To evaluate the communication model, a proof of concept, named RESCUER News, was built and evaluated in simulations involving real public and operational forces, over three distinct emergency scenarios.

LUCAS AMPARO BARBOSA

Removing facial expressions in 3D images for recognition purposes

Leader : MAURICIO PAMPLONA SEGUNDO
MEMBRES DE LA BANQUE :
ALEXANDRE DA COSTA E SILVA FRANCO
LUCIANO REBOUCAS DE OLIVEIRA
MAURICIO PAMPLONA SEGUNDO
Data: 17 oct. 2018

Afficher le Résumé
The research carried out has a neural coder model encoder-decoder of images removed by facial functions in 3D images. This model receives a 3D image of the face with or without the keys as input and generates a neutral face as an output. The real is not a device that can be developed but rather perfect an accuracy of 3D facial recognition systems. For ready to, the standard of an error based on a system information in the user? Experiments using a Bosporus 3D database had a difference in range between images of the same and different modes of interaction between intraclass and interclass differences. They were also shown in our synthetically generated neutral surveys and the results of our recognition methods, thus reaching the original.

MARCO ANTONIO PARANHOS SILVA
Variability Implementation: An Empirical Study of Stack Overflow Posts
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
RODRIGO ROCHA GOMES E SOUZA
EDUARDO SANTANA DE ALMEIDA
RODRIGO OLIVEIRA SPINOLA
Data: 12 nov. 2018

Afficher le Résumé
In this dissertation, an empirical investigation was carried out aiming at understanding how the theme variability implementation is discussed by the community of software developers who use the Stack Overflow website. Initially, a list of 52 words related to this theme was defined based on the literature and the opinion of experts. They were then used as the initial vocabulary for the algorithm that implements the Latent Dirichlet Allocation (LDA) in order to identify the main topics related to variability implementation, as well as to validate the words in the list. Then, with the most used words in the questions and answers, as validated by LDA, searches were performed on the dataset retrieved from the Stack Overflow platform and several studies were carried out to find the best set to be analyzed.

The best result was obtained by combining these words two by two, and thus 1962 questions were retrieved. These 1962 questions were then analyzed with the objective of discover the mechanisms of variability implementation more discussed, less discussed and not discussed; what is the delay in responding to the formal questions on variability implementation and how long the discussions last; and finally, what is the profile of Stack Overflow users who asked and answered the questions retrieved, as well as the degree of confidence one can have about the quality of the questions and answers presented. One of the main results of this research was the definition of a list of words that refer to variability implementation concepts and techniques that are most commonly used by the Stack Overflow community. Other one are the research results from analyzing the questions and answers selected, such as: two groups of words were identified as those most used by the Stack Overflow community: one more conceptual and one more related to variability implementation techniques; the average delay time for the first answer to the questions and the duration of the discussion when there is more than one answer; and finally an study on the degree of confidence in the Stack Overflow user group who asked questions or answered questions based on the reputation indicator of community members.

LEANDRO MANUEL PEREIRA SANCHES
Semantic Annotation of Learning Objects in Portuguese
Leader : LAIS DO NASCIMENTO SALVADOR
MEMBRES DE LA BANQUE :
CLARISSA CASTELLÃ XAVIER
DANIELA BARREIRO CLARO
MARLO VIEIRA DOS SANTOS E SOUZA
Data: 13 nov. 2018

Afficher le Résumé
In Brazil, e-learning is rapidly expanding and Learning Objectives (OA) are one of the forms of development for the teaching modality. One of the main advantages of OA is the possibility of reusing digital didactic content on the internet. In the case of re-use, OA must be available and accessible to teachers and students. The availability of an OA can be influenced by factors such as: difficulty in adding metadata to OA; lack of standardization and structuring of metadata; lack of semantic description of objects; the difficulties of adding semantic metadata to OA and a lack of machine understanding of OA content. Semantic Web technologies aim to structure the data present on the Web in a machine-readable way. In the educational context, these technologies allow to annotate semantic metadata, based on ontologies, in Auxiliary Learning Objects in their recovery and reuse. However, a manual annotation of semantic metadata is the author by authors as an expensive, mentally demanding and time consuming task. Since this is a document that seeks to discover and update the information of a manual, this study believes that it is an automated option and a semantic metadata annotation are important themes for a web-based education. Thus, this work proposes a new model for extracting and annotating semi-automatic semantic metadata from Learning Objects in Portuguese. This model uses the metadata Metadata for Learning Objects (LOM) and technologies for the Semantic Web as a language and a resource structure (RDF). The present study was different from other studies that carried out a semantic annotation of OA in other languages and its main differential is an application of a system of information extraction and disambiguation that aimed at the extraction of semantic relations rather than the identification of only entities. This allows us to note both the relationship and the relationships between the entities in the text. For the evaluation of this model, one of the main and most recent methods of disambiguation of performance descriptors was evaluated and evaluated. The annotation prototype was able to extract the textual OA semantics with a maximum accuracy of 37.5%. The annotation prototype was also able to convert the extra relations into semantic LOM metadata in the RDF language.

EDILTON LIMA DOS SANTOS
SHE: AN AUTOMATED SUPPORT FOR DYNAMIC SOFTWARE PRODUCT LINES ENGINEERING
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
IVAN DO CARMO MACHADO
CASSIO VINICIUS SERAFIM PRAZERES
EDSON ALVES DE OLIVEIRA JUNIOR
Data: 26 nov. 2018

Afficher le Résumé
Dynamic Software Product Lines (DSPL) engineering makes it possible to deliver software products capable of adapting to fluctuations in user needs and evolving resource constraints at runtime. Based on an architecture-centric approach, a DSPL is capable to analyze changes in context, thus responding by performing system adaptations. In this work, we propose an architecture model for DSPL engineering based on the MAPE-K model concept, a feasible strategy to gather and analyze contextual information thus enabling product reconfiguration at runtime. We evaluated the proposed architecture model in a Smart Home System, using OSGi and MQTT. A proof of concept of the proposed model was performed and the results demonstrated capability the DSPL to adapt and reconfigure itself according to the new sensors connected to the system. In addition, the framework built for proof of concept was evaluated by industry professionals and students of masters and doctorate. They consider the architecture to be highly modular and which has a high capacity for reuse and adaptability.

ANNA LUÍZA DE OLIVEIRA BARBOSA
Domain Specific Languages built from Software Product Lines: An Empirical Study on Visual and Textual Approaches
Leader : EDUARDO SANTANA DE ALMEIDA
MEMBRES DE LA BANQUE :
EDUARDO SANTANA DE ALMEIDA
RODRIGO ROCHA GOMES E SOUZA
ANDRÉ LUIS MEDEIROS SANTOS
Data: 26 nov. 2018

Afficher le Résumé
Generic or specific solutions can be applied in all branches of science and engineering. A generic approach provides a general solution for many problems in a certain area, but this solution may be suboptimal. A specific approach provides a much better solution for a smaller set of problems. Domain Specific Languages (DSL) are means, not goals, targeted at the final purpose of simplifying the software development process. They can be classified as internal, written in a host language or external, separate languages which bring their own syntax. An external DSL needs a language, compiler or interpreter, and an editor to support the new language. An DSL can be displayed in a visual and/or textual notation. This study proposes to implement a textual and a visual DSL for the Conference Management (CM) and the Smart Home (SH) Domains, using legacy product lines and integrate the DSL development with Software Product Lines Engineering (SPLE). Additionally, we conducted an exploratory study with the system’s original developers aiming at validating our DSL and characterizing barriers and benefits to software development using DSL as well as an empirical study conducted to compare the textual and the visual DSL regarding readability, effectiveness, efficiency and ease of use. The hypothesis test demonstrated better ease of use and efficiency when reporting bugs using the visual notation while the other metrics did not present statistical difference. However, the data analysis using descriptive statistics showed evidence that the visual notation had better performance when reporting bugs, while the textual notation is faster when changing DSL programs. Regarding the participants’ sense of success in completing the activities, the report of bugs using the textual notation had a better outcome than they expected.

DENIVAN DO CARMO CAMPOS DA SILVA
Combinatorial Interaction Testing Tools for Software Product Lines Engineering
Leader : IVAN DO CARMO MACHADO
MEMBRES DE LA BANQUE :
RODRIGO ROCHA GOMES E SOUZA
IVAN DO CARMO MACHADO
WESLEY KLEWERTON GUEZ
Data: 28 nov. 2018

Afficher le Résumé
Testing a system is a routine activity and plays an important role in the software quality assurance process.
However, testing highly-configurable systems, such as Software Product Lines (SPL), is a very complex activity due to
the presence of variability in its engineering process, which increases the number of product configurations to test. In
case a defect affects one (or a subset) of these functionalities, a range of products (and not just one, such as
traditional Software Engineering) may be affected. Such complexity also implies a significant increase in the cost of
testing. In this sense, one of the major challenges is to reduce this cost and the use of test automation tools could
bring a significant contribution to achieve such a reduction. Among the most effective techniques, combinatorial
interaction testing (CIT) stands out, as an affordable strategy seeking to optimize the configuration space to test. This
study aimed to analyze the effectiveness of the tools that implement the CIT technique considering a set of metrics
applied to the context of variability testing. This dissertation introduces the MERCI, a Method to Evaluate Combinatory
Interaction Testing techniques. MERCI is aimed to evaluate the suitability of existing CIT tools, largely employed in
single-system testing, for SPL engineering. In this work, we considered three measures: defect detection, test
coverage and test execution length. We performed an empirical evaluation to compare four CIT tools: ACTS, CATS,
PICTMaster and VPTag. The results show that the method can serve as a good indicator to evaluate how CIT tools
could behave in an SPL testing scenario.

LEANDRO SOUZA DE OLIVEIRA

DptOIE: a method for Open Information Extraction in the Portuguese language based on Dependency Analysis

Leader : DANIELA BARREIRO CLARO
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
MARLO VIEIRA DOS SANTOS E SOUZA
RENATA VIEIRA
Data: 7 déc. 2018

Afficher le Résumé
It is estimated that 80% of the information on the Web is stored and are textual that can add value to various businesses. However, most of the index is heterogeneous. In this context, all the information that comes manually daily is a slow and unfeasible task. For this reason, as research in the area of Information Extraction (EI) has advanced in recent years. The EI has a function of generating the data in a process by means of Relation Extraction (ER) techniques. A relationship can connect a couple of entities. The normal ones are the words related to the syntax and can be of the type: "born in", "is a", "ran", and so on. EI's arguments focus on small, homogeneous, and early commands. However, this type of approach is not scalable for larger databases, that is, new rules for hiring new ways to become incremental documents or odomains to change. Thus, a new EI approach emerged, an Open Information Extraction (EIA). It allows you to extract facts, usually in detriplas form, without the need to pre-specify relationships, being more scalable and domain independent. The EIA can be applied in the area of business intelligence, opinion mining, question and answer systems, among others. Initially, most EIA methods were developed for the English language. However, other languages, such as Portuguese, also deserve prominence, since it comprises approximately 2.5% of all content available on websites. The EIA systems that have presented one of the best performances in English have used dependency analysis and are based on rules, which are used to identify a set of information in the sentences to compose the facts, such as: subjects, objects, adverbs, complements , etc. However, in the Portuguese language, the same approach has not shown the same effect, since the established rules are often generic and do not cover specific aspects of the language. For this reason, in this work the DptOIE is proposed, which uses new rules and aims to explore sentences through dependency analysis and an in-depth search. In addition, modules have also been developed to perform particular treatments and derive new facts from sentences that have coordinated conjunctions, subordinate clauses and wording. The DptOIE has been compared to three other EIA methods that have stood out in the Portuguese language, which are: PragmaticOIE, DependentIE and ArgOE. The results confirmed that the DptOIE obtained area under the curve of the greater precision-yield and was superior in the quantity of coherent facts extracted, besides maintaining high precision.

NILSON RODRIGUES SOUSA
M2-FoT: Monitoring and Managing Mist Platforms of Things
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
CASSIO VINICIUS SERAFIM PRAZERES
IVAN DO CARMO MACHADO
CELSO ALBERTO SAIBEL SANTOS
Data: 10 déc. 2018

Afficher le Résumé
The Internet of Things (IoT) is a paradigm in which physical objects embedded with sensors and actuators are connected to the Internet, enabling the creation of solutions for various areas of human life. As the amount of things connected to the Internet tends to increase more and more, new ways of managing and monitoring these things are needed. In this context, this work aims to propose and develop a method of management and monitoring of the elements that make up an IoT environment. The proposed method covers activities of monitoring and managing the configuration and states of the components of the IoT environment. In order to achieve this objective, this work uses the ARM-IoT (Architectural Reference Model for the IoT) reference model, which enables the listing of common elements that are part of an IoT environment and require management and monitoring. The approach developed is based on the Simple Network Management Protocol (SNMP), which uses a manager / agent model, where a manager concentrates the most complex management actions and the agent is responsible for collecting information about the infrastructure. Thus, this master's work proposes M2-FoT: a method, based on ARM-IoT and SNMP, that uses a management approach in which a server (acting as an SNMP manager) concentrates the most robust actions and delegates actions to the infrastructure through agents that are hosted on IoT gateways (acting as SNMP agents). The proposed method was implemented and validated using the SOFT-IoT (Self-Organizing Fot of Things for the IoT) platform, which implements several of the concepts described in ARM-IoT. Given this context, this work presents the M2-FoT that is composed by IoTManagerServer and IoTManagerClient, similar to the manager model and SNMP protocol agent. This allows the management and monitoring of IoT environments. The implementation of M2-FoT was tested within the SOFT-IoT platform and the evaluated aspects were: hardware consumption by the devices; response efficiency of M2-FoT in the recovery of the infrastructure configuration; and the accuracy of the information stored by M2-FoT.

MARCELO AIRES VIEIRA
Interoperability between heterogeneous clouds across clouds
Leader : DANIELA BARREIRO CLARO
MEMBRES DE LA BANQUE :
DANIELA BARREIRO CLARO
VANINHA VIEIRA DOS SANTOS
RONALDO DOS SANTOS MELLO
Data: 11 déc. 2018

Afficher le Résumé
With the media database, many organizations have begun providing and delivering data through services such as Data as a Service (DaaS) and Database as a Service (DBaaS). Different data sources for certain applications, such as Relational, NoSQL and NewSQL, or not yet structured (plain text), semi-structured (XML, JSON, CSV) and structured (RDBMS). Inherent heterogeneity of DaaS and DBaaS makes it difficult to merge services and unified use by an application. Thus, the present work is a method that aims to interoperate the heterogeneous DaAS and DBaaS data through the data junctions. The method called MiDIJo (Middleware for Data Interoperability via Join) allows a unified model to interoperate between different models. MiDIJo recognizes the query, the required join type analysis, and lists the join attributes. MiDIJo was formally discontinued through canonical models and incorporated into MIDAS middleware in order to validate its performance and effect. A use case and three experiments were performed with a MiDIJo validation purpose: (i) the first experiment selects and obtains data in different DaaS and DBaaS; (ii) the second associates data between different DaaS and DBaaS (junction); Finally, (iii) the third considers different types of DBaaS (Relational, NoSQL and NewSQL) with two joins. The use case and the experiments allowed the viability, functionality, interoperability and performance of the MiDIJo method. The results are demonstrated through a methodology that allows selecting and aggregating data in the clouds for the complexity of the interoperability and junction problem.

FABIO CORREIA DE REZENDE
The Computational Reasoning in the development of English Language Skills: pedagogical practices with BYOD
Leader : ECIVALDO DE SOUZA MATOS
MEMBRES DE LA BANQUE :
DEBORA ABDALLA SANTOS
ECIVALDO DE SOUZA MATOS
TACIANA PONTUAL DA ROCHA FALCAO
Data: 13 déc. 2018

Afficher le Résumé

With the aim of contributing to the discussions and advances in the researches on the Brazilian public education, this research had as object of study the computational reasoning. The research sought to understand and discuss whether computational reasoning favors the development of disciplinary competences in English language teaching through pedagogical actions through the Bring Your Own Device (BYOD) approach.

CARLOS ALEXANDRE NASCIMENTO

Integration between Computer Science and Portuguese Language through Computational Thinking Skills

Leader : DEBORA ABDALLA SANTOS
MEMBRES DE LA BANQUE :
DEBORA ABDALLA SANTOS
ECIVALDO DE SOUZA MATOS
Ismar Frango Silveira
Data: 13 déc. 2018

Afficher le Résumé

Discussions about the insertion of Computer Science in basic education and how it can contribute in the processes of teaching learning that involve the use of digital information and communication technologies are in increasing ascension in researches in the area of Informatics and Education. The present work aimed to investigate the possible contributions of Computational Thinking to the teaching and learning of Portuguese Language in Basic Education. For this, interventions were made in the teaching of contents of Portuguese Language through didactic sequences that contemplated the association of the discipline to Computer Science. The data collected, through focus groups, interviews and journals were analyzed qualitatively based on the literature. The results showed that the integration between the Computational Thinking skills and the contents explored during the Portuguese Language classes is feasible and that this integration contributes to the teaching and learning of this discipline, as well as to promote Computer Science in Basic Education.

PAULO ROBERTO MAIA SIMÕES JUNIOR
USING AGGREGATION AND SUMMARIZATION PROCESSES TO ENHANCE CROWDSOURCE BASED EMERGENCY MANAGEMENT SYSTEMS
Leader : MANOEL GOMES DE MENDONCA NETO
MEMBRES DE LA BANQUE :
MANOEL GOMES DE MENDONCA NETO
DANILO BARBOSA COIMBRA
FLAVIO EDUARDO AOKI HORITA
Data: 14 déc. 2018

Afficher le Résumé
Emergencies may bring great loss for human beings and for the environment. Whether they are results of human actions or forces of nature, these situations ask for a swift response to minimize their adverse consequences. People that witness an emergency are a valuable source of information to those responsible for making decision on how to respond to it. Through crowdsourcing, command centers can obtain multiple types of information from witness at the emergency site. By using smartphones, for example, these people can send information directly (e.g. texts, videos, images) or indirectly (e.g. GPS), that may help in the identification of important situations at the emergency site. However, the gathered data must be processed and summarized, as the emergency managers may not be able to analyze all the obtained information, which gives space to loss of important information. Researchers have been studying and proposing information visualization tools that may help the emergency managers to understand the high volume of information gathered through crowdsourcing. Some of those tools had been used, and proven useful, in real life large scale emergencies. However, a recent systematic mapping study shows that most researchers use similar techniques, focused on maps and icons, to visualize emergency information in a high level (e.g. affected area, type of emergency). Emergency managers usually have to filter and search the available data to obtain more information about the situation at hand. This work presents a way to aggregate emergency data, gathered through crowdsourcing, and to summarize it using information visualization focused on presentation of emergency details. The presented visualizations, should support the emergency managers decision making process, through visual paradigms that facilitate a swift understanding of the situation at the emergency site. This research was within the context of a collaboration project between Brazil and European Union and had the support of emergency management specialists from both places.

FELIPE OLIVEIRA DOS SANTOS

A multiagent epistemic logic that integrates classical knowledge and constructive knowledge.

Leader : STEFFEN LEWITZKA
MEMBRES DE LA BANQUE :
CIRO RUSSO
LAIS DO NASCIMENTO SALVADOR
STEFFEN LEWITZKA
Data: 14 déc. 2018

Afficher le Résumé

EpistEpistemic Logic is a branch of logic that deals with notions of knowledge and belief. Through its study, it is possible to formalize and make inferences about the knowledge of one or more agents. Due to the recent applications in Computer Science, the epistemic logic has been gaining new contributions and with them new problems and challenges arise. In this work we will present a new epistemic multiagent logic based on the EL5 logic originally introduced by Lewitzka (2017) as a modal system capable of representing both properties of classical knowledge as well as characteristics of an intuitionist (constructive) knowledge that conforms to the BHK interpretation of intuitionist logic. This connection with constructive reasoning is interesting from the point of view of computer science. The logic presented in this paper generalizes and extends EL5 to a multiagent logic with operators for common knowledge of agent groups. A certain concept of common knowledge is adopted and modeled by an algebraic semantics (based on Heyting algebras) rather than the commonly used Kripke semantics, thus giving a natural and intuitive approach. Finally, we prove the correctness and completeness of this new multiagent logic with respect to the proposed algebraic semantics.emic Logic is a branch of logic that deals with notions of knowledge and belief. Through its study, it is possible to formalize and make inferences about the knowledge of one or more agents. Due to the recent applications in Computer Science, the epistemic logic has been gaining new contributions and with them new problems and challenges arise. In this work we will present a new epistemic multiagent logic based on the EL5 logic originally introduced by Lewitzka (2017) as a modal system capable of representing both properties of classical knowledge as well as characteristics of an intuitionist (constructive) knowledge that conforms to the BHK interpretation of intuitionist logic. This connection with constructive reasoning is interesting from the point of view of computer science. The logic presented in this paper generalizes and extends EL5 to a multiagent logic with operators for common knowledge of agent groups. A certain concept of common knowledge is adopted and modeled by an algebraic semantics (based on Heyting algebras) rather than the commonly used Kripke semantics, thus giving a natural and intuitive approach. Finally, we prove the correctness and completeness of this new multiagent logic with respect to the proposed algebraic semantics.

VANA HILMA VELOSO CARVALHO
ANALYSIS OF THE ASPECTS OF ACCEPTANCE AND USE OF THE INSTITUTIONAL REPOSITORY OF THE FEDERAL UNIVERSITY OF BAHIA (RI-UFBA) BASED ON THE UTAUT MODEL.
Leader : DEBORA ABDALLA SANTOS
MEMBRES DE LA BANQUE :
DEBORA ABDALLA SANTOS
LAIS DO NASCIMENTO SALVADOR
FLAVIA GOULART MOTA GARCIA ROSA
Data: 18 déc. 2018

Afficher le Résumé
Institutional Repositories are increasing references for scientific communication with the purpose of storing, disseminating and preserving the academic and scientific production of an institution. These web information systems are based on the movements of open technologies, for demanding that the files deposited in the repository are provided in open access and free access licenses, with the purpose of disseminating their content to society in a collaborative and democratic manner. This research brings the analysis of the factors that influence the acceptance and use by the academic and scientific community of UFBA to its institutional repository. The Unified Theory of Acceptance and Use of Technology (UTAUT) information model was used to collect data, as well as the Think Aloud method, a technique related to the usability of the system, to observe the user behavior in the use of IR -UFBA. In the analysis of the results obtained through the UTAUT model, the Generalized Linear Model (MLG) was used, statistical technique of data analysis. The research response is presented through the user-identified comments in the Think Aloud method, and especially in the statistical indicators regarding the acceptance and use of the technology that allow to offer subsidies to improve the studied system.

DANIEL LUIS MOREIRA TIMPONI
Characterization of the development of multiplatform game libraries
Leader : RODRIGO ROCHA GOMES E SOUZA
MEMBRES DE LA BANQUE :
IVAN DO CARMO MACHADO
RODRIGO ROCHA GOMES E SOUZA
SANDRO SANTOS ANDRADE
Data: 18 déc. 2018

Afficher le Résumé
"As new operating systems and hardware devices emerge and become popular, game developers are looking for libraries to help them build multiplatform games for the purpose of streamlining the development process and getting a product compatible with the most popular platforms In order to characterize the development of multiplatform game libraries, we have analyzed the log of commits of five libraries of free games written in C / C ++ to extract information about changes in the source code and the contribution of the development team We found that developers rarely change platform-independent and platform-specific code together in the same commit The vast majority of changes occur on a timely basis in the standalone code or in the specific code of each platform.
In addition, we found a growth in the ratio of generalist developers in 3 of 5 of the analyzed libraries. Our findings can help developers and project managers improve the development and maintenance of cross-platform systems by providing empirical evidence on source code changes and developer knowledge of platforms over time. "

PEDRO OLIVEIRA RAIMUNDO
Low-cost 3D Reconstruction of Cultural Heritage
Leader : KARL PHILIPS APAZA AGUERO
MEMBRES DE LA BANQUE :
KARL PHILIPS APAZA AGUERO
ANTONIO LOPES APOLINARIO JUNIOR
ANTONIO CARLOS DOS SANTOS SOUZA
Data: 19 déc. 2018

Afficher le Résumé
The 3D reconstruction process follows a well-defined pipeline to generate a model that represents the geometry and appearance of an object captured using 3D scanners or other acquisition devices. The main steps of the commonplace 3D reconstruction pipeline are: a) acquisition of depth and color images of the object, b) alignment of the views acquired from different angles, c) integration of the acquired information into a single model, and d) synthesis and visualization of a textured 3D model. The models generated by following these steps are used in many areas where it is necessary to reconstruct real objects with nontrivial geometry such as engineering, architecture and museology. The present work focuses on the usage of low-cost 3D reconstruction to preserve cultural heritage artifacts, this is important not only for preservation purposes, but also as a means to study and transmit the values and traditions of the communities. In this context, we surveyed the literature for limitations of low-cost 3D reconstruction pipelines, and subsequently proposed pipeline specializations to leverage the characteristics of low-cost 3D acquisition devices and circumvent their limitations. The resulting reconstruction pipeline improved the quality of the data acquired using low-cost 3D scanners, streamlined some of the remaining following phases of the reconstruction process, and employed realism techniques to improve the appearance of the reconstructions. Several heritage artifacts from the Federal University of Bahia Museum of Archaeology and Ethnology were reconstructed using proposed approach, which yielded good results in terms of visual appearance. Thus, we conclude that low-cost 3D preservation of heritage is viable given a proper combination of pipeline specializations and realism techniques.

	Thèses
1	JURANDIR DA CRUZ BARBOSA Self-Organization and Balancing Services in the Mist of Things. Leader : CASSIO VINICIUS SERAFIM PRAZERES MEMBRES DE LA BANQUE : CASSIO VINICIUS SERAFIM PRAZERES PAULO NAZARENO MAIA SAMPAIO VINICIUS TAVARES PETRUCCI Data: 18 déc. 2017 Afficher le Résumé This work proposes a solution for self-organization and balancing services of Mist of Things. Mine Computing, considered an extension of IoT, has advantages such as availability at the local level, facilities in access and even interoperability that facilitates communication between different. In this context, this work uses a functional solution based on the modularity proposed by the OSGi specification, Web Services, technologies and protocols for self-organization. In this way, existing services in an IoT infrastructure were separated into groups according to the organization of the Fog of Things (FoT) paradigm. To self-organize the services in their respective groups, we used clustering technologies based on the modularity of OSGi. In addition, the OptaPlanner solution was used to find a better arrangement between the distribution of services to be balanced. The infrastructure developed and implemented in this work uses a topology called Network of Nodes, which employs a balancing policy that was developed to adapt to the needs of the network, aiming at a lower infrastructure overhead and ensuring the minimum of essential services. In order to validate the proposal, the Quality of Service (QoS) of the IoT infrastructure used was evaluated, calculating its performance when meeting its requests in processing time and acceptable memories. It was also possible to evaluate the availability of the network services, as well as the response time of the requests after simulation of node failure, according to the scenario created in the IoT network. The analysis of the results obtained in the experiments performed shows a good performance in terms of the response time of the requests, availability of services in eventual failures and improvement of the QoS of the IoT infrastructure used in the work.

Thèses

JURANDIR DA CRUZ BARBOSA
Self-Organization and Balancing Services in the Mist of Things.
Leader : CASSIO VINICIUS SERAFIM PRAZERES
MEMBRES DE LA BANQUE :
CASSIO VINICIUS SERAFIM PRAZERES
PAULO NAZARENO MAIA SAMPAIO
VINICIUS TAVARES PETRUCCI
Data: 18 déc. 2017

Afficher le Résumé
This work proposes a solution for self-organization and balancing services of Mist of Things. Mine Computing, considered an extension of IoT, has advantages such as availability at the local level, facilities in access and even interoperability that facilitates communication between different. In this context, this work uses a functional solution based on the modularity proposed by the OSGi specification, Web Services, technologies and protocols for self-organization. In this way, existing services in an IoT infrastructure were separated into groups according to the organization of the Fog of Things (FoT) paradigm. To self-organize the services in their respective groups, we used clustering technologies based on the modularity of OSGi. In addition, the OptaPlanner solution was used to find a better arrangement between the distribution of services to be balanced. The infrastructure developed and implemented in this work uses a topology called Network of Nodes, which employs a balancing policy that was developed to adapt to the needs of the network, aiming at a lower infrastructure overhead and ensuring the minimum of essential services. In order to validate the proposal, the Quality of Service (QoS) of the IoT infrastructure used was evaluated, calculating its performance when meeting its requests in processing time and acceptable memories. It was also possible to evaluate the availability of the network services, as well as the response time of the requests after simulation of node failure, according to the scenario created in the IoT network. The analysis of the results obtained in the experiments performed shows a good performance in terms of the response time of the requests, availability of services in eventual failures and improvement of the QoS of the IoT infrastructure used in the work.

Resultados de tradução