Upload link_completeness function and test. Also internal functions #34
Pakillo merged 5 commits into EcologyR:main
Conversation
Summary of Changes

Hello @jmalcan, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request significantly expands the package's capabilities by introducing a robust mechanism to assess the completeness and coverage of sampled ecological interactions. It provides a new user-facing function, link_completeness, along with a large set of internal helper functions.
Code Review
This pull request adds a new link_completeness function with associated tests, and a large number of internal helper functions. The addition of tests is great for ensuring the functionality is correct.
My review focuses on improving code clarity, efficiency, and fixing a few bugs in the new internal functions. I've noticed a few patterns across the new code in R/All_internal.R:

- Many for loops are used for data manipulation where vectorized R operations would be much more efficient and idiomatic.
- The ifelse() function is sometimes used for control flow, which is an anti-pattern.
- require() is used inside functions, which is generally discouraged.
I've left specific comments with suggestions for these. I also found a critical bug in visu_funtopol_UNI that would cause a runtime error, and a bug in RN_heatmap_UNI where a parameter is ignored.
Finally, for consistency, it would be good to write the test descriptions in English, as the rest of the project seems to follow this convention.
nodes_list <- funtopol_UNI(int_data)$Functional_classification
if(max(SCCs$csize)>1){
The variable SCCs is used here but it is not defined in the scope of the visu_funtopol_UNI function. This will cause a runtime error. It seems this logic was copied from funtopol_UNI where SCCs is calculated internally. To fix this, you should get the required information from the result of the funtopol_UNI call. For example, you could check the number of core species from the 'Descriptors' data frame returned by funtopol_UNI.
funtopol_output <- funtopol_UNI(int_data)
nodes_list <- funtopol_output$Functional_classification
if (funtopol_output$Descriptors["Num. core species", "Value"] > 1) {

RNc$Ac <- RNc$Canopy
RNc$Ar <- RNc$Recruit
for (i in 1:dim(RNc)[1]) {
  RNc$Ac[i] <- as.numeric(replace(
    RNc$Ac[i],
    match(RN_list, RNc$Canopy[i]),
    cover_df$Ac[match(RNc$Canopy[i], cover_df$Canopy)]
  ))
}

for (i in 1:dim(RNc)[1]) {
  RNc$Ar[i] <- as.numeric(replace(
    RNc$Ar[i],
    match(RN_list, RNc$Recruit[i]),
    cover_df$Ac[match(RNc$Recruit[i], cover_df$Canopy)]
  ))
}
The for loops used to populate the Ac and Ar columns are inefficient as they process the data frame row by row. In R, it's best practice to use vectorized operations for this kind of task, as they are significantly faster and more readable. You can achieve the same result more efficiently by using match on the entire vectors.
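As a quick illustration of the vectorized lookup, here is a toy run with hypothetical species names and cover values (not data from the package):

```r
# Hypothetical cover table and canopy-recruit table
cover_df <- data.frame(Canopy = c("Pine", "Oak", "Open"),
                       Ac     = c(12.5, 8.0, 100.0))
RNc <- data.frame(Canopy  = c("Oak", "Pine", "Open"),
                  Recruit = c("Pine", "Oak", "Oak"))

# match() gives, for each Canopy value in RNc, its row index in cover_df;
# indexing cover_df$Ac with that vector performs the whole join at once.
RNc$Ac <- cover_df$Ac[match(RNc$Canopy,  cover_df$Canopy)]
RNc$Ar <- cover_df$Ac[match(RNc$Recruit, cover_df$Canopy)]

RNc$Ac  # 8.0 12.5 100.0
RNc$Ar  # 12.5 8.0 8.0
```

This replaces both row-by-row loops with two one-line lookups.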
RNc$Ac <- cover_df$Ac[match(RNc$Canopy, cover_df$Canopy)]
RNc$Ar <- cover_df$Ac[match(RNc$Recruit, cover_df$Canopy)]

Effect_int <- c()
for(i in 1:n_tests) {
  ifelse((df$testability[i]>0.05),
    Effect_int[i] <- "Not testable",
    ifelse(df$Significance[i] > 0.05,
      Effect_int[i] <- "Neutral",
      ifelse((df$Fcr[i]/df$Ac[i])>(df$Fro[i]/df$Ao[i]),
        Effect_int[i] <- "Enhancing",
        Effect_int[i] <- "Depressing")
    )
  )
}
Using nested ifelse for multiple conditions with assignments inside is an anti-pattern in R. It makes the code hard to read, debug, and is inefficient because it grows the Effect_int vector inside a loop. A better approach is to use a single vectorized ifelse or dplyr::case_when. This makes the code more idiomatic, faster, and easier to understand. This comment also applies to other similar loops in this file for calculating Significance and Test_type.
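A toy run of the vectorized form, with hypothetical values chosen so each branch is exercised:

```r
# Hypothetical test results (columns follow the reviewed code)
df <- data.frame(testability  = c(0.50, 0.01, 0.01, 0.01),
                 Significance = c(0.90, 0.50, 0.01, 0.01),
                 Fcr = c(1, 1, 10, 1), Ac = c(1, 1, 1, 1),
                 Fro = c(1, 1, 1, 10), Ao = c(1, 1, 1, 1))

# One vectorized expression replaces the row-by-row loop:
df$Effect_int <- ifelse(df$testability > 0.05, "Not testable",
                 ifelse(df$Significance > 0.05, "Neutral",
                 ifelse(df$Fcr / df$Ac > df$Fro / df$Ao,
                        "Enhancing", "Depressing")))

df$Effect_int  # "Not testable" "Neutral" "Enhancing" "Depressing"
```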
df$Effect_int <- ifelse(df$testability > 0.05, "Not testable",
ifelse(df$Significance > 0.05, "Neutral",
ifelse((df$Fcr/df$Ac) > (df$Fro/df$Ao), "Enhancing", "Depressing")))

RN_heatmap_UNI <- function(int_data, weight_var = c("Fcr", "Dcr", "Icr", "Pcr"), scale_top = 1) {
  require(ggplot2)
  # manually set node order
  canopy_order <- unique(int_data$Canopy)
  canopy_order <- canopy_order[!canopy_order %in% c('Open')]
  canopy_order <- c("Open", canopy_order)
  int_data$Canopy2 <- factor(int_data$Canopy, levels = canopy_order)
  recruit_order <- sort(unique(int_data$Canopy), decreasing = TRUE)
  recruit_order <- recruit_order[!recruit_order %in% c('Open')]
  recruit_order <- c(recruit_order, "Open")
  int_data$Recruit2 <- factor(int_data$Recruit, levels = recruit_order)

  # Add recruitment density as another weighting variable
  int_data$Dcr <- int_data$Fcr/int_data$Ac

  # Make weight variable
  int_data$weight <- int_data[weight_var]

  # Lowest (non-zero) and highest values of the weighting variable
  highest_W <- max(int_data$weight)
  lowest_W <- min(int_data$weight[int_data$weight>0])

  # Plot the heatmap
  ggplot(int_data, aes(Canopy2, Recruit2, fill= Dcr)) +
    geom_tile(colour="gray", size=0.25, aes(height = 1)) +
    scale_fill_gradientn(colours = c("#F5F5F5", "#E69F00","#0072B2"), values = c(0,lowest_W, scale_top*highest_W)) +
    # scale_fill_gradient(low="white", high="turquoise3")+
    scale_x_discrete(position = "top") +
    theme(axis.text.x = element_text(angle = 90, vjust = 0.5, hjust=0))
}
This function has a few issues related to the weight_var parameter:

- It allows multiple values by default, which can lead to errors. It's better to enforce a single choice using match.arg().
- The fill aesthetic in ggplot is hardcoded to Dcr, ignoring the weight_var parameter. The plot will not reflect the chosen weighting variable.
- The weight column is created but not used directly in the plot aesthetics.
Here is a revised version of the function that addresses these points for correctness and clarity.
RN_heatmap_UNI <- function(int_data, weight_var = c("Fcr", "Dcr", "Icr", "Pcr"), scale_top = 1) {
require(ggplot2)
weight_var <- match.arg(weight_var)
# manually set node order
canopy_order <- unique(int_data$Canopy)
canopy_order <- canopy_order[!canopy_order %in% c('Open')]
canopy_order <- c("Open", canopy_order)
int_data$Canopy2 <- factor(int_data$Canopy, levels = canopy_order)
recruit_order <- sort(unique(int_data$Canopy), decreasing = TRUE)
recruit_order <- recruit_order[!recruit_order %in% c('Open')]
recruit_order <- c(recruit_order, "Open")
int_data$Recruit2 <- factor(int_data$Recruit, levels = recruit_order)
# Add recruitment density as another weighting variable
int_data$Dcr <- int_data$Fcr/int_data$Ac
# Lowest (non-zero) and highest values of the weighting variable
weight_values <- int_data[[weight_var]]
highest_W <- max(weight_values, na.rm = TRUE)
lowest_W <- min(weight_values[weight_values > 0], na.rm = TRUE)
# Plot the heatmap
ggplot(int_data, aes(Canopy2, Recruit2, fill = .data[[weight_var]])) +
geom_tile(colour="gray", size=0.25, aes(height = 1)) +
scale_fill_gradientn(colours = c("#F5F5F5", "#E69F00","#0072B2"), values = c(0,lowest_W, scale_top*highest_W)) +
# scale_fill_gradient(low="white", high="turquoise3")+
scale_x_discrete(position = "top") +
theme(axis.text.x = element_text(angle = 90, vjust = 0.5, hjust=0))
}
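The match.arg() idiom used above can be sketched in isolation. The wrapper below is hypothetical (not part of the package) and only shows the default-collapsing and validation behavior:

```r
# match.arg(): the default is the full vector of choices; match.arg()
# collapses it to the first element unless the caller supplies one of
# the allowed values, and errors on anything else.
pick_weight <- function(weight_var = c("Fcr", "Dcr", "Icr", "Pcr")) {
  weight_var <- match.arg(weight_var)
  weight_var
}

pick_weight()        # "Fcr"  (first choice used as default)
pick_weight("Dcr")   # "Dcr"
# pick_weight("bad") # error: 'arg' should be one of "Fcr", "Dcr", ...
```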
R/All_internal.R (Outdated)
if (!"Plot" %in% names(int_data)) stop("ERROR: your interactions data lacks a column named Plots. This function requires data assembled in plots.")

require(igraph)
Using require() inside a function is generally discouraged in R packages. It's better to list package dependencies in the DESCRIPTION file (e.g., in Imports) and then use the :: operator to call functions from that package (e.g., igraph::graph_from_data_frame). Since you are already using igraph::, the require(igraph) call is redundant and can be removed. This applies to other functions in this file that use require().
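The pattern can be illustrated with a base package (stats is always available, so this runs anywhere; for igraph the same idea applies after declaring it under Imports in DESCRIPTION):

```r
# Qualify the call with :: instead of attaching the package with
# require().  The dependency is then explicit at the call site and
# checked at build time rather than at run time.
x <- c(1, 2, 3, 4)
m <- stats::median(x)   # for igraph: igraph::graph_from_data_frame(...)
m  # 2.5
```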
netRaw <- data.frame(cbind(data_raw$Plot, paste(data_raw$Canopy, data_raw$Recruit)))
colnames(netRaw) <- c("Plot", "Pair")
nPlots <- length(unique(netRaw$Plot))

# Check points.

if (nPlots == 1)
  stop(
    "ERROR: your data is not structured in multiple plots. Incidence approach cannot be used. Try the abundance approach."
  )

if (nPlots < 10)
  warning(
    "WARNING: your are using the incidence approach with very few plots. Consider using the abundance approach if appropriate."
  )

# Combine the lists of canopy and recruit species to obtain the total list of canopy-recruit pairs (links) sampled.

a1 <- split(netRaw[-1], f = netRaw[1])
a2 <- lapply(a1, unique)
a3 <- unlist(unlist(a2, recursive = FALSE, use.names = FALSE))
The code for preparing data for incidence analysis can be made clearer and more robust. Using data.frame(cbind(...)) can lead to unexpected type coercion. Creating the data frame with named arguments is safer. Also, the way pairs are extracted and unlisted is a bit convoluted. Using split() on the vector of pairs directly simplifies the logic.
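A toy run of the simplified pipeline, with hypothetical plot-structured data, shows that splitting the pair vector directly gives the per-plot incidence counts:

```r
# Hypothetical canopy-recruit records across two plots
data_raw <- data.frame(
  Plot    = c(1, 1, 1, 2, 2),
  Canopy  = c("Pine", "Pine", "Oak", "Pine", "Oak"),
  Recruit = c("Oak",  "Oak",  "Oak", "Oak",  "Oak"))

netRaw <- data.frame(Plot = data_raw$Plot,
                     Pair = paste(data_raw$Canopy, data_raw$Recruit))

# One pair list per plot, deduplicated within each plot, then flattened:
plot_pairs            <- split(netRaw$Pair, netRaw$Plot)
unique_pairs_per_plot <- lapply(plot_pairs, unique)
a3 <- unlist(unique_pairs_per_plot, use.names = FALSE)

table(a3)  # "Oak Oak" and "Pine Oak" each occur in 2 plots
```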
netRaw <- data.frame(Plot = data_raw$Plot, Pair = paste(data_raw$Canopy, data_raw$Recruit))
nPlots <- length(unique(netRaw$Plot))
# Check points.
if (nPlots == 1)
stop(
"ERROR: your data is not structured in multiple plots. Incidence approach cannot be used. Try the abundance approach."
)
if (nPlots < 10)
warning(
"WARNING: your are using the incidence approach with very few plots. Consider using the abundance approach if appropriate."
)
# Combine the lists of canopy and recruit species to obtain the total list of canopy-recruit pairs (links) sampled.
plot_pairs <- split(netRaw$Pair, netRaw$Plot)
unique_pairs_per_plot <- lapply(plot_pairs, unique)
a3 <- unlist(unique_pairs_per_plot, use.names = FALSE)

/gemini review
Code Review
This pull request introduces a new user-facing function link_completeness with its tests, and a large number of internal functions. While the new functionality is a valuable addition, the implementation in R/All_internal.R has several significant issues that should be addressed. There is a large amount of duplicated code, particularly between _BI and _BI_COMP suffixed functions, and also between the new link_completeness function and its internal counterpart. There are also several instances of inefficient or non-idiomatic R code, such as using loops where vectorization is possible, and incorrect use of ifelse for control flow. Furthermore, some functions call other functions that are not defined, which will cause errors. Finally, package dependencies are not handled correctly using DESCRIPTION and NAMESPACE, but with require() calls inside functions. I've left specific comments with suggestions for improvement. Addressing these points will greatly improve the code's maintainability, performance, and correctness.
node_degrees_BI <- function(int_data,cover_data){

  matrix_Fcr<-RN_to_matrix(int_data,cover_data, int_type="fac", weight="Fcr")
  matrix_Pcr <-ifelse(matrix_Fcr>0,1,0)
  p<-associndex(int_data,cover_data,expand="no",rm_sp_no_cover="onlycanopy" ,threshold_density = 100)
node_degrees_BI_COMP <- function(int_data,cover_data){

  matrix_Fcr<-RN_to_matrix(int_data,cover_data, int_type="comp", weight="RII")
  matrix_Pcr <-ifelse(matrix_Fcr<0,1,0)
  p<-associndex(int_data,cover_data,expand="yes",rm_sp_no_cover="onlycanopy" ,threshold_density = 100)
link_completeness <- function(int_data = NULL,
                              type = c("incidence", "abundance")) {

  stopifnot(
    c("Plot",
      "Canopy",
      "Recruit",
      "Frequency"
    ) %in% names(int_data))

  type <- match.arg(type)

  data_raw <- int_data
  data_RN <- aggr_RN_UNI(data_raw)

  # Completeness based on incidence data.

  if (type == "incidence") {
    netRaw <- data.frame(cbind(data_raw$Plot, paste(data_raw$Canopy, data_raw$Recruit)))
    colnames(netRaw) <- c("Plot", "Pair")
    nPlots <- length(unique(netRaw$Plot))

    # Check points.

    if (nPlots == 1)
      stop(
        "ERROR: your data is not structured in multiple plots. Incidence approach cannot be used. Try the abundance approach."
      )

    if (nPlots < 10)
      warning(
        "WARNING: your are using the incidence approach with very few plots. Consider using the abundance approach if appropriate."
      )

    # Combine the lists of canopy and recruit species to obtain the total list of canopy-recruit pairs (links) sampled.

    a1 <- split(netRaw[-1], f = netRaw[1])
    a2 <- lapply(a1, unique)
    a3 <- unlist(unlist(a2, recursive = FALSE, use.names = FALSE))

    # Table showing the incidence of each canopy-recruit pair in the study site

    a4 <- table(a3)
    linkIncidence <- as.data.frame(a4)
    colnames(linkIncidence) <- c("Pair", "Incidence")

    # Incidence list to be passed to iNEXT

    data_iNEXT <- c(nPlots, sort(linkIncidence$Incidence, decreasing = TRUE))

    # Call to iNEXT to obtain completeness values

    out <- iNEXT::iNEXT(
      data_iNEXT,
      q = c(0, 1),
      datatype = "incidence_freq",
      se = FALSE,
      size = nPlots
    )
    Lobs <- out$AsyEst[1, 1]
    Lest <- out$AsyEst[1, 2]
    Lest_LCL <- out$AsyEst[1, 4]
    Lest_UCL <- out$AsyEst[1, 5]
    Cq0_L <- Lobs / Lest
    Cq1_L <- out$DataInfo[1, 5]
    df <- data.frame(c(nPlots, Lobs, Lest, Cq0_L, Cq1_L))
    colnames(df) <- c("Incidence based estimate")
    rownames(df) <- c("Num. Plots sampled",
                      "Lobs",
                      "Lest",
                      "Completeness Links (q=0)",
                      "Coverage Links (q=1)")
  }

  # Completeness based on abundance or frequency of recruits.

  if (type == "abundance") {

    warning("Abundance-based approach assumes that each individual recruit provides independent data about the canopy-recruit interaction. If conspecific recruits frequently occur aggregated under individual canopy plants, the estimates of completeness and coverage may be severely overestimated."
    )

    # Call to iNEXT to obtain completeness values

    nPlots <- length(unique(int_data$Plot))

    if (nPlots > 9)
      warning(
        "Your data is structured in multiple plots. Incidence-based approach is recommended."
      )

    out <- iNEXT::iNEXT(data_RN$Fcr[which(data_RN$Fcr > 0)], q = 0, datatype = "abundance")
    nPlants <- out$DataInfo[1,2]
    Lobs <- out$AsyEst[1, 1]
    Lest <- out$AsyEst[1, 2]
    Cq0_L <- Lobs / Lest
    Cq1_L <- out$DataInfo[1, 4]
    df <- data.frame(c(nPlants, Lobs, Lest, Cq0_L, Cq1_L))
    colnames(df) <- c("Abundance based estimate")
    rownames(df) <- c("Num. plants (recruits) sampled",
                      "Lobs",
                      "Lest",
                      "Completeness of links (q=0)",
                      "Coverage of links (q=1)")
  }

  return(df)
}
The implementation of this function is identical to link_completeness_UNI in R/All_internal.R. This code duplication is a significant maintainability issue. If any changes are needed in the future, they would have to be applied in two places, which is error-prone. The user-facing function link_completeness should call the internal function link_completeness_UNI instead of reimplementing it.
link_completeness <- function(int_data = NULL,
type = c("incidence", "abundance")) {
link_completeness_UNI(int_data = int_data, type = type)
}

Effect_int <- c()
for(i in 1:n_tests) {
  ifelse((df$testability[i]>0.05),
    Effect_int[i] <- "Not testable",
    ifelse(df$Significance[i] > 0.05,
      Effect_int[i] <- "Neutral",
      ifelse((df$Fcr[i]/df$Ac[i])>(df$Fro[i]/df$Ao[i]),
        Effect_int[i] <- "Enhancing",
        Effect_int[i] <- "Depressing")
    )
  )
}

df$Effect_int <- Effect_int
This loop with nested ifelse calls for side-effects is inefficient and hard to read. This logic can be expressed much more clearly and efficiently using a single vectorized ifelse statement, which is idiomatic R. This avoids the loop and the incorrect use of ifelse for flow control.
df$Effect_int <- ifelse(df$testability > 0.05, "Not testable",
ifelse(df$Significance > 0.05, "Neutral",
ifelse((df$Fcr / df$Ac) > (df$Fro / df$Ao), "Enhancing", "Depressing")))
R/All_internal.R (Outdated)
if (!"Plot" %in% names(int_data)) stop("ERROR: your interactions data lacks a column named Plots. This function requires data assembled in plots.")

require(igraph)
Using require() inside a package function is not recommended. Package dependencies should be declared in the DESCRIPTION file. To use functions from other packages, either use the :: operator (e.g., igraph::union()) or import the package or specific functions using roxygen tags like @import or @importFrom. This ensures that the package dependencies are properly handled and checked when the package is built and installed. require() can lead to subtle bugs if the package is not available. This issue is present in other functions in this file as well (e.g., cum_values_UNI on line 729, visu_funtopol_UNI on line 959, and RN_heatmap_UNI on line 985).
associndex_UNISITE_BI <- function(int_data = NULL,
                                  threshold_density = 100) {

  if (!"Open" %in% int_data$Canopy) stop("ERROR: tests cannot be conducted because your data does not contain a node named Open or it is spelled differently.")

  thr <- threshold_density

  # Assemble the data
  db_inter <- pre_associndex_UNISITE_BI(int_data)

  # Incorporate density of recruitment (recruits/m2) under each canopy species and in open.
  db_inter$Dcr <- db_inter$Fcr/db_inter$Ac
  db_inter$Dro <- db_inter$Fro/db_inter$Ao

  # Retain the interactions with estimated density below the threshold.
  db_inter <- db_inter[which(db_inter$Dcr<thr & db_inter$Dro<thr), ]

  # Obtain the maximum recruitment density for each recruit under the canopy species or in open.
  db_inter$Max_Recr_Density <- pmax(db_inter$Dcr,db_inter$Dro)

  db_inter <- utils::type.convert(db_inter, as.is = TRUE)

  max_rd <- stats::aggregate(Max_Recr_Density ~ Recruit, data = db_inter, FUN = "max")

  # Add a variable max_Recr to each pair indicating the maximum recruitment density of the recruit species in the study site
  Recr_list <- sort(unique(c(db_inter$Recruit)))
  Dens_list <- sort(unique(max_rd$Recruit))
  lack_dens <- setdiff(Recr_list, Dens_list)

  db_inter$max_Recr <- db_inter$Recruit
  for (i in 1:(dim(db_inter)[1])) {
    db_inter$max_Recr[i] <- replace(
      db_inter$max_Recr[i],
      match(Recr_list, db_inter$max_Recr[i]),
      max_rd$Max_Recr_Density[match(db_inter$max_Recr[i], max_rd$Recruit)]
    )
  }

  db_inter <- utils::type.convert(db_inter, as.is = TRUE)

  # Calculate indices Ns, NintC, NintA and RII
  db_inter$Ns <- (db_inter$Dcr - db_inter$Dro)/db_inter$max_Recr
  db_inter$NintC <- 2*(db_inter$Dcr - db_inter$Dro)/((db_inter$Dcr + db_inter$Dro)+abs(db_inter$Dcr-db_inter$Dro))
  db_inter$NintA <- 2*(db_inter$Dcr - db_inter$Dro)/((db_inter$Dro) + abs(db_inter$Dcr-db_inter$Dro))
  db_inter$RII <- (db_inter$Dcr - db_inter$Dro)/(db_inter$Dcr + db_inter$Dro)

  removed <- names(db_inter) %in% c("max_Recr", "Max_Recr_Density")
  db_inter <- db_inter[!removed]
  return(db_inter)

}
The function associndex_UNISITE_BI is almost an exact duplicate of associndex_UNISITE_BI_COMP. The only difference is that associndex_UNISITE_BI_COMP calls pre_associndex_UNISITE_BI_COMP instead of pre_associndex_UNISITE_BI. This duplication makes the code harder to maintain. Consider refactoring these two functions into a single function that accepts the pre-processing function as an argument, or uses a parameter to decide which pre-processing function to call.
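The refactor could look roughly like the sketch below. All names here are illustrative stand-ins, not the package's actual API; the point is only the pattern of selecting the pre-processing step via a mode argument while the shared body appears once:

```r
# Hypothetical merge of two near-duplicate functions: the mode argument
# picks the pre-processing function, and the common logic runs once.
process <- function(x, mode = c("facilitation", "competition")) {
  mode <- match.arg(mode)
  pre  <- switch(mode,
                 facilitation = function(v) v + 1,  # stand-in for pre_..._BI
                 competition  = function(v) v - 1)  # stand-in for pre_..._BI_COMP
  pre(x)  # shared body operates on the pre-processed data
}

process(10)                         # 11
process(10, mode = "competition")   # 9
```

Any future fix to the shared body then lands in exactly one place.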
int_significance_BI <- function(int_data){

  if (!"Open" %in% int_data$Canopy) stop("ERROR: tests cannot be conducted because your data does not contain a node named Open or it is spelled differently.")

  df <- pre_associndex_UNISITE_BI(int_data)
  n_tests <- dim(df)[1]
  df$exp_p <- df$Ac/(df$Ac+df$Ao) # Expected probability of success (i.e. of recruiting under canopy)

  # Testability through Binomial test

  df$Ftot <- df$Fcr+df$Fro

  extreme_p <- c()
  for(i in 1:n_tests){
    extreme_p[i] <- min(df$exp_p[i], 1-df$exp_p[i])
  }
  df$extreme_p <- extreme_p

  testability <- c()
  for(i in 1:n_tests) {
    testability[i] <- binom.test(df$Ftot[i], df$Ftot[i], df$extreme_p[i], alternative ="two.sided")$p.value
  }
  df$testability <- testability

  # Binomial (or Chi square) Test Significance

  Significance <- c()
  for(i in 1:n_tests) {
    ifelse(((df$Fcr[i]+df$Fro[i])*(df$Ac[i]/(df$Ac[i]+df$Ao[i]))<=5 | (df$Fcr[i]+df$Fro[i])*(df$Ao[i]/(df$Ac[i]+df$Ao[i]))<=5),
      Significance[i] <- binom.test(df$Fcr[i], df$Fcr[i]+df$Fro[i], df$exp_p[i], alternative ="two.sided")$p.value,
      Significance[i] <- chisq.test(c(df$Fcr[i], df$Fro[i]), p = c(df$exp_p[i], 1-df$exp_p[i]))$p.value
    )
  }
  df$Significance <- Significance

  Test_type <- c()
  for(i in 1:n_tests) {
    ifelse(((df$Fcr[i]+df$Fro[i])*(df$Ac[i]/(df$Ac[i]+df$Ao[i]))<=5 | (df$Fcr[i]+df$Fro[i])*(df$Ao[i]/(df$Ac[i]+df$Ao[i]))<=5),
      Test_type[i] <- "Binomial",
      Test_type[i] <- "Chi-square"
    )
  }
  df$Test_type <- Test_type
  # if(length(unique(df$Test_type))>1) warning("Different tests were used for different canopy-recruit pairs. Check column Test_type")

  Effect_int <- c()
  for(i in 1:n_tests) {
    ifelse((df$testability[i]>0.05),
      Effect_int[i] <- "Not testable",
      ifelse(df$Significance[i] > 0.05,
        Effect_int[i] <- "Neutral",
        ifelse((df$Fcr[i]/df$Ac[i])>(df$Fro[i]/df$Ao[i]),
          Effect_int[i] <- "Enhancing",
          Effect_int[i] <- "Depressing")
      )
    )
  }

  df$Effect_int <- Effect_int
  drops <- c("exp_p", "Ftot", "extreme_p")
  df <- df[ , !(names(df) %in% drops)]

  return(df)
}
This function int_significance_BI is almost an exact copy of int_significance_BI_COMP, with the only difference being the call to pre_associndex_UNISITE_BI() instead of pre_associndex_UNISITE_BI_COMP(). This code duplication should be avoided to improve maintainability. You could merge them into a single function that takes an argument to determine which preprocessing function to use.
mydata[which(is.na(mydata$Fro)), "Fro"]<-0
mydata[which(is.na(mydata$Ao)), "Ao"]<-min(mydata$Ao, na.rm=T)
Using which() with logical indexing is redundant. You can directly use the logical vector for subsetting. This makes the code more concise and is more idiomatic in R. This pattern also appears in pre_associndex_UNISITE_BI_COMP at lines 1248-1249.
mydata[is.na(mydata$Fro), "Fro"] <- 0
mydata[is.na(mydata$Ao), "Ao"] <- min(mydata$Ao, na.rm = TRUE)
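A toy data frame (hypothetical values) confirms the logical-indexing form behaves the same as the which() version:

```r
# Hypothetical data with missing recruitment frequency and open area
mydata <- data.frame(Fro = c(3, NA, 7), Ao = c(NA, 2, 5))

# A logical vector indexes rows directly; wrapping it in which() is redundant.
mydata[is.na(mydata$Fro), "Fro"] <- 0
mydata[is.na(mydata$Ao),  "Ao"]  <- min(mydata$Ao, na.rm = TRUE)

mydata$Fro  # 3 0 7
mydata$Ao   # 2 2 5
```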